Anthropic's latest AI model identifies 'thousands of zero-day vulnerabilities' in 'every major operating system and every major web browser' — Claude Mythos Preview sparks race to fix critical bugs, some unpatched for decades

Wednesday, April 08, 2026

The capabilities of AI coding agents like Claude Code and OpenAI's Codex are already causing seismic shifts for the software industry, but if Anthropic's latest disclosure is to believed, even more disruption is in the pipe. In a new blog post today, the frontier lab behind Claude revealed that its latest model, Claude Mythos Preview, is so capable at teasing out bugs that it's found "thousands of high-severity vulnerabilities, including some in every major operating system and web browser."

Given Claude Mythos Preview's potentially disruptive and wide-ranging capabilities, Anthropic isn't simply releasing it to the world, consequences be damned. Instead, the lab has convened key players across the software and hardware industries in order to use Mythos's bug-finding prowess to proactively patch the vulnerabilities it exposes before other frontier AI labs are able to deploy models of similar capabilities without similar guardrail s.

Under the umbrella of "Project Glasswing," Anthropic says it's working with Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, Nvidia, and Palo Alto Networks to help those companies secure their products. The lab also says it's extending access to "a group of over 40 additional organizations that build or maintain critical software infrastructure" so that they can benefit from Mythos' capabilities. Beyond industry, the lab says it's working with the United States government to share information about the model's potential for offensive and defensive use in cyberspace and its implications for national security.

Article continues below You may like

Anthropic's alarm stems from both the breadth of Mythos's capabilities and also the subtlety of the exploits it's able to identify and capitalize on. For just one example, the lab's researchers say the model "wrote a web browser exploit that chained together four vulnerabilities, writing a complex JIT heap spray that escaped both renderer and OS sandboxes." That kind of vulnerability chaining might only be within the hands of the most skilled human hackers today, but if a similarly capable AI model were to be released, it might be like handing script kiddies a nuclear weapon.

A graph showing the exploit capabilities of Claude Mythos Preview

(Image credit: Anthropic)

As those same researchers tell it, current versions of Claude are able to identify vulnerabilities well, but usually fail miserably at the task of turning those vulnerabilities into active exploits. Mythos, by contrast, is able to turn a whopping 72.4% of vulnerabilities it identifies into sucessful exploits within the domain of Firefox's JavaScript shell, and it is able to achieve register control in a further 11.6% of attempted attacks.

Anthropic's Frontier Red Team extensively describes the threat that an unbridled Mythos release might have on an unsuspecting software industry, and one example of its internal benchmarking practices vividly illustrates what's at stake: "We regularly run our models against roughly a thousand open source repositories from the OSS-Fuzz corpus, and grade the worst crash they can produce on a five-tier ladder of increasing severity, ranging from basic crashes (tier 1) to complete control flow hijack (tier 5).

With one run on each of roughly 7000 entry points into these repositories, Sonnet 4.6 and Opus 4.6 reached tier 1 in between 150 and 175 cases, and tier 2 about 100 times, but each achieved only a single crash at tier 3. In contrast, Mythos Preview achieved 595 crashes at tiers 1 and 2, added a handful of crashes at tiers 3 and 4, and achieved full control flow hijack on ten separate, fully patched targets (tier 5)."

Anthropic also provides several real-world examples of the kinds of bugs that Mythos has exposed, including a 27-year-old vulnerability in the famously hardened OpenBSD operating system that would have allowed an attacker to crash a system simply by connecting to it, a 16-year-old vulnerability in the foundational FFmpeg library that Anthropic says was "hit five million times by automated testing tools without ever catching the problem," and another exploit chain in the Linux kernel that would allow an attacker to achieve root access to the host system.

WIth a tool so capable of identifying exploits, Anthropic says that it is conducting responsible disclosure of the vulnerabilities it finds, but due to the volume of issues being discovered, the lab says that fewer than 1% of the potential bugs it's uncovered have been fully patched.

Going forward, Anthropic says it will not be making Claude Mythos Preview available for general use, and is instead characterizing much of its behavior through the model's system card. In the longer term, the lab hopes that by making Mythos available to a restricted subset of partners now, it can help lay the groundwork to help those companies and institutions prepare for a world where models of this class do become commonplace.

In any case, it's clear that the growth in capability of frontier AI models isn't slowing down within certa in domains of expertise, and the potentially disruptive effects of those models on the world are just one Hugging Face repository away from wreaking havoc in the wrong hands. We can only hope that labs pursuing similar capabilities with their frontier models are as responsible as Anthropic seems to be in characterizing and mitigating those risks before they cause real-world harm.

Google Preferred Source

Follow Tom's Hardware on Google News, or add us as a preferred source, to get our latest news, analysis, & reviews in your feeds.

Older Posts

Newer Posts

Facebook WhatsApp Twitter Telegram Line Tumblr LinkedIn Pinterest

Blog authors

eSIM Studios

eSIM STUDIOS

Anthropic's latest AI model identifies 'thousands of zero-day vulnerabilities' in 'every major operating system and every major web browser' — Claude Mythos Preview sparks race to fix critical bugs, some unpatched for decades

Share

Blog authors

You may like these posts

#{title}

No comments

Upgrading an MSI Claw 8 EX AI+ handheld gaming PC with a 2TB SSD

MANGMI drops the full specs for its Game Boy-like handhelds

Orange Pi Zero 3W is a tiny Allwinner A733 computer with up to 16GB RAM and PCIe 3.0 support

From Oxygen OS to Color OS: These OnePlus phones and tablets could make the jump