Factlen Deep DiveAI DefenseEvidence PackJun 13, 2026, 1:44 AM· 6 min read· #4 of 4 in technology

Inside Project Glasswing: How Anthropic's Claude Mythos is Autonomously Patching the Internet

Anthropic has deployed its unreleased Claude Mythos AI to a coalition of tech giants to autonomously find and fix thousands of zero-day vulnerabilities. The initiative aims to secure critical global infrastructure before equivalent AI models fall into the hands of threat actors.

By Factlen Editorial Team

Share this story

Enterprise Defenders 40%Frontier AI Developers 35%Government Evaluators 25%

Enterprise Defenders: View AI-driven vulnerability discovery as a necessary force multiplier to overcome chronic technical debt.
Frontier AI Developers: Argue that highly capable models must be restricted and deployed defensively first to prevent catastrophic exploitation.
Government Evaluators: Focus on empirical testing and the urgent need for foundational security hygiene.

What's not represented

· Independent Security Researchers
· Nation-State Threat Actors

Why this matters

For decades, cyber defenders have been losing the race against attackers who only need to find one flaw to breach a system. By using frontier AI to autonomously audit and patch the world's most critical codebases at machine speed, this coalition is fundamentally flipping the economics of cybersecurity in favor of the defenders.

Key points

Anthropic's Claude Mythos AI has demonstrated unprecedented ability to autonomously find and exploit software vulnerabilities.
To prevent misuse, Anthropic restricted the model's public release, launching Project Glasswing to share it exclusively with defensive partners.
The coalition includes major tech firms like Microsoft, Google, and Apple, as well as financial institutions like the NYSE.
In its first month, the initiative uncovered over 10,000 high-severity vulnerabilities in critical software infrastructure.
The UK's AI Security Institute verified the model's capabilities, noting it could execute multi-stage attacks that would take humans days.

10,000+

Critical vulnerabilities found in month one

200

Global partners enrolled in Project Glasswing

$100M

Defensive AI usage credits committed

27 years

Age of OpenBSD bug discovered by Mythos

Average steps completed in 32-step cyber range

For decades, the fundamental math of cybersecurity has heavily favored the attacker. A defensive team must secure millions of lines of code against every conceivable vector, while an adversary only needs to find a single overlooked flaw to compromise a network [8]. In early 2026, the emergence of highly autonomous frontier AI models threatened to accelerate this asymmetry to a breaking point. However, an unprecedented industry mobilization is currently attempting to invert that dynamic, deploying advanced AI to autonomously audit and patch the world's critical infrastructure before threat actors can acquire equivalent capabilities [1, 8].[1][8]

The catalyst for this shift is Claude Mythos Preview, an unreleased frontier AI model developed by Anthropic. Originally designed to push the boundaries of autonomous software engineering, the model's deep comprehension of complex codebases inadvertently resulted in exceptional cybersecurity capabilities [1]. Evidence of these capabilities is strong and empirically verified: rather than merely flagging potential errors like traditional static analyzers, Mythos can autonomously trace data flows, identify zero-day vulnerabilities, and write the functional exploit code required to compromise a system [1, 3].[1][3]

The most rigorous public assessment of the model's offensive potential comes from the United Kingdom's AI Security Institute (AISI). In April 2026, the agency published evaluations demonstrating that Mythos Preview represents a significant escalation in machine-driven cyber performance [3]. The AISI tested the model against "The Last Ones" (TLO), a highly complex 32-step corporate network attack simulation that requires an estimated 20 hours of human labor to complete [3].[3]

The AISI's findings provide concrete evidence of the model's autonomy. Mythos Preview became the first AI model to solve the TLO simulation from start to finish, succeeding in 3 out of 10 attempts and completing an average of 22 out of 32 steps across all trials [3]. The agency concluded that the model is capable of executing multi-stage attacks and autonomously exploiting vulnerabilities in weakly defended enterprise systems—tasks that would traditionally require days of work by human professionals [3].[3]

UK AISI evaluations demonstrated Mythos Preview's ability to autonomously navigate complex network attack simulations.

Recognizing that releasing such a model to the public could trigger a catastrophic wave of automated cyberattacks, Anthropic restricted access to Mythos [1, 4]. Instead, on April 7, 2026, the company launched Project Glasswing, a defensive coalition granting exclusive access to the model to a vetted group of major technology providers [1]. The founding partners include Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, Microsoft, NVIDIA, and Palo Alto Networks [1].[1][4]

The stated objective of Project Glasswing is to weaponize the model for defense, utilizing its capabilities to scan and secure both proprietary platforms and the open-source software that underpins the modern internet [1]. The early empirical results from this deployment are staggering. By late May 2026, Anthropic reported that the initial 50 partners had collectively used Mythos to discover more than 10,000 high- or critical-severity vulnerabilities in systemically important software [2].[1][2]

In its first month, the defensive coalition uncovered thousands of vulnerabilities across critical software.

The nature of the discovered vulnerabilities underscores the limitations of traditional security tooling. According to Anthropic's red team, Mythos uncovered a 27-year-old remote-crash vulnerability in OpenBSD, an operating system renowned for its rigorous security hardening [1]. It also identified a 16-year-old flaw in FFmpeg—a ubiquitous video processing library—locating a vulnerable line of code that automated fuzzing tools had tested five million times without triggering an alert [1]. Furthermore, the model demonstrated the ability to autonomously chain together multiple low-level vulnerabilities in the Linux kernel to achieve full system control [1].[1]

The nature of the discovered vulnerabilities underscores the limitations of traditional security tooling.

While the evidence for AI-driven vulnerability discovery is robust, the initiative faces a significant operational bottleneck: human verification and patching [2]. Discovering a flaw takes seconds, but verifying the finding, developing a secure patch, and coordinating its deployment across the global software supply chain remains a labor-intensive process [2]. This friction is particularly acute in the open-source ecosystem, which relies heavily on volunteer maintainers [2].[2]

To address this disparity, Anthropic committed up to $100 million in usage credits for defensive security work and $4 million in direct donations to open-source security organizations [1]. However, there is transparent uncertainty regarding whether this financial injection will be sufficient to clear the massive backlog of vulnerabilities the AI is generating, or if open-source maintainers will simply be overwhelmed by the influx of critical reports [2, 8].[1][2][8]

In the enterprise sector, where organizations control their own codebases, remediation is moving significantly faster. By late May, enterprise partners had already deployed patches for over 2,100 vulnerabilities discovered by the model [2]. The initiative expanded rapidly; by early June 2026, the coalition had grown to encompass approximately 200 organizations across more than 15 countries [4].[2][4]

The AI model is capable of tracing data flows to identify deeply buried vulnerabilities that human reviewers missed.

The deployment extends deep into critical financial infrastructure. On June 4, 2026, the Intercontinental Exchange (ICE)—the operator of the New York Stock Exchange and major global clearinghouses—announced it had integrated Claude Mythos Preview into its core cybersecurity architecture [5]. ICE executives confirmed the model is actively scanning the trading platforms and mortgage technology systems that serve as the backbone of global capital markets, aiming to remediate flaws before they can be exploited [5].[5]

Despite the success of the defensive rollout, the broader strategic window is closing. Security analysts at Bain & Company note that while Mythos is currently restricted, equivalent capabilities are actively being developed by other organizations, including OpenAI's GPT-5.4-Cyber and Google's Big Sleep [6]. The era of AI-enabled cyberattacks at scale is inevitable, shifting the immediate priority to strengthening foundational cybersecurity postures before these tools proliferate [6].[6]

Bain's analysis highlights a critical vulnerability: chronic corporate underinvestment in cybersecurity [6]. Many organizations, particularly those operating legacy operational technology in the energy, manufacturing, and transportation sectors, possess deep underlying weaknesses that AI-driven attacks will rapidly expose [6]. Analysts argue that incremental budget increases will be insufficient, suggesting that many businesses will need to double their cybersecurity spending to survive the coming shift [6].[6]

The primary bottleneck has shifted from discovering vulnerabilities to verifying and patching them.

In the interim, AI developers are attempting to thread a delicate needle between releasing capable models and preventing mass exploitation. When Anthropic released its consumer-facing Fable 5 model to the public on June 9, 2026, it implemented strict safeguards [4]. Any user queries related to advanced cybersecurity, biology, or chemistry are automatically intercepted and routed to the older, less capable Opus 4.8 model, effectively neutering the public tool's hacking potential [4].[4]

The ultimate efficacy of Project Glasswing remains an open question [8]. While the initiative has successfully demonstrated that frontier AI can be harnessed to identify and patch thousands of critical zero-day vulnerabilities, it is fundamentally a race against time [1, 7]. The coalition must secure the internet's foundational architecture before the inevitable democratization of these offensive capabilities places a machine-speed hacker on the laptop of every threat actor in the world [7, 8].[1][7][8]

How we got here

April 7, 2026
Anthropic announces Project Glasswing and the highly capable Claude Mythos Preview model.
April 13, 2026
The UK AI Security Institute publishes evaluations confirming Mythos can autonomously execute multi-stage network attacks.
May 22, 2026
Anthropic reports that the coalition's initial partners have discovered over 10,000 critical vulnerabilities.
June 4, 2026
Intercontinental Exchange (ICE) announces the deployment of Mythos to secure the New York Stock Exchange.
June 9, 2026
Anthropic releases the consumer-facing Fable 5 model, strictly routing advanced cybersecurity queries to older models.

Viewpoints in depth

Frontier AI Developers

Argue that highly capable models must be restricted and deployed defensively first to prevent catastrophic exploitation.

Organizations like Anthropic maintain that releasing models with autonomous hacking capabilities to the public would be deeply irresponsible. By restricting access to vetted partners through initiatives like Project Glasswing, they aim to create a 'defensive window' where the good guys can patch legacy vulnerabilities before threat actors inevitably acquire or replicate the same AI capabilities.

Enterprise Defenders

View AI-driven vulnerability discovery as a necessary force multiplier to overcome chronic technical debt.

Security analysts and enterprise leaders argue that the industry has suffered from years of chronic underinvestment, leaving critical infrastructure riddled with decades-old bugs. For these defenders, frontier AI is not just a threat, but the only viable mechanism to audit massive, complex codebases at scale and remediate vulnerabilities faster than human teams ever could.

Government Evaluators

Focus on empirical testing and the urgent need for foundational security hygiene.

Agencies like the UK's AI Security Institute emphasize that while AI models are achieving unprecedented success in simulated cyber ranges, their real-world impact depends heavily on a target's existing security posture. They advocate for a return to rigorous security basics—such as strict access controls and rapid patching—warning that AI will ruthlessly exploit organizations that fail to maintain basic cyber hygiene.

What we don't know

How quickly open-source volunteer maintainers can verify and patch the massive volume of vulnerabilities Mythos is discovering.
When equivalent offensive AI capabilities will inevitably leak or be replicated by nation-state threat actors.
Whether the $100 million in usage credits will be sufficient to secure the long tail of underfunded open-source projects.

Key terms

Zero-day vulnerability: A software flaw that is unknown to the vendor, meaning no patch exists at the time it is discovered or exploited.
Capture-the-flag (CTF): A cybersecurity exercise where participants attempt to find and exploit vulnerabilities to retrieve a hidden piece of data.
Fuzzing: An automated software testing technique that inputs massive amounts of random data to find coding errors and system crashes.
Operational technology (OT): Hardware and software that detects or causes a change through the direct monitoring and control of physical devices, common in manufacturing and energy.
Agentic scaffolding: Software frameworks that allow AI models to autonomously plan, execute, and iterate on multi-step tasks over long periods without human intervention.

Frequently asked

Can anyone use Claude Mythos to find vulnerabilities?

No. Anthropic has restricted access to vetted partners in Project Glasswing. The public version, Fable 5, actively blocks advanced cybersecurity tasks.

Did Anthropic build Mythos specifically to be a hacking tool?

No. It was designed as an advanced software engineering model, but its deep understanding of complex code inherently made it exceptional at finding vulnerabilities.

What kind of bugs is the AI finding?

It is finding deeply buried flaws, including a 27-year-old remote-crash bug in OpenBSD and complex vulnerability chains in the Linux kernel that human reviewers missed.

How are open-source projects handling the influx of bug reports?

It remains a significant challenge. While Anthropic has donated $4 million to open-source security orgs, volunteer maintainers face a massive backlog of AI-generated vulnerability reports to verify and patch.

Sources

[1]AnthropicFrontier AI Developers
Project Glasswing
Read on Anthropic →
[2]AnthropicFrontier AI Developers
Project Glasswing: An initial update
Read on Anthropic →
[3]UK AI Security InstituteGovernment Evaluators
Cyber evaluations of Anthropic's Claude Mythos Preview
Read on UK AI Security Institute →
[4]The GuardianGovernment Evaluators
AI company restricted access to Fable 5, its most powerful Mythos model, for months over cybersecurity concerns
Read on The Guardian →
[5]Intercontinental ExchangeEnterprise Defenders
Cyber security initiative deployed across exchanges and clearinghouses
Read on Intercontinental Exchange →
[6]Bain & CompanyEnterprise Defenders
Claude Mythos and the Era of AI-Enabled Cyberattacks
Read on Bain & Company →
[7]Arctic WolfEnterprise Defenders
Project Glasswing Shifts the Attack-Defense Balance
Read on Arctic Wolf →
[8]Factlen Editorial TeamEnterprise Defenders
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →

Up next

Decentralized Web

The End of the Walled Garden: How Decentralized Protocols Are Rewiring Social Media

Open standards like ActivityPub and the AT Protocol are quietly dismantling the era of locked-in social networks. By decoupling user identity from corporate servers, the decentralized web promises a future where your digital life belongs entirely to you.

Every angle. Every day.

Get technology stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse technology