The Evidence Pack: How AI is Forcing a Revolution in Cyber Defense
Following reports that Anthropic's Mythos model breached classified systems during a test, the Five Eyes alliance is urging organizations to adopt machine-speed defenses.
By Factlen Editorial Team
- Defensive Security Practitioners
- Argue that AI is the only way to scale defense against automated threats, pointing to the hundreds of zero-days patched before exploitation as proof of its value.
- National Security Officials
- Focus on the geopolitical risks of frontier AI proliferation, arguing that offensive capabilities must be treated as strategic assets with strict export controls.
- Frontier AI Developers
- Highlight the necessity of controlled red-teaming and restricted rollouts to safely test the boundaries of AI capabilities while giving defenders a head start.
What's not represented
- · Open-source software maintainers who lack the budget to access frontier AI defensive tools.
- · Small and medium-sized businesses struggling to implement machine-speed defenses.
Why this matters
The timeline for identifying and patching software vulnerabilities is shrinking from months to minutes. Organizations that fail to adopt AI-driven defensive tools risk being outpaced by automated threats that can bypass traditional security measures.
Key points
- Anthropic's Mythos AI model successfully identified vulnerabilities in classified US systems during an authorized red-team exercise.
- The Five Eyes alliance warned that AI will transform cyber capabilities in 'months, not years.'
- Defenders are using the same AI technology to patch zero-day flaws before they can be exploited.
- Mozilla used the Mythos Preview to identify and patch 271 security vulnerabilities in Firefox.
- Intelligence agencies urge organizations to adopt machine-speed defense and treat cybersecurity as a core business risk.
The recent headlines surrounding the intersection of artificial intelligence and national security sound like the plot of a science fiction thriller: a frontier AI model reportedly breached classified United States military systems in a matter of hours. For an industry already grappling with the rapid acceleration of machine learning capabilities, the news initially seemed to confirm the worst fears about autonomous digital weapons operating beyond human control. The sheer speed of the reported intrusion suggested a fundamental shift in the balance of power between cyber attackers and defenders, sparking immediate concern across global markets and intelligence communities.[1][2]
But context changes everything, and the reality of the incident offers a far more constructive lesson. This was not a rogue attack executed by a malicious state actor or a ransomware syndicate, but rather a highly controlled, authorized red-team exercise using Anthropic’s unreleased "Mythos" model. The U.S. government intentionally pointed the AI at its own infrastructure to see what it could find. By simulating an advanced persistent threat in a safe environment, officials were able to map out exactly how next-generation AI systems will approach network infiltration, turning a potential crisis into a vital learning opportunity.[1][3]
The exercise proved something vital for the future of the cybersecurity industry. It demonstrated unequivocally that frontier AI can find zero-day vulnerabilities—critical software flaws previously unknown to vendors—faster and more comprehensively than human auditors. Rather than viewing this solely as a threat, security professionals recognize it as a massive leap forward in diagnostic capability. If an AI can find a vulnerability in hours during a test, defenders can use that exact same capability to patch the flaw before an adversary ever discovers it.[1]
This revelation prompted the "Five Eyes" intelligence alliance—comprising the U.S., U.K., Australia, Canada, and New Zealand—to issue a rare joint warning this week. The coalition declared that AI will fundamentally transform cyber capabilities in "months, not years." However, their message was not one of despair, but a call to arms for corporate and government leaders to modernize their defenses. They emphasized that the window between vulnerability discovery and exploitation is shrinking rapidly, necessitating a complete overhaul of how organizations approach digital resilience.[4][6]

To understand why this development is a breakthrough for defenders rather than just a looming threat, it is necessary to look at the underlying mechanism of how models like Mythos actually audit code. For decades, the cybersecurity industry has relied on a cat-and-mouse game where defenders try to anticipate the specific techniques attackers will use. As software architecture became more complex, finding the hidden logical errors that lead to breaches became exponentially more difficult, often requiring months of dedicated research by highly specialized teams.
Traditionally, finding these software flaws required a technique known as "fuzzing"—an automated process of throwing random, malformed data at a program until it crashes—or painstaking manual review by human security experts. While fuzzing is effective at finding memory corruption bugs, it struggles to identify complex logical flaws where the code functions exactly as written but produces an insecure outcome. Human review can catch these logical errors, but it is entirely unscalable given the millions of lines of code underpinning modern digital infrastructure.
Mythos changes this paradigm entirely by applying deep semantic understanding to vast codebases. It reads the logic, understands the intended architecture, and identifies logical loopholes without needing to brute-force the system. Because it has been trained on massive datasets of both secure and insecure code, it can recognize patterns of vulnerability that would be invisible to traditional scanning tools. It essentially acts as a tireless, expert-level security researcher capable of holding the entire context of an operating system in its memory at once.[2]
In early testing, the U.K. AI Safety Institute found that Mythos succeeded 73 percent of the time on expert-level "capture the flag" cybersecurity challenges. These are not simple automated scans; they are complex, multi-stage puzzles designed to test the absolute limits of human hacking expertise. The fact that a machine learning model can consistently solve them indicates that the cognitive bottleneck in vulnerability research has been broken.[7]
AI Safety Institute found that Mythos succeeded 73 percent of the time on expert-level "capture the flag" cybersecurity challenges.
The model also demonstrated the unprecedented ability to autonomously chain together multiple complex steps. During simulated corporate network attacks, Mythos successfully executed reconnaissance, privilege escalation, and lateral movement, completing an average of 22 out of 32 required steps to achieve total network compromise. This chaining capability is what makes the model so potent: it does not just find a single unlocked door; it maps the entire building and figures out how to reach the vault.[7]

Recognizing the dual-use nature of this immense power, Anthropic launched "Project Glasswing," a restricted program granting early access to critical infrastructure defenders and government agencies. The goal was to pull forward the timeline of patching and hardening systems before models with Mythos-class capabilities inevitably proliferate to malicious actors. By giving the shield builders access to the ultimate sword, the company hoped to inoculate the digital ecosystem against future automated attacks.[3][5]
The results of arming defenders with AI have been immediate and highly effective. Mozilla, for example, used the Mythos Preview to identify and patch 271 security vulnerabilities in its Firefox browser before they could ever be exploited in the wild. Tasks that would have taken Mozilla's security team months of manual auditing were completed in a fraction of the time, proving that AI can scale defense just as effectively as it scales offense.[8]
This defensive capability is the core of the Five Eyes' message: the only way to defend against machine-speed attacks is with machine-speed defense. The intelligence agencies stressed that organizations can no longer rely on manual processes to detect and respond to intrusions. Instead, they must deploy AI-enabled tools that can monitor network behavior, identify anomalies, and execute containment protocols autonomously, matching the speed of incoming threats.[4][6]
Human-paced Security Operations Center (SOC) triage is structurally misaligned with an adversary that can execute an attack chain in seconds. If an AI agent can move laterally across a network in the time it takes a human analyst to open a support ticket, the battle is lost before it begins. The Five Eyes advisory makes it clear that automation must be met with automation, shifting the human role from frontline triage to strategic oversight and system architecture.[6]
Furthermore, the intelligence agencies are urging organizations to abandon the outdated idea that cybersecurity is purely an IT problem. They are explicitly calling for it to be elevated to a core business risk that requires active board-level oversight. Leaders must ensure that their security teams have the resources, authority, and advanced tools necessary to operate in an environment where the threat landscape shifts on a monthly basis.[4][6]

Their recommendations focus heavily on "secure-by-design" and "secure-by-default" principles. This means fundamentally reducing attack surfaces, accelerating patching cycles to close vulnerabilities the moment they are discovered, and addressing legacy systems that serve as strategic liabilities. The goal is to build networks that are inherently resilient, rather than relying on a perimeter defense that an AI model can easily bypass.[6]
The recent export controls imposed by the U.S. government, which restricted access to Mythos and forced Anthropic to pull back its release, highlight the geopolitical tightrope of managing these tools. The administration determined that the model's capabilities were too potent to risk falling into the hands of foreign adversaries, effectively classifying it as a strategic national asset.[5]
While limiting access keeps the tool out of the hands of state-sponsored hackers, it also temporarily disrupts the work of allied defenders who were using it to harden their own networks. Reports indicate that even parts of the NSA lost access to the most advanced versions of the model due to the sweeping nature of the restrictions, illustrating the complex trade-offs involved in governing dual-use technologies.[5]
Ultimately, the Mythos incident serves as a vital stress test for global digital infrastructure. It proves that while offensive digital weapons are getting smarter, the defensive shields are evolving just as rapidly—provided organizations are willing to deploy them. The transition to AI-driven cybersecurity will be turbulent, but it offers the promise of a future where networks are continuously tested, patched, and hardened at machine speed, creating a far more resilient digital world.
How we got here
April 2026
Anthropic launches Project Glasswing to give critical defenders early access to the Mythos model.
May 2026
Mozilla successfully uses Mythos Preview to patch 271 vulnerabilities in its Firefox browser.
June 11, 2026
The Senate Intelligence Committee is briefed on Mythos's ability to penetrate classified systems during testing.
June 22, 2026
The Five Eyes intelligence alliance issues a joint warning on the rapid acceleration of AI cyber capabilities.
Viewpoints in depth
Defensive Security Practitioners
Argue that AI is the only way to scale defense against automated threats.
For security teams on the front lines, the advent of models like Mythos is a long-awaited equalizer. Practitioners argue that human-paced triage is structurally incapable of defending against automated, machine-speed attacks. By utilizing frontier AI to audit codebases, defenders can identify and patch zero-day vulnerabilities before adversaries even know they exist. They point to real-world successes, such as Mozilla patching hundreds of flaws in Firefox, as proof that AI is an essential tool for hardening global infrastructure.
National Security Officials
Focus on the geopolitical risks of frontier AI proliferation and the need for strict export controls.
Intelligence and defense officials view frontier AI models through the lens of strategic national security. Because these models can autonomously chain together complex exploits and penetrate hardened systems, officials argue they must be treated as dual-use weapons. This perspective justifies the implementation of strict export controls and limited access, even among allied nations, to prevent the technology from falling into the hands of state-sponsored adversaries or ransomware syndicates.
Frontier AI Developers
Highlight the necessity of controlled red-teaming and restricted rollouts to safely test AI capabilities.
The companies building these models emphasize a cautious, iterative approach to deployment. Developers argue that programs like Project Glasswing are essential for safely testing the boundaries of AI capabilities in real-world environments. By granting early access exclusively to critical infrastructure partners and government agencies, developers believe they can give defenders a crucial head start on patching vulnerabilities before the underlying AI capabilities inevitably proliferate to the broader public.
What we don't know
- How quickly malicious actors will be able to replicate the autonomous cyber capabilities demonstrated by Mythos.
- Whether the US government will permanently restrict allied access to frontier AI models under export controls.
- How organizations without dedicated security budgets will afford to implement machine-speed defenses.
Key terms
- Red-Teaming
- The practice of rigorously challenging a system or network by acting as an adversary to identify vulnerabilities.
- Zero-Day Vulnerability
- A software flaw that is unknown to the vendor, meaning there is 'zero days' of protection or patch available.
- Lateral Movement
- Techniques that cyber attackers use to progressively move through a network after gaining initial access.
- Fuzzing
- An automated software testing technique that involves providing invalid, unexpected, or random data as inputs to a computer program.
- Five Eyes
- An intelligence alliance comprising Australia, Canada, New Zealand, the United Kingdom, and the United States.
Frequently asked
Did an AI actually hack the US government?
No malicious attack occurred. Anthropic's Mythos model was used in a controlled, authorized testing environment to identify vulnerabilities in classified systems so they could be patched.
Why did the Five Eyes issue a warning?
The alliance warned that AI is accelerating the speed and scale of cyber capabilities, urging organizations to adopt AI defensively and treat cyber resilience as a core business priority.
What is Project Glasswing?
It is a restricted access program created by Anthropic to allow critical infrastructure partners and defenders to use the Mythos model to find and fix software flaws before it is widely released.
Sources
[1]Associated PressNational Security Officials
Anthropic's Mythos model uncovers vulnerabilities in classified US systems during testing
Read on Associated Press →[2]The EconomistFrontier AI Developers
AI and the new cyber threat: How Mythos penetrated classified networks
Read on The Economist →[3]BloombergFrontier AI Developers
US Treasury Seeking Access to Anthropic's Mythos to Find Flaws
Read on Bloomberg →[4]CBC NewsNational Security Officials
Five Eyes intelligence alliance warns AI models pose huge cybersecurity risks
Read on CBC News →[5]NextgovNational Security Officials
NSA loses access to Anthropic's Mythos AI amid export controls
Read on Nextgov →[6]Industrial CyberDefensive Security Practitioners
Five Eyes global cybersecurity agencies issue joint warning on AI cyber threats
Read on Industrial Cyber →[7]UK AI Safety InstituteDefensive Security Practitioners
Evaluation of Anthropic Mythos: Autonomous Cyber Capabilities
Read on UK AI Safety Institute →[8]Mozilla Security BlogDefensive Security Practitioners
Patching 271 vulnerabilities with Anthropic's Mythos Preview
Read on Mozilla Security Blog →
Every angle. Every day.
Get technology stories with full source coverage and perspective breakdowns delivered to your inbox.










