Skip to Main Content
Faint pattern of 1s and 0s on top of hexagons

Learning from Anthropic’s “Locked-Down” AI Models

Faint pattern of locks, 1s and 0s on top of hexagons

A recent product launch takes the conversation from pure capability to a needed focus on cybersecurity. On 9th June, Anthropic released its 🔗latest generation of AI models: Claude Fable 5 and Claude Mythos 5. While there are benefits in advanced reasoning, autonomous coding skills, and long-horizon agentic capabilities, the most telling aspect of the release was how it was distributed.

Anthropic made a deliberate choice: the unrestricted version of the model, Claude Mythos 5, is not available to the general public. Instead, it’s locked behind a strict vetting process, available only to trusted cybersecurity experts and government infrastructure providers through an initiative known as Project Glasswing. The reason? The model is simply too capable, and potentially dangerous, to be released without safeguards.

Why the Keys to Mythos 5 are Closely Guarded

The decision to restrict access to Mythos 5 is an unprecedented acknowledgment by a major AI developer of the profound security risks inherent in highly advanced language models. According to Anthropic’s own system card, Mythos 5 is the most capable model they have ever evaluated on cyber tasks, scoring far ahead of previous iterations in skills like exploit development.

Without constraints, the capabilities that allow an AI to autonomously migrate a 50-million-line codebase in a single day could theoretically be turned toward malicious ends. A model that can identify vulnerabilities, draft complex code, and execute long-running tasks autonomously presents a significant risk of uplifting well-resourced threat actors.

To protect the public while still offering commercial value, Anthropic released Claude Fable 5 for general enterprise use. Fable 5 shares the same underlying architecture as Mythos 5 but is equipped with advanced safety classifiers. If a user attempts to prompt Fable 5 for assistance with cybersecurity exploits or biochemical threats, the model immediately freezes and hands the request over to a less capable legacy model (Opus 4.8) to safely refuse or filter the response.

What Mythos 5 means for Enterprise Data Security

When the creators of world-leading AI are building elaborate digital cages to contain their own creations, it serves as a warning to the rest of the business world. The threat landscape is changing, and the barrier to entry for executing sophisticated cyber attacks is being lowered by the very tools designed to drive innovation.

🔗Read more on the evolving threat of AI in cybersecurity from the UK’s National Cyber Security Centre (NCSC)

For organisations, this development highlights several critical realities:

The Threat is Getting Smarter

Cybercriminals will inevitably gain access to uncensored or jailbroken AI models, if they haven’t already. This means phishing attempts will become flawless, malware will become polymorphic and adaptive, and automated exploitation will operate at a scale previously unseen.

Defensive AI is Mandatory

You can no longer fight an AI-powered adversary with manual processes or legacy tools. Organisations must adopt advanced, AI-driven threat detection and response platforms to keep pace.

Data Governance is Paramount

Advanced AI models excel at rapid data extraction and analysis. If a breach occurs, the speed at which an attacker can identify and exfiltrate your most sensitive data is drastically increased. Robust, zero-trust data security postures are no longer optional.

The Mondas Perspective

At Mondas, we believe that acknowledging the risks of AI is the first step toward building a resilient security posture. Anthropic’s cautious rollout of Mythos 5 validates our core philosophy: security must be woven into the fabric of technological advancement, not treated as an afterthought.

We champion the use of best-in-class software and tools, including responsibly deployed AI, to empower our analysts and secure our clients’ environments. However, we also recognise that tools alone are insufficient. It requires deep expertise, continuous threat intelligence, and a proactive mindset to stay ahead of the curve.

As AI models continue to break new ground in capability, the margin for error in enterprise security will shrink. The digital walls are being tested by increasingly intelligent adversaries, and reliance on outdated security models will eventually result in critical breaches.

About the Author

Lance Nevill is the Cyber Security Director at Mondas. With a deep passion for leveraging best-in-class technology, Lance specialises in helping enterprises navigate complex threat landscapes and build resilient data security strategies.

🔗Connect with Lance on LinkedIn

Are you prepared for the next generation of cyber threats?

If you’re struggling with the issues outlined in this article, or if you want to ensure your organisation’s data is secured against AI-empowered adversaries, Mondas specialise in this topic. Reach out now to get in touch and let our expert team fortify your digital future.

Article First Published: 11 June 2026