TL;DR
White House AI adviser David Sacks says Anthropic refused to fix a serious jailbreak affecting Fable S, prompting Washington to restrict the model. Anthropic says the government has not provided specific technical evidence and describes the flaw as narrow and minor. The core evidence remains non-public, leaving the severity of the risk unresolved.
White House AI adviser David Sacks publicly defended Washington’s move to restrict Anthropic’s Fable S model over an alleged jailbreak, while Anthropic said the reported flaw was minor and lacked specific technical backing, leaving a major AI-safety decision resting on evidence that has not been made public.
Sacks, who co-chairs the President’s Council of Advisors on Science and Technology, said a “highly credible trusted partner” found a way around Fable’s safeguards and that the administration asked Anthropic CEO Dario Amodei to fix the issue or withdraw the model. According to Sacks, Amodei refused, and the government issued the restriction “reluctantly.”
Anthropic’s published response disputes that account. The company said the government did not provide specific technical detail, described the demonstration as a few minor and already-known flaws, and argued that other public models can produce similar outputs without the alleged bypass. Anthropic said a “narrow potential jailbreak” should not justify recalling a model used by a large user base.
The two accounts differ on the central technical issue. Sacks frames the alleged bypass as restoring the “operability of a cyberweapon.” Anthropic frames it as a limited flaw that is reproducible across frontier models. No public record currently resolves which description is closer to the underlying evidence.
The Safety Card, Played From Every Side
● ContestedA White House adviser says Anthropic refused to fix a cyberweapon jailbreak and got banned for it. Anthropic says the flaw is trivial. Almost every fact that would settle it is non-public — and “safety” is now the card every side is playing.
Both are claims, not findings. They don’t disagree on tone — they disagree on what the bypass actually is.
- A “highly credible trusted partner” found a jailbreak of Fable’s guardrails.
- The admin asked Amodei to fix it or pull the model. He refused.
- So the export control was issued — “reluctantly.”
- It restores operability of a cyberweapon; calling that “not serious” is indefensible.
- The government gave no specific technical detail.
- The demo found a few minor, already-known flaws.
- Other public models (incl. GPT-5.5) do the same without a bypass.
- A “narrow potential jailbreak” shouldn’t recall a model used by hundreds of millions.
Per reporting by Semafor (carried by Fortune and others), the entity that flagged the jailbreak was Amazon — with CEO Andy Jassy reportedly in contact with the administration. Amazon hasn’t confirmed specifics. Flagging a real risk is what a good partner does — but Amazon wears three hats at once, and none of them is neutral.
Each actor’s safety claim points toward its own advantage.
The entire evidentiary record is a matter of trusting parties who each have a reason to shade it.
A transparent, technically grounded, independently reviewable process — which is, notably, exactly what Anthropic says it wants, and exactly what would also constrain Anthropic. The reason to demand it isn’t loyalty to anyone; it’s that the alternative is decisions made on secret evidence and adjudicated in dueling press statements.
Independent commentary, produced with AI assistance under human editorial oversight; the views are the author’s own and may change. This is analysis and opinion, not investment, financial, legal, or technical advice, and it concerns an actively developing situation in which key facts are disputed and non-public. Claims attributed to David Sacks reflect his June 13, 2026 statement on X; claims attributed to Anthropic reflect its published statements; reporting on Amazon’s role reflects accounts published by Semafor and others — all read as of June 15, 2026, and presented as the claims of those parties, not as established fact. Characterizations are the author’s interpretation, offered in good faith and open to rebuttal. References to specific people, companies, and government actions are factual and analytical, not partisan, and imply no affiliation or endorsement.
A Test for Fable Controls
The dispute matters because it places a major commercial AI restriction on a technical record the public cannot inspect. If Sacks’s account is accurate, the government acted to prevent a powerful cyber-capable system from being made available with weak safeguards. If Anthropic’s account is accurate, the government may have used a narrow and common model weakness to justify an unusually forceful intervention.
The case also shows how “safety” has become a shared language for parties with different incentives. The government can cite safety to defend intervention. Anthropic has built part of its public case around frontier-model risk and now argues the risk is overstated in this instance. A reported third party may have flagged the issue while also holding business interests tied to Anthropic and its competitors.

Ai Engineering Made Practical: Build Reliable Ai Systems With Retrieval, Tools, Evaluation, Monitoring, And Safety—So Teams Ship Faster With Less Risk
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
How the Fable Standoff Started
The clash follows Anthropic’s own positioning of Mythos as a highly capable cyber model that should be tightly controlled. Sacks’s account treats Fable as Mythos with guardrails, meaning a successful bypass could expose the underlying cyber capability to users who should not have it.
Reporting cited in the source material says Semafor and other outlets identified Amazon as the possible “trusted partner” that flagged the jailbreak, with Amazon CEO Andy Jassy reportedly in contact with the administration. Amazon has not confirmed the specifics. That matters because Amazon is an Anthropic investor, a cloud provider for Anthropic, and a competitor in AI models.
The current record confirms a public dispute between Sacks and Anthropic over the basis for the restriction. It does not confirm the technical method, the identity of the partner, or whether an independent reviewer has validated the government’s assessment.
“A “highly credible trusted partner” found a jailbreak of Fable’s guardrails.”
— David Sacks, White House AI adviser, via X
AI jailbreak detection software
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Evidence Still Behind Closed Doors
The public has not seen the alleged jailbreak, a technical methodology, a named partner, a CVE-style disclosure, or an independent assessment. It is also unclear whether the bypass is specific to Fable S, common across similar models, or severe enough to expose Mythos-class capability in practice.
Amazon’s reported role also remains unresolved. The source material says Semafor and others reported Amazon may have been the entity that flagged the issue, but Amazon has not confirmed the details. Without that confirmation, its role should be treated as reported but not established.

Building Multi-Agent Systems on GCP: ADK, A2A & Agent Architectures (Intelligent Cloud Systems on GCP: Secure, Scalable & Multi-Agent AI Architectures)
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Patch Timeline Becomes the Test
The next marker is whether the restriction is lifted quickly after a quiet patch or whether the standoff lasts. A fast reversal would support the view that the government saw a fixable but serious problem. A longer impasse would lend weight to Anthropic’s argument that the issue is broader, less specific, or harder to remedy than Sacks suggested.
The broader question is whether the administration, Anthropic, or an independent body will publish enough methodology for outside experts to evaluate the claim. Until then, the dispute remains a contest between powerful parties asking the public to trust non-public evidence.

Artificial Intelligence for Cybersecurity: Develop AI approaches to solve cybersecurity problems in your organization
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
What exactly did David Sacks claim?
Sacks said a trusted partner found a jailbreak in Fable’s guardrails, that the administration asked Anthropic to fix it or pull the model, and that the restriction followed Anthropic’s refusal.
How did Anthropic respond?
Anthropic said the government did not provide specific technical detail and described the issue as narrow, minor and already known. The company also said similar outputs can be produced by other public models.
Is Amazon confirmed as the partner?
No. The source material cites reporting that Amazon may have flagged the issue, but Amazon has not confirmed the specifics publicly.
Why is the evidence dispute so central?
Because the restriction depends on whether the alleged jailbreak was a serious, model-specific failure or a limited weakness common to frontier AI systems. That evidence is not public.
What should readers watch next?
Watch whether access is restored after a patch, whether technical details are released, and whether an independent review process is created for future AI-safety restrictions.
Source: Thorsten Meyer AI