The Anthropic Fable mess, explained - Cautious Optimism
Happy Sunday! Below is my summary of the Anthropic, Mythos, Fable, government saga that has been The Topic since Friday. Consider it an opinionated tick-tock of what matters. Enjoy! — Alex
The Timeline
February/March: Anthropic and the Department of Defense get into a spat over the use of the company’s AI models; Anthropic wants certain limits on how its technology is used. The ensuing fight leads to Anthropic being dubbed a supply-chain risk, theoretically restricting government use of the company’s models and, in the same stroke, limiting government contractors’ access to the same.
April 7: Anthropic introduces Mythos, a new model family. After discovering that Mythos is adept at finding and exploiting novel cybersecurity flaws, Anthropic launches ‘Project Glasswing’ to provide critical technology companies with access to tools to harden their own software before a broader release.
April 16: The White House is reportedly working to make a version of Mythos available for agency use.
April 30: Anthropic wants to expand the number of groups that can access Mythos; the White House reportedly opposes the move, concerned that adding more partners would make Anthropic compute-constrained, thereby limiting USG access to the model.
June 2: Anthropic announces that Project Glasswing’s initial 50 partners found more than 10,000 serious software flaws by using Mythos. The company expands Project Glasswing to 150 new organizations in 15 countries. Anthropic also promises that it is working to "safely release Mythos-level capabilities in general access," but says that doing so hinges on "highly robust safeguards that prevent the model’s cyber capabilities from being misused–safeguards that we (and, to our knowledge, all other AI developers) have yet to develop."
June 9: Anthropic announces and releases Fable 5, a ‘Mythos-class’ model built to reduce cybersecurity and biology-related risks. The company said in its release notes that it had built safeguards for Mythos (Fable) sufficient "for a general release." At the time, Anthropic said that it had "prioritized safety" by making the guardrails on Fable 5 "stricter than would be ideal."
Some users found Fable 5 so restricted in certain use cases (mostly biology-related queries) that it was nigh useless.
Fable 5 also featured distillation protections and a strict 30-day data retention policy that Anthropic claimed would help "defend against complex and novel attacks (including new jailbreaks and attacks that operate across many requests) as well as help [it] identify and reduce false positives."
At this juncture, the core complaint leveled against Anthropic was that it was too cautious; that by keeping Mythos private and releasing only a security-minded version of the model (Fable), it was creating a two-tier AI market. People didn’t like that!
Anthropic had "previously notified the government multiple times" about its June 9 release date for Fable.
June 10: Anthropic releases two frameworks to address advanced AI development and its economic impacts. The papers called for "government action and for regulation—regulations that are carefully designed to prevent government overreach and protect innovation," including, when "a model poses risks of this kind," allowing the government to "have the legal authority to block or deter its deployment."
June 11: Amazon’s Andy Jassy reportedly told the government that its researchers had found a way to get Fable 5 to "provide [it] with information that could be used to aid cyberattacks and was supposed to be off-limits." (At least five other companies chimed in, too, making this less an Amazon-specific point, even if Amazon was a key actor.)
June 12: Senior White House staff and administration leaders met to discuss the situation, then brought Anthropic CEO Dario Amodei into the conversation via phone. (Getting Amodei on the line took 1.25 hours, per Politico, and the company says it offered other senior execs in the interim; there’s beef about why it took so long. We can skip the drama and focus on what matters.)
June 12, continued: Amodei viewed the issue as a misunderstanding and argued that the reported "bypass" did not "pose the same risk as a broader ‘jailbreak.’" The White House "urged" Anthropic "to voluntarily remove the model and coordinate with the government to address the vulnerabilities." Amodei asked for more "time and information, but he made no commitments to pull the model."
Politico reports that Treasury Secretary Scott Bessent "told Amodei directly that he was making a ‘bad decision,’ according to the senior White House official."
June 12, even more: After failing to reach a deal with Anthropic, the Trump administration imposed export controls on both Fable 5 and Mythos 5 (the two available versions of Mythos-class models, with differing safety limits).
Anthropic responds by...