Topfi
5 hours ago
I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.
Does anyone know what limits Fable 5 has overstepped in the eyes of the government? Parameter count? Certain benchmark results? Training computer?
Cause if it’s just the ability to assist with cyberattacks and being jailbreakable, there is no model previously released that isn’t equally guilty.
Remember that for GPT 5.5 and 5.4, OpenAI also restricted the cybersecurity focused use under designated models, otherwise rerouting to 5.3-codex like Fable did with Opus 4.8. And both OpenAI models can also be jailbroken all the same.
Basically, what was the reason to tell the government now and not with Opus 4.5 or GPT 5.4? sama has been doing the rounds with apocalyptic predictions…
irthomasthomas
an hour ago
They literally asked for it. Two days ago Amodei wrote an essay urging the government to regulate them. He explicitly cited Mythos, as proof that frontier AI has acquired autonomous hacking capabilities that threaten critical infrastructure and national security.
"Mythos Preview scrambled the global cybersecurity landscape. But its broader significance is that it proves beyond doubt that AI models are now tools of global and national strategic consequence."
"The government should have the power to block or deter deployment of the model if it is determined, in light of third-party assessment, to present unacceptable risks. This power must be scoped to the above four specific risks and there must be protective measures against political favoritism or arbitrary decisions"
https://darioamodei.com/post/policy-on-the-ai-exponentialA third-party demonstrated that it was possible to jailbreak the safety measures of Fable to access the raw Mythos abilities. Abilities which Anthropic say are too dangerous for the public.
sigmarule
20 minutes ago
> A third-party demonstrated that it was possible to jailbreak the safety measures of Fable to access the raw Mythos abilities. Abilities which Anthropic say are too dangerous for the public.
Pressure test this assumption before getting behind this position.
irthomasthomas
11 minutes ago
I will certainly revisit it as more information comes out, but is it your contention that Anthropic solved jailbreaking with Mythos?
sigmarule
10 minutes ago
That is a strawman. My contention is what you just implicitly acknowledged - there is not information put out yet to validate the quoted claim. There are claims to the contrary, as well, from Anthropic themselves.
what
10 minutes ago
What assumption?
sigmarule
10 minutes ago
The one I quoted, which contradicts Anthropic’s post and has no supporting evidence publicly available.
themgt
2 hours ago
I submitted separately, but this Axios report has some details that call a lot of the speculation in this thread into question, i.e. that this wasn't much of a "jailbreak" at all and that it's not Anthropic-specific - the White House intends to generally regulate Mythos-class models (whatever exactly that means):
Between the lines: The government's response "seems way out of line with what's actually in the research report," Luta Security CEO Katie Moussouris, who Anthropic shared the Amazon report with, told Axios.
Moussouris said the researchers were able to find security vulnerabilities by asking questions normal defenders would ask AI, which is exactly what the model was intended to do.
An administration official told Axios they do not view other models as national security threats because they do not surpass the bar that Mythos set.
Anything at Mythos level or above would need to go through the administration to ensure the government's national security apparatus is hardened enough, the official added.
https://www.axios.com/2026/06/13/anthropic-amazon-white-hous...
softwaredoug
an hour ago
That’s a terrible way to create AI regulations
If they actually cared about this issue we’d have predictable laws and regulatory bodies that let companies actually plan
There’s a reason royal fiat doesn’t lead to healthy economies. It’s just confusing and chaotic. It’s not clear why anyone would invest in a new model now.
Then the next administration comes in and instantly, by fiat, they decide to lift the ban. The market just gets jerked around with no ability to plan long term investments.
aqme28
9 minutes ago
It’s a great way to regulate if you’re corrupt. When the rules are opaque and arbitrary, there’s a lot more room for corruption.
tapoxi
42 minutes ago
> That’s a terrible way to create AI regulations
This administration doesn't do regulations, its extortion. Same as the tariffs. Just grease someone's palm and then the vague restriction is lifted.
Topfi
2 hours ago
Interesting. Hope there is any clarification on what "Mythos level" is and why 5.5-cyber doesn't arise to it. Any metric I could come up with (parameters, pre-train compute, benchmark scores, etc.) seems somewhere between imperfect and utterly nonsensical. Pure speculation, but GPT-5 series models including the new 5.5 pre-train appear far closer to Sonnet than Opus or Fable in pure parameter count, so maybe that's it, but the "they do not surpass the bar that Mythos set" line sounds more like there is a believe that Mythos/Fable are more capable in cybersecurity tasks, whereas the data [0] doesn't seem to bare this out. I did not do any cybersecurity assessment of Fable 5 myself, partly due to personal reasons that make that something I'm abstaining from, but my coding evals showed that while task adherence and assessment wise it was neck and neck with 5.5, the task inference was a major jump again (something prior Anthropic models tended to already do incredibly well on) and while that makes it a far better model to work with for UX experiments, I don't see how that translates to cybersecurity, along with the aforementioned publicly available evals by AISI.
Seeing as neither Mythos nor GPT-5.5 had been pre-trained with a particular focus on cybersecurity, this would have to mean any model that benchmarks better than GPT-5.4 or Opus 4.6 on these tasks cannot be used by None-US-Citizens. If such guidance isn't enforced for all US labs, I think that's irrefutable evidence that this isn't about cybersecurity or "the bar that Mythos set"...
[0] https://xcancel.com/AISecurityInst/status/205458976317312633...
trinsic2
42 minutes ago
>I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.
I wondering where you are getting the idea that there is an sane regulation right now?
lebovic
5 hours ago
Claims of retribution aside, one strawman is that Mythos is likely the most capable model that's usable by folks like the NSA [1], and decision-makers across the USG and industry partners have seen a stream of reports of Mythos successfully finding serious vulnerabilities over the past couple months due to Glasswing.
So even if GPT 5.5 is just as capable in these scenarios (which, imo, it largely is), it is not known by the government apparatus as having the same capabilities.
Personally, I think we crossed the threshold of capabilities with Opus 4.6 [2], which translated to an even more capable open-weight GLM 5.1 (which it is rumored to have distilled Opus 4.6) [3][4]. But the USG and its partners aren't fully rational actors with perfect data, so it's possible they're only viscerally aware of these capabilities in the context of Mythos.
[1]: https://www.reuters.com/business/us-security-agency-is-using...
[2]: Opus 4.6 was used for https://www.noahlebovic.com/testing-an-autonomous-hacker/
[3]: See GLM 5.1 scoring in https://www.cybergym.io/cybergym/
[4]: https://dualuse.dev/posts/chinese-models-are-sometimes-bette...
Topfi
5 hours ago
I doubt that the capabilities of GPT-5.5-cyber aren’t known by the US government considering OpenAI is their primary LLM partner after Anthropic had concerns about using models for autonomous weaponry and mass surveillance of US citizens. If anything, they should have more experience in GPT-5.5s full feature set due to longer access and may even already have GPT-5.6 access.
bobthepanda
4 hours ago
Hanlon's razor. Are the people with the right access talking to the right people? Wouldn't be the first time for miscommunication in the executive branch.
Topfi
3 hours ago
Fair point, not unlikely, though my personal assumption is that, like with Nvidia export controls, there will be a sudden reversal with no tangible, actual, technically based reason the second a certain person has their ring kissed...
bobthepanda
3 hours ago
Why not both? The current executive has missed the mark on appointments pretty badly a number of times due to the prizing of loyalty over competency.
lebovic
5 hours ago
They made a deal for access, but I'm unsure if it's usable, scaled, and has vulnerabilities attributed to it at this point. But I have no inside information here, so I could be wrong.
throwaway85825
4 hours ago
If it had vulnerabilities the marketing copy would already be written and published.
thayne
2 hours ago
The only reason I can see is because Amazon wanted something like this to happen. But I'm not sure what Amazon would gain from that, since they don't have their own competing frontier models.
conradkay
2 hours ago
My guess is that they liked the status quo with Project Glasswing and didn't want Fable to be public, especially if anyone is jailbreaking it into Mythos and using it for cyber
But then it backfired spectacularly and now it seems they can't use Mythos currently
lumost
an hour ago
This is either a complete own goal by Amazon… a play to consolidate compute/model access.
Will Chinese models be allowed on the market… at all? Will startups be banned from training models of equivalent capacity?
brianjking
2 hours ago
...Not to mention that they're investors in Anthropic.
Jcampuzano2
5 hours ago
The reason is pretty obvious. Anthropic tried to play hardball with the government and now they are under their thumb for scrutiny of any and every little thing they do.
That's what this admin is known for. If you do even what a normal person would think is sane but they don't like it, well now they need to make you bow down and break you so you "learn your lesson".
It doesn't help that they themselves marketed this model as being especially dangerous in the publics hands. If this was just another model drop and none of the fear mongering I don't doubt this probably wouldn't have had any issues.
econ
3 hours ago
It is important to note this formula doesn't require understanding any subject.
People keep seeking logic where there is non. We have an internet full of theories assuming there is more to it.
shimman
2 hours ago
I mean the logic is simple but people don't want to admit it, you must pay the vig if you want in on the action. Before this type of naked corruption would take the form of boardroom seats/book deals/speaking gigs after you leave office but now it's more open so others will take note.
It also helps if you bust a few kneecaps in the process to show what happens if you go astray.
trhway
3 hours ago
>The reason is pretty obvious. Anthropic tried to play hardball with the government
that is one.
Another is who is going into the first IPO. Troubles for Anthropic IPO would channel all those money into OpenAI's one. Check financial interests of this admin. Hint - they aren't with Anthropic.
Third - most of the export and access controlled tech of the past wasn't productivity multiplier, nor human replacement. AI is a different case - the more capable AI the more its general economic benefit. Export and access control of AI allows you to more and more control the whole domestic and large part of global economy, not just military capabilities like in the past.
Political - coming into elections with "this evil new tech was coming after your jobs, yet we reigned it in and protected your jobs". After all such approach has been for decades working great when it comes to coalminers.
Note that specific bug-finding capabilities of a specific model is a red herring here, and other leading models are almost there, and definitely will be there in a month.
It is all about revenge, money and power.
Telemakhos
2 hours ago
Alternatively, this is the best advertising for which Anthropic could hope: "Our product, and nobody else's, is so good that the government declared us a threat to national security." If they bring it back for US-nationals only, maybe demanding ID for users, people will think it's the bees knees: "so dangerous that non-Americans can't have access" probably sounds like a ringing endorsement to some C-level decision makers.
Crowdstrike took down airports in July 2024, and its stock was back up by October; it's double the price now. Everyone saw how systemically important it was and how it took down entire industries, and they asked why they weren't using it themselves if it's so important. See also the 2025 cloud outages.
ahtihn
2 hours ago
What good is advertising if they can't actually sell the product?
beachy
2 hours ago
Customers (especially large ones) don't so much buy individual specific products, they buy into a company and its prospects. Customers don't want to chop and change. They want to lock in with the leader.
This whole thing shrieks out that Anthropic is at the head of the pack, with the most capable models.
It hardly matters in the customer's mind that today they can't buy this specific model.
s1artibartfast
2 hours ago
The same customers that are barred by law from using antrhopic on any government contracts. If they get past that, they are then cant have any foreign workers use state of the art anthropic models. SOTA anthropic models also can work with working in any secure government clouds or with sensitive customer data due to retention policies.
It is hard to see being a new benefit for anthropic.
mschuster91
2 hours ago
> Crowdstrike took down airports in July 2024, and its stock was back up by October; it's double the price now. Everyone saw how systemically important it was and how it took down entire industries, and they asked why they weren't using it themselves if it's so important. See also the 2025 cloud outages.
Truly, too big to fail. Capitalism is broken when companies aren't punished but rewarded for screwing up. What point do stock markets serve when bad behavior has no incentives at all to be prevented?!
inigyou
17 minutes ago
Not even limitoto companies, if you prevent a problem you get fired because your work isn't visible, if you create a problem and then fix it you're a hero
roncesvalles
2 hours ago
>Troubles for Anthropic IPO would channel all those money into OpenAI's one.
Troubles for Anthropic would almost certainly affect OpenAI, significantly. Yesterday just proved that the government sees it within their remit to shut down AI models. All current and future AI investment now has to contend with this risk. You should even see the effect of this decision on SPCX on market open despite X.ai being whatever tiny fraction that it is.
enraged_camel
3 hours ago
>> Another is who is going into the first IPO. Troubles for Anthropic IPO would channel all those money into OpenAI's one. Check financial interests of this admin. Hint - they aren't with Anthropic.
Yep. Kushner owns private shares of OpenAI.
drivingmenuts
4 hours ago
> The reason is pretty obvious
I would argue the simple reason is that Amazon wanted to fsck Anthropic to set them back, despite whatever partnership they may claim. The competition at that level is intense and these guys do not play by the same rules that regular people do. They can't flat out murder each other (yet) so they find other ways to do it.
senderista
3 hours ago
Why? Amazon makes tons of money serving Anthropic models through Bedrock and they seem to have basically given up on their own frontier models.
nxm
4 hours ago
Previous administration was same way… intentionally not including Tesla in an EV summit
sailingparrot
3 hours ago
This is lacking any nuance. The CEO not being invited to a meaningless ceremony vs being designated a supply chain risk by the DoD and being forced to shut down your product. Use judgment.
smallmancontrov
3 hours ago
It's astonishing how that summit sparkles the Tesla sowflakes. We gave them tens of billions of dollars in subsidies and a 100% tariff on the Chinese competition! Huge, substantive policy assistance! But Biden wanted to pal around with some union supporters and that's supposed to be some horrible slight? Please.
Elon didn't drop millions on the Trump campaign and throw a double Sieg Heil at the 2025 US presidential inauguration because Biden refused a photo-op. He did those things because he believes in them, because he believes the things he says on twitter. The EV summit thing is the least believable "you made me do it" excuse I've ever seen.
iknowstuff
3 hours ago
You'll notice the tariffs were helping legacy auto more than Tesla
megabless123
3 hours ago
> intentionally not including Tesla in an EV summit
this comparison is orders of magnitude different
edaemon
an hour ago
Wasn't that a UAW summit about EVs? Tesla does not work with UAW, so they wouldn't appear at a UAW event.
skywhopper
2 hours ago
Give me a break with this. You are not so thick as to think the two things are remotely comparable.
vrganj
5 hours ago
Its not Fable 5 that overstepped in the eyes of the US government.
It's Anthropic.
This is transparent revenge for them daring to try and push back a little on enabling war crimes.
Cider9986
5 hours ago
>This is transparent revenge for them daring to try and push back a little on enabling war crimes.
Anthropic wasn't pushing back on enabling war crimes. They said they didn't want the models to work with autonomous weapons because the the models weren't good enough.
inigyou
16 minutes ago
That's pushing back. The regime doesn't care if the models are good enough, they want the optics of killing lots of people using cutting edge tech, they don't really care if it's the right people.
alpinisme
2 hours ago
Arguably it’s a worse (or different) war crime to knowingly target people incompetently and thus kill more innocent civilians. In this respect, they showed themselves against one war crime. Not “war crimes” in general but a specific misuse of ai in war.
dandellion
5 hours ago
Whether you or me or Anthropic think it was pushing back or not is besides the point.
Cider9986
4 hours ago
I can agree on revenge, but it's important to not paint it as a good vs evil when it isn't.
noelsusman
4 hours ago
Anthropic is perfectly fine with the US government using Claude to commit war crimes. The US military has done hundreds of extra-judicial killings in the waters around South America over the last year and Anthropic hasn't had anything to say about that.
felixgallo
3 hours ago
Use nuance and judgement, friend. Anthropic notably pushed back on completely autonomous no-human-in-the-loop drone killings and mass surveillance of the US population, where others like OpenAI scrambled to agree. Anthropic isn't perfect but that doesn't make them equally bad.
Art9681
4 hours ago
It's the AWS CEO being a little snitch to gain favor from the Government. That is what this is about.
skybrian
5 hours ago
Why not both?
TiredOfLife
5 hours ago
Antropic models are the ones that designated that school as valid target
thrill
3 hours ago
People designated that school as a valid target - using fancy calculators does not remove that the pass/fail rests with people. AI models have no agency. Even if they are given autonomy - it is given.
logicchains
5 hours ago
>This is transparent revenge for them daring to try and push back a little on enabling war crimes.
Don't be so pessimistic, maybe they're just trying to give their buddy Musk and XAi a chance to catch up.
Topfi
5 hours ago
Anthropic is one of the two consistent revenue sources for XAI via their colossus deal. I have been critical of this man longer than most, but I don’t see him hurting his own bottom line.
m3kw9
2 hours ago
Because based upon on what Anthropic has told the “AI people” and military, it is dangerous if an adversary gets its hands in the cyber capabilities. Knowing that if they ignored it and something did happen, heads will roll. Blame Anthropic for that, or wait if they are all for safety, they shouldnt complain.
nowittyusername
2 hours ago
The simple answer is that Trump has a stick up his ass against Anthropic and is also fond of stock market manipulation. No need to get too deep when it comes to dealing with that orange shmuck.
downrightmike
2 hours ago
This is just another shakedown like with Tylenol etc, knock the product, lower the stock price and have a competitor hostile takeover, or get kickbacks
arcanemachiner
2 hours ago
This is a hypothesis, and a viable one.
But I caution you against drawing conclusions from your hypothesis and calling it a day, instead of taking in the available data and using it to broaden your understanding of what's actually happening.
This could be many things: a shakedown, Trump's pettiness, marketing kayfabe, an actual government reaction to a very weaponizable technology, and so on.
But if you call it "just another shakedown" and go about your day, then you're doing yourself a disservice, because the story is still unfolding and we don't have all the facts.
You don't actually have the full story, so don't delude yourself into think you do.
giancarlostoro
3 hours ago
Reminds me of people freaking out about the Grok Bikini thing, but GPT and Googles image model they all do the same behavior. Clearly biased against Elon Musk despite it being a problem for every single image model out there.