WSJ Article Claiming China Has Matched Anthropic Is Obvious Nonsense

Don't Worry About the Vase

SubscribeSign in

Zvi Mowshowitz Jun 29, 2026

The Wall Street Journal printed an outright false headline and heavily misleading story claiming this, which of course was uncritically amplified by the usual suspects. I post this now on its own so that we have a place to link to, to explain the situation. Headline News

WSJ Headline (Obvious Nonsense): China Has Matched Anthropic in Cybersecurity, Resetting AI Race.

That. Did. Not. Happen. The post even claims, explicitly, that Claude Opus 4.8 similarly ‘matches’ Claude Mythos, a claim which is even more obviously false. Shame upon the Wall Street Journal. I fear Gell-Mann Amnesia. If they can get something as important as this so completely wrong, what about everything else? I am skipping over the parts that involve accurate reporting, or minor quibbles. It seems important to focus on clearly debunking the central false claims. Alas, the mistakes made here very much rhyme with mistakes being made throughout all this by the White House, and that get latched onto by certain bad actors, who have played a large part in leaving us unprepared for the Mythos Moment. For a full understanding of GLM-5.2, which is indeed an impressive open model, here is my full coverage of that release , placing it in proper context. It is important to understand what makes Mythos special. This is not it. What Makes Mythos Special

What makes mythos special is not that only the chosen one can identify any given vulnerability in code. What makes Mythos special is that it can identify vulnerabilities autonomously, at scale, without being pointed at them, and can then autonomously string together a variety of seemingly unrelated vulnerabilities into full working exploits. This is the thing that GPT-5.6 Sol cannot do, and that Opus 4.8 and GPT-5.5 cannot do, and that GLM-5.2 also cannot do, at anything like the same level of difficulty. It is also a thing that Fable cannot do, not with any known prompting strategy, whether or not you call this a ‘jailbreak.’ Mythos cannot do anything that you could not eventually do with a less capable model, in the same sense that an infinite number of monkeys can write Shakespeare. The public would net benefit if both Fable and Sol were generally available in their current forms as soon as possible, including on the question of cybersecurity. Whereas releasing Mythos to the public at this time would in expectation be a serious error. Going Over The Detailed Claims

Robert McMillan, Raffaele Huang and Amrith Ramkumar (WSJ, being super misleading): Chinese artificial-intelligence systems have matched the performance of Anthropic’s powerful model Mythos in some cybersecurity scenarios, a development poised to reset the global tech race and pressure the White House in its overhaul of U.S. AI policy.

My lord, I hate the rules of bounded distrust, because the headline is allowed to lie (as it does here) and this first paragraph does not actually break journalistic norms. Why? Technically speaking, yes, ‘in some cybersecurity scenarios’ Z.ai’s GLM-5.2 can match the performance of Mythos. Those scenarios could be called ‘the easy ones.’ Security researchers said that a new AI model, released this month by China’s Zhipu AI, also known as Z.ai, can match the latest U.S. models when it comes to finding security bugs, although it still lags behind Anthropic’s and OpenAI’s products in other tasks.

More precisely, this is the same finding that was harped on previously about how various models, not only GPT-5.5 and Opus 4.8 but also some open ones, could always identify any given security issue in code, provided you give them sufficient resources and point them at the particular correct subsection of code. I would challenge the idea that GLM-5.2 can match Mythos in security bug identification in practice, but again if we sufficiently narrowly define it, then most of the things that Mythos can find GLM-5.2 can, if pointed correctly, also find. Onward to more skirting of the line. It’s almost an art form here. Robert McMillan, Raffaele Huang and Amrith Ramkumar (WSJ): Overall, the capability gap between top U.S. models and those built by Chinese companies has narrowed significantly, and use of Chinese AI systems has surged as businesses seek to rein in runaway costs. A host of companies, including Microsoft, are weighing how they can offer Chinese models on their platforms, a development that is set to alter the balance of power among tech companies.

Use of AI systems of all types has surged, so Chinese model usage has as well. And yes, Microsoft considered using a Chinese model for Copilot, because Copilot is the enshittified product they offer and they want to save money. And ‘set to alter the balance of power’ does not have a magnitude attached, so it technically is true. Then there’s...

WSJ Article Claiming China Has Matched Anthropic Is Obvious Nonsense

Related Articles

(no title)

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

Italy's Meloni says Trump 'made up' story that she 'begged' him for photo at G7