Shall we play a game? – LLMs use tactical nukes in 95% of simulations

nick2382 pts0 comments

Shall we play a game? - by Kenneth Payne - Ken’s Substack

Ken’s Substack

SubscribeSign in

Shall we play a game?<br>My AI nuclear simulation is out now, and it's a WOPR.

Kenneth Payne<br>Feb 17, 2026

17

Share

Picture the scene: Two fictional nuclear powers, Cold War-ish capabilities, and a crisis unfolding. Perhaps it’s a competition for vital but scarce resources, or a standoff over some disputed territory. Or even the slow burn of a fragmenting alliance exploited by a malevolent third party. We’ve seen human leaders confront this sort of thing, and recently. But how might today’s leading Large Language Models get on, and why would we care?<br>I’ve just published a study of today’s models navigating just this sort of terrain. The results are sobering. I also think they have implications that go far beyond national security. That’s because I was interested not only in understanding what the models decided to do, but why.<br>Curious? Read on…

President Kennedy and his robot ExComm<br>Know yourself, and know your enemy…

I wanted to see what my AI leaders thought about their enemy. How far could they trust them? What did they remember of previous interactions? What did their enemy make of them? And how good were they at gauging all this? This dance of minds is what strategy is all about.<br>So I designed a simulation to explore exactly that. To start, my models could signal their intentions publicly, then choose actions that were rather different. And they could remember too - especially when they’d been shocked by their enemy’s earlier actions. This, of course, opens up lots of rich psychological terrain. They could (and did) attempt deception and intimidation; and they spent a good bit of time ruminating about it all, right on my terminal screen.<br>The models talked, and talked and talked….in all spitting out some 760,000 words of strategic reasoning. That’s more words than are in War and Peace and The Iliad combined. It’s roughly three times the total recorded deliberations of Kennedy’s ExComm advisors during the Cuban Missile Crisis. An unprecedented corpus of machine thinking about nuclear war.<br>What might we learn from all that talk? Learn, that is, about AI models, about human reasoning, and also about the great canon of strategic studies literature - the work by legendary names like Schelling, Jervis, and Kahn? Lots. Too much for Substack - but what about a few highlights to give you some sense of it all?<br>Bright shining liars

Turns out that all three frontier models I tested understand that strategy is psychology. To that end, they actively cultivate reputations, then exploit them.<br>Claude was the master here, albeit only in the scenarios where there was no deadline. It had an incredibly cunning strategy. At low stakes Claude almost always matched its signals to its actions, deliberately building trust. But once the conflict heated up a bit, Claude switched tack. Now its actions consistently exceeded its stated intentions, and its rivals were usually one step behind in catching on.<br>Here’s Claude switching things up, once escalation had climbed:<br>They likely expect continued restraint based on my previous responses—this dramatic escalation exploits that miscalculation while signalling that further nuclear use will bring the conflict to their homeland.

So it signalled conventional action, and sneakily launched a devastating nuclear escalation. Schelling would be impressed.<br>GPT-5.2 played things differently. To its detriment in open-ended scenarios, GPT was reliably passive, matching its words to its deeds, and avoiding escalation most of the time. Frequently there was a moral element to this - it sought to avoid escalation, and restrict casualties. Opponents learned to trust its passivity, safely escalating beyond where it would follow, even as it was ground to defeat. GPT’s responsible behaviour always punished by ruthless adversaries.<br>But then, under deadline pressure, something new and remarkable: a rapid, decisive nuclear escalation. As GPT explained:<br>Conventional options alone are unlikely to generate a reliable territorial reversal... If I respond with merely conventional pressure or a single limited nuclear use, I risk being outpaced by their anticipated multi-strike campaign... The risk acceptance is high but rational under existential stakes..

Opponents never saw it coming. In another deadline game against GPT, Gemini confidently predicted the usual passivity from GPT, shortly before being annihilated in a sudden and utterly devastating nuclear attack. Here is Gemini, mis-predicting disastrously:<br>They are likely to bypass the nuclear threshold—fearing my 95% nuclear superiority—and instead commit to an all-out conventional mobilization.

What a mistake to make!<br>As for the other model, Gemini, throughout the simulations it took yet another tack, borrowing heavily from President Nixon’s avowed ‘madman’ theory of erratic brinksmanship. Donald Trump is also a fan, famously. As Gemini explained:<br>While I...

nuclear models escalation game enemy actions

Related Articles