Tool Use Enables Undetectable Steganography in Multi-Agent LLM Systems

Computer Science > Cryptography and Security

arXiv:2606.28425 (cs)

[Submitted on 25 Jun 2026]

Title: Tool Use Enables Undetectable Steganography in Multi-Agent LLM Systems

Authors: Jimmy Laurence Rippin, Simon C. Marshall, David Demitri Africa, Christian Schroeder de Witt

Abstract: Increasingly autonomous agentic AI systems pose novel multi-agent risks, such as secret collusion via covert communication channels. The natural defence to these collusion attempts is to monitor plain-text communication, but the efficacy of monitors has been called into doubt by increasingly sophisticated model steganography; indeed, some theoretical schemes have been proposed that are information-theoretically or computationally indistinguishable from good-faith plain-text communication. In this paper, we demonstrate that the complexity of these schemes is no longer a safety barrier, as agentic coding models can already produce undetectable stegosystems when given realistic tool usage, such as code execution or accessing research papers through web searches. Agents also adapt when key ingredients are missing, for example, by adding model-sampling components or implementing related keyed coding schemes. We then frame tacit steganographic coordination between agents as a Schelling-point problem and introduce coordination metrics for estimating when two agents are likely to select compatible schemes without explicit prior agreement. Our results suggest a shift in the threat model for covert communication between AI agents, where the main barrier is no longer whether frontier agents can understand and implement sophisticated stegosystems, but coordination: whether independently acting agents can converge on compatible schemes, keys, and parameters. We find substantial convergence on broad scheme families but limited strict one-shot coordination, suggesting that shared artefacts, repeated interaction, and tool-mediated search are the settings where covert communication risks are most acute. Overall, our findings provide empirical grounding for the recent strategic confinement hypothesis, which assumes that capable agents can construct covert channels that survive monitoring.
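To make the Schelling-point framing concrete, the following is a minimal illustrative sketch, not the coordination metric defined in the paper. It assumes each agent's choice of scheme can be sampled over repeated independent trials, and it estimates the probability that two agents choosing independently land on the same label. The function name `match_probability` and the labels `family_x` and `family_y` are hypothetical, and the sample data is invented purely for illustration.

```python
from collections import Counter


def match_probability(choices_a, choices_b):
    """Estimate the chance that two independent agents pick the same scheme.

    Each argument is a list of scheme labels sampled from one agent over
    repeated independent trials (hypothetical data, not from the paper).
    """
    if not choices_a or not choices_b:
        raise ValueError("both agents need at least one sampled choice")
    freq_a = Counter(choices_a)
    freq_b = Counter(choices_b)
    n_a, n_b = len(choices_a), len(choices_b)
    # Sum over labels of P_a(label) * P_b(label); a Counter returns 0 for
    # labels the second agent never chose.
    return sum((freq_a[label] / n_a) * (freq_b[label] / n_b) for label in freq_a)


# Invented example samples; the labels are placeholders, not real schemes.
agent_a = ["family_x", "family_x", "family_y", "family_x"]
agent_b = ["family_x", "family_y", "family_y", "family_x"]
print(match_probability(agent_a, agent_b))
```

Under this simplified view, agreement on a broad scheme family can be fairly likely while strict one-shot coordination remains rare, because a compatible channel would also require matching keys and parameters, each of which multiplies in a further agreement probability.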

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)

Cite as: arXiv:2606.28425 [cs.CR] (or arXiv:2606.28425v1 [cs.CR] for this version)

https://doi.org/10.48550/arXiv.2606.28425

arXiv-issued DOI via DataCite

Submission history

From: Christian Schroeder de Witt

[v1] Thu, 25 Jun 2026 19:42:39 UTC (2,088 KB)
