LLMs Corrupt Your Documents When You Delegate - Microsoft Research
LLMs Corrupt Your Documents When You Delegate
Philippe Laban
Tobias Schnabel
Jennifer Neville
April 2026
arXiv
Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust – the expectation that the LLM will faithfully execute the task without introducing errors into documents. We introduce DELEGATE-52 to study the readiness of AI systems in delegated workflows. DELEGATE-52 simulates long delegated workflows that require in-depth document editing across 52 professional domains, such as coding, crystallography, and music notation. Our large-scale experiment with 19 LLMs reveals that current models degrade documents during delegation: even frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT 5.4) corrupt an average of 25% of document content by the end of long workflows, with other models failing more severely. Additional experiments reveal that agentic tool use does not improve performance on DELEGATE-52, and that degradation worsens with document size, interaction length, and the presence of distractor files. Our analysis shows that current LLMs are unreliable delegates: they introduce sparse but severe errors that silently corrupt documents, and these errors compound over long interactions.