Microsoft Research: LLMs Corrupt your files during delegated work

LLMs Corrupt Your Documents When You Delegate

Philippe Laban

Tobias Schnabel

Jennifer Neville

April 2026

arXiv

Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust – the expectation that the LLM will faithfully execute the task without introducing errors into documents. We introduce DELEGATE-52 to study the readiness of AI systems in delegated workflows. DELEGATE-52 simulates long delegated workflows that require in-depth document editing across 52 professional domains, such as coding, crystallography, and music notation. Our large-scale experiment with 19 LLMs reveals that current models degrade documents during delegation: even frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT 5.4) corrupt an average of 25% of document content by the end of long workflows, with other models failing more severely. Additional experiments reveal that agentic tool use does not improve performance on DELEGATE-52, and that degradation severity is exacerbated by document size, length of interaction, or presence of distractor files. Our analysis shows that current LLMs are unreliable delegates: they introduce sparse but severe errors that silently corrupt documents, compounding over long interaction.
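To make "corrupting 25% of document content" concrete, here is a minimal sketch of one way to estimate how much of an original document survives a delegated editing session. It is an illustration under stated assumptions, not the DELEGATE-52 metric: the abstract does not say how corruption is scored, and the function name `corrupted_fraction` and the line-level comparison are hypothetical choices made for this example.

```python
import difflib


def corrupted_fraction(original: str, edited: str) -> float:
    """Return the share of the original's lines that are not preserved.

    Assumption (not from the paper): a line counts as preserved only if it
    appears unchanged, and in the same relative order, in the edited text.
    """
    orig_lines = original.splitlines()
    if not orig_lines:
        # An empty original has nothing that can be corrupted.
        return 0.0

    # autojunk=False disables difflib's popularity heuristic so that
    # frequently repeated lines are still matched in long documents.
    matcher = difflib.SequenceMatcher(
        a=orig_lines, b=edited.splitlines(), autojunk=False
    )
    preserved = sum(block.size for block in matcher.get_matching_blocks())
    return 1.0 - preserved / len(orig_lines)


if __name__ == "__main__":
    before = "title\nintro\nmethod\nresults"
    after = "title\nintro\nMETHOD (rewritten)\nresults"
    print(f"{corrupted_fraction(before, after):.0%} of original lines changed")
```

A score like this only makes sense for the parts of a document that a task was supposed to leave alone; lines the user asked to change would have to be excluded before comparing.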
