Information-flow control: Moving toward secure, autonomous agents

Towards secure, autonomous agents with information-flow control (IFC)

Source

Signal blog Official Microsoft Blog Microsoft On The Issues Asia Canada Europe, Middle East and Africa Latin America The Code of Us Conexiones

What's new today

Innovation

Digital Transformation

Sustainability

Security

Work & Life

Diversity & Inclusion

Unlocked

Microsoft 365 Azure Copilot Windows Surface XBOX Deals Small Business Support

Windows Apps Outlook OneDrive Microsoft Teams OneNote Microsoft Edge Moving from Skype to Teams

Computers Shop XBOX Accessories VR & mixed reality Certified Refurbished Trade-in for cash

XBOX Game Pass Ultimate PC Game Pass XBOX games PC games

Microsoft AI Microsoft Security Dynamics 365 Microsoft 365 for business Microsoft Power Platform Windows 365 Small Business Digital Sovereignty

Azure Microsoft Developer Microsoft Learn Support for AI marketplace apps Microsoft Tech Community Microsoft Marketplace Software companies Visual Studio

Microsoft Rewards Free downloads & security Education Gift cards Licensing Unlocked stories

View Sitemap

Search articles

Deep Dive

Information-flow control: Moving toward secure, autonomous agents

A deterministic security system, information-flow control (IFC) offers a promising path towards secure and autonomous agents.

By Santiago Zanella-BéguelinPrincipal Researcher, Microsoft, Shruti ToplePrincipal Researcher, Microsoft, Mark RussinovichAzure CTO & Technical Fellow, Microsoft, Aashish KolluriSenior Researcher, Microsoft, Boris KöpfSenior Principal Researcher, Microsoft, and Manuel CostaVP & Distinguished Engineer, Microsoft

When agents can take high-stakes actions like sending an email, sharing a business document, or opening a pull request, a single misstep has the potential to leak confidential data or hand control to an attacker that may then invoke tools that break security or cause damage. Today, we often manage that risk by putting a human in the loop to approve consequential actions. This scales poorly, erodes vigilance, and takes away the very autonomy that makes agents useful.

We lean on humans as a safeguard because the models driving agents behave stochastically, make mistakes, and could be steered by malicious content smuggled in through prompt injection. Despite progress in model alignment, contextual awareness, and content safety classifiers, security can’t depend solely on probabilistic mitigations. A good rule of thumb to keep in mind when designing an agentic system is that anything that an agent can do in response to a user prompt can also be accomplished by a model’s mistake or by an attacker with a prompt injection.

Anything that an agent can do in response to a user prompt can also be accomplished by a model’s mistake or by an attacker with a prompt injection.

A promising path towards secure and autonomous agents is through information-flow control (IFC) , a deterministic security system built on three simple steps:

Label data. Every piece of data that an agent ingests carries labels for integrity (for example, trusted or untrusted) and confidentiality (for example, public, confidential, or a read-access list such as {Alice, Bob, Charlie}).

Propagate labels. As data flows into the agent loop and derivative results are produced, labels travel with them. Derived data is labelled conservatively with the least upper bound of its sources: a result influenced by an untrusted input stays untrusted, and a result based on two documents is readable only by principals who could read both source documents.

Check before acting. Before each tool call, a policy engine inspects the relevant labels and decides whether to allow the action, block it, or ask a human to review it.

This turns a probabilistic system into one with guarantees you can audit. Because the policy engine relies on labels that an attacker can’t manipulate and is independent of the model’s judgement, it can enforce policies deterministically. The policy “untrusted data can never influence a consequential action” closes off prompt injection . The policy “data can only egress to destinations compatible with its confidentiality label” closes off data exfiltration . The user is consulted only when it genuinely matters—for example, when an action risks revealing information to someone who didn’t previously have access to it. The UI dialogs shown to the user can also be made more effective, highlighting the origin of untrusted data or what data is being shared more broadly and with whom.

In our past research, we showed how IFC can reduce the need for human intervention, increasing autonomy while offering deterministic security guarantees. In this post, we focus on how IFC can be integrated into real agentic systems based on GitHub Copilot CLI, the Microsoft Agent Framework, and the Model Context Protocol (MCP). We begin with two...

Information-flow control: Moving toward secure, autonomous agents

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

It's Not Just X. It's Y