Why Secure AI Needs Compile-Time Sandboxing

Why Secure AI Needs Compile-Time Sandboxing | Jo Secure Programming Language

Appearance

MenuReturn to top

Why Secure AI Needs Compile-Time Sandboxing June 11, 2026 · The Jo Team AI agents are becoming more capable at automating tasks by generating and executing code. But AI agents cannot be fully trusted. Earlier this year, OpenClaw deleted hundreds of emails from the inbox of Meta Superintelligence Labs' alignment director. Until an agent's actions are strictly bounded, we cannot use it on critical infrastructure or with sensitive data. What exactly is the untrusted code allowed to do — and who checked? The standard answer is runtime sandboxing: containers, seccomp filters, VMs. But they operate at the wrong level of abstraction. A container can stop a process from opening a socket; it cannot stop untrusted code from querying another user's rows through a database connection it legitimately holds. Runtime vs. Compile-Time Sandboxing Runtime sandboxing Enforced in the infrastructure, after deployment

Compile-time sandboxing Jo Enforced by the compiler, before code runs

✕ Blind to business logic Can block syscalls and files, but cannot express rules like "read only this user's rows" or "only these 2 narrowed REST APIs"

✓ Aware of business logic Rules like "read only this user's rows" or "only these 5 narrowed REST APIs" are typed capabilities the compiler enforces.

✕ Boundary buried in deployment stack Authority is scattered across configs and runtime parameters — auditing means digging through infrastructure.

✓ Boundary is typed, versioned code Authority is declared in typed interfaces — auditing means reviewing code in version control.

✕ Violations surface at runtime Escapes are discovered at runtime, after the code is already deployed.

✓ Violations are compile errors The compiler pinpoints them in source, with detailed errors — avoids unnecessary deployment on infrastructure.

A runtime sandbox speaks the operating system's language: processes, files, sockets. The rules worth enforcing need to be written in the application's language — which rows, which endpoints, which user's data — and a boundary can only enforce what it can express. Types are the only boundary that operates at the application level. Compile-Time Sandboxing, Applied to AI In a recent ACM Queue article, Safe Coding, Christoph Kern distills decades of Google's security engineering into a principle of rigorous modular reasoning: ... the safety of risky operations within an abstraction must rely solely on assumptions supported by the abstraction's APIs and type signatures. Conversely, the composition of safe abstractions with safe code (i.e., code free of risky operations, which constitutes the vast majority of a program) is automatically verified by the implementation language's type checker.

That is exactly what Jo's compile-time sandboxing is doing. In Jo, risky operations — filesystem access, network calls, FFI, database queries — need explicit permissions, visible in function type signatures as capabilities. In the following example, the AI-generated code is confined to the capabilities it is given: jointerface OrdersApi def query(lastDays: Int): List[Order] end

param ordersApi: OrdersApi

// Untrusted code (AI-generated): can use only what it received def aiMain(): Unit receives ordersApi, IO.stdout = val orders = ordersApi.query(30) printOrders: orders.select(o => o.state == "open") If aiMain tries to reach the filesystem, the network, or an unscoped query, the program does not misbehave at runtime — it fails to compile. The ordersApi capability is provided by the trusted harness — for example, as a user-scoped, read-only view over the real database. But the implementation is opaque to the AI code: all it ever sees is the OrdersApi interface. Alternatives Runtime Sandboxing + REST APIs A popular middle ground is to sandbox the agent and let it reach the world only through REST APIs. This helps, but it inherits a design mismatch: REST APIs were designed for trusted callers and they expose a large capability surface . A typical API token grants everything the user can do through the web interface — read all orders, change the account, delete records — far more than any single agent task needs. An agent is not a trusted caller. What it may call depends on the security context of the task at hand: one task should see only a small subset of the API surface; another needs an endpoint, but with its scope narrowed — this user's rows, read-only, the last 30 days. Existing REST APIs were not designed to answer such needs, so the restrictions end up in gateways and per-agent proxy policies — authority drifts back into infrastructure configuration, with all the audit problems above. With capabilities, the same narrowing is a few lines of trusted code: wrap the API in a smaller interface. MCPs MCP gives LLMs a catalog of vetted tools that can restrict LLM capabilities to application-defined security...

Why Secure AI Needs Compile-Time Sandboxing

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

It's Not Just X. It's Y

Show HN: GoPeek – open links in live mini browser windows without new tabs