AI agents as explicit state machines

Why AI Agents Should Be State Machines

A monolithic agent prompt is convenient until one prompt starts doing five jobs at once: routing, extraction, tool selection, formatting, and recovery. When the agent fails, the prompt does not tell you which job failed. It only returns another opaque output. I think this is where many agent systems need less prompting and more ordinary software architecture.

A state machine separates the runtime into explicit states, typed transitions, validators, and recovery paths. The LLM still handles local ambiguity inside a state; the surrounding system controls which state can run next. That boundary matters because LLM calls are stochastic. If a state emits malformed output, the next state should not have to guess what went wrong. The transition should fail loudly, attach a useful error, retry only when the failure is recoverable, and route to a human or dead-letter path when it is not.

TL;DR — Key Takeaways:

A monolithic agent prompt hides routing, extraction, tool selection, formatting, and recovery inside one probabilistic call.

A state machine makes those concerns explicit: states do work, transitions choose the next step, and contracts validate handoffs.

Each state can be tested and observed separately; malformed output fails at the boundary instead of contaminating later steps.

Model routing becomes possible once work is split by state, but economics should remain secondary to reliability.

Error handling becomes architecture: deterministic validators, bounded retries, fallbacks, and dead-letter paths.
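The error-handling takeaway can be sketched in a few lines. This is a stdlib-only illustration under stated assumptions: `run_with_recovery`, `RecoverableError`, `TransitionOutcome`, and the `DEAD_LETTER` state name are hypothetical names, not part of any library. The sketch retries only failures marked as recoverable, bounds the number of attempts, and routes everything else to a dead-letter path with the collected errors attached.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Any, Awaitable, Callable


class RecoverableError(Exception):
    """Hypothetical marker for failures worth retrying (e.g. malformed output)."""


@dataclass
class TransitionOutcome:
    next_state: str
    payload: dict[str, Any]
    errors: list[str] = field(default_factory=list)


async def run_with_recovery(
    state_fn: Callable[[], Awaitable[dict[str, Any]]],
    on_success: str,
    max_attempts: int = 3,
) -> TransitionOutcome:
    """Run one state with bounded retries; dead-letter when it cannot recover."""
    errors: list[str] = []
    for attempt in range(1, max_attempts + 1):
        try:
            payload = await state_fn()
            return TransitionOutcome(on_success, payload, errors)
        except RecoverableError as exc:
            # Recoverable: record the error and try again, up to the bound.
            errors.append(f"attempt {attempt}: {exc}")
        except Exception as exc:
            # Not recoverable: stop immediately instead of retrying blindly.
            errors.append(f"attempt {attempt}: fatal: {exc}")
            break
    # Retries exhausted or a fatal error occurred: hand off with context.
    return TransitionOutcome("DEAD_LETTER", {}, errors)
```

A caller would wrap each state invocation, for example `await run_with_recovery(lambda: classify(msg), on_success="EXTRACT")`, and then branch on `next_state`. Keeping the error list on the outcome means the dead-letter consumer sees every failed attempt, not just the last one.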

The Monolithic Prompt Failure Taxonomy

Entanglement. When a single prompt encodes routing logic, domain knowledge, and output formatting simultaneously, changing one dimension requires re-testing all others. I have seen agent prompts grow until nobody could change one instruction without revalidating the whole behavior surface.

Untestability. You cannot write a unit test for an LLM prompt that does five things at once. You can only run end-to-end evaluations and observe whether the emergent behavior stays stable. As I wrote in The End of Determinism, the stochastic nature of LLMs makes this problem structural, but a state machine gives you isolation boundaries for your evaluation suites.

Cost amplification. A monolithic agent sends the same context, tool schemas, and domain rules to the same model for every step. Classification, extraction, validation, and synthesis may need different levels of intelligence, but the monolith pays for the whole bundle every time.

Opacity. When an agent misbehaves, a monolithic prompt conflates routing, extraction, tool selection, and formatting failures into a single black box. A state machine emits structured logs at every transition, so the failing state is always identifiable.
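To make the opacity point concrete, here is a minimal sketch of a per-transition log record, using only the standard library. The field names (`run_id`, `from_state`, `to_state`, `ok`, `error`) and the `log_transition` helper are assumptions for illustration; nothing here prescribes a particular log schema.

```python
import json
import logging
import time
from typing import Optional

logger = logging.getLogger("agent.transitions")


def log_transition(
    run_id: str,
    from_state: str,
    to_state: str,
    ok: bool,
    error: Optional[str] = None,
) -> str:
    """Emit one JSON line per transition and return it for inspection."""
    record = {
        "ts": time.time(),
        "run_id": run_id,
        "from_state": from_state,
        "to_state": to_state,
        "ok": ok,
        "error": error,
    }
    line = json.dumps(record, sort_keys=True)
    # Failed transitions log at ERROR so they are easy to filter for.
    logger.log(logging.INFO if ok else logging.ERROR, line)
    return line
```

With records like this, locating the failing state for a run becomes a filter on `run_id` and `ok` rather than a reading of prompt transcripts.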

| Failure mode | Monolithic prompt | State machine |
| --- | --- | --- |
| Entanglement | All logic in one prompt; change anything, risk everything | Each state owns one concern; changes are surgical |
| Untestability | Only end-to-end evals possible | Per-state unit tests + contract validation |
| Cost | Frontier model for every call | Per-state model routing; budget where possible |
| Debuggability | Failure location unknown | Structured logs at every transition |
| Error handling | Ask the same prompt to recover | Retry, fallback, dead-letter per transition |
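The per-state testing row can be sketched with a stub client. Everything in this example is hypothetical: `StubClient`, its `complete` method, and the simplified `classify` state stand in for a real LLM client and a real contract library. The point is only that a single state can be exercised in isolation, including its failure path, without running the whole agent.

```python
import asyncio
import json
from typing import Any

ALLOWED_INTENTS = {"billing", "technical", "general", "unknown"}


class StubClient:
    """Hypothetical stand-in for an LLM client; returns a canned reply."""

    def __init__(self, reply: str) -> None:
        self.reply = reply

    async def complete(self, **kwargs: Any) -> str:
        return self.reply


async def classify(message: str, client: Any) -> dict[str, Any]:
    """One state: ask the model, then validate the contract before returning."""
    raw = await client.complete(user=message)
    data = json.loads(raw)  # raises ValueError (JSONDecodeError) on malformed JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    if data.get("intent") not in ALLOWED_INTENTS:
        raise ValueError(f"unexpected intent: {data.get('intent')!r}")
    confidence = data.get("confidence")
    if not isinstance(confidence, (int, float)) or not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be a number in [0, 1]")
    return data


def test_classify_accepts_valid_output() -> None:
    client = StubClient('{"intent": "billing", "confidence": 0.9}')
    result = asyncio.run(classify("I was charged twice", client))
    assert result["intent"] == "billing"


def test_classify_rejects_out_of_contract_output() -> None:
    client = StubClient('{"intent": "refund", "confidence": 0.9}')
    try:
        asyncio.run(classify("I was charged twice", client))
    except ValueError:
        return  # the boundary rejected the bad output, as intended
    raise AssertionError("expected the contract check to fail")
```

Both tests are deterministic because the model call is stubbed out; the stochastic behavior of a real model would still need evaluation runs, but those can now target one state at a time.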

Anatomy of an Agent State Machine

A production agent state machine has three primitives: states, transitions, and contracts. States do work. Transitions encode control flow. Contracts enforce the shape of data at every boundary.

Figure 1: Monolithic prompt versus state-machine decomposition. One opaque prompt becomes discrete, testable states.

    from __future__ import annotations

    from enum import Enum
    from typing import Any

    from pydantic import BaseModel, Field


    class TicketIntent(str, Enum):
        """Closed set of intents the classifier is allowed to emit."""

        BILLING = "billing"
        TECHNICAL = "technical"
        GENERAL = "general"
        UNKNOWN = "unknown"


    class ClassifyOutput(BaseModel):
        """Contract for the output of the classify state."""

        intent: TicketIntent
        confidence: float = Field(ge=0.0, le=1.0)
        reasoning: str


    class ExtractOutput(BaseModel):
        """Contract for the output of the extract state."""

        entities: dict[str, Any]
        ticket_id: str | None = None
        customer_tier: str = "standard"


    class StateResult(BaseModel):
        """What a state hands back to the runtime: where to go next, and with what."""

        next_state: str
        payload: dict[str, Any]


    # --- State definitions ---


    async def classify_intent(user_message: str, llm_client: Any) -> ClassifyOutput:
        """State 1: Classify, using a small classification model."""
        response = await llm_client.complete(
            model="small-classifier-model",
            system="Classify the user message into one of: billing, technical, general. "
            "Return JSON with intent, confidence (0-1), and reasoning.",
            user=user_message,
            response_model=ClassifyOutput,  # output is validated against the contract
        )
        return response


    async def extract_entities(
        user_message: str, intent: TicketIntent, llm_client: Any
    ) -> ExtractOutput:
        """State 2: Extract, using a constrained extraction model."""
        response = await llm_client.complete(
            model="structured-extraction-model",
            system=f"Extract structured entities from a {intent.value} support ticket. "
            "Return JSON with...
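One way to tie the three primitives together is a small driver loop. This is a sketch under stated assumptions, not the implementation behind the listing above: the state names, the `TRANSITIONS` and `REQUIRED_KEYS` tables, and the `run` function are hypothetical, and plain synchronous functions returning dicts stand in for the LLM-backed states so the example stays stdlib-only.

```python
from typing import Any, Callable

# Transitions: the only moves the runtime will accept, keyed by current state.
TRANSITIONS: dict[str, set[str]] = {
    "CLASSIFY": {"EXTRACT", "HUMAN_REVIEW"},
    "EXTRACT": {"DONE", "HUMAN_REVIEW"},
}
# Contracts: keys each state must have added to the payload before handoff.
REQUIRED_KEYS: dict[str, set[str]] = {
    "CLASSIFY": {"intent"},
    "EXTRACT": {"entities"},
}
TERMINAL = {"DONE", "HUMAN_REVIEW"}

StateFn = Callable[[dict[str, Any]], tuple[str, dict[str, Any]]]


def classify(payload: dict[str, Any]) -> tuple[str, dict[str, Any]]:
    # Stand-in for an LLM call; unknown intents are routed to a human.
    intent = "billing" if "charge" in payload["message"] else "unknown"
    next_state = "EXTRACT" if intent != "unknown" else "HUMAN_REVIEW"
    return next_state, {**payload, "intent": intent}


def extract(payload: dict[str, Any]) -> tuple[str, dict[str, Any]]:
    # Stand-in for an extraction call.
    return "DONE", {**payload, "entities": {"topic": payload["intent"]}}


STATES: dict[str, StateFn] = {"CLASSIFY": classify, "EXTRACT": extract}


def run(message: str, max_steps: int = 10) -> tuple[str, dict[str, Any], list[str]]:
    """Drive the machine until a terminal state, checking every handoff."""
    state, payload = "CLASSIFY", {"message": message}
    trace = [state]
    for _ in range(max_steps):
        next_state, payload = STATES[state](payload)
        # Contract check: fail at the boundary, not in a later state.
        missing = REQUIRED_KEYS[state] - set(payload)
        if missing:
            raise ValueError(f"{state} output missing keys: {sorted(missing)}")
        # Transition check: the state proposes, the runtime decides.
        if next_state not in TRANSITIONS[state]:
            raise RuntimeError(f"illegal transition {state} -> {next_state}")
        state = next_state
        trace.append(state)
        if state in TERMINAL:
            return state, payload, trace
    raise RuntimeError("step budget exhausted")
```

The design choice worth noting is that a state only proposes the next state. The runtime checks that proposal against the transition table, so a state that returns an unexpected target fails immediately instead of silently rerouting the run.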
