A Dumb Harness: Fundamentals of running coding agents on a loop

The Deck · Beontheloop · Beontheloop From Inference Loops to Long-Running Agents Fundamentals, Workflow, and What Actually Fits ← tap to go back tap for next →

A note before we startThese aren’t all my own ideas. This is an aggregate of the best thinking in this space — the engineers I learn from — folded together with my own experiments on what actually works and what doesn’t.

“text in”→LLM→“text out” An LLM is a function. Text in, text out. Stateless. No memory between calls.

An agent is a while-true loop that appends to an array. agent.py # What an agent actually is while True: user_input = get_input() response = llm.complete(user_input) if response.wants_tool: result = execute_tool(response.tool_call) response = llm.complete(result) print(response) Agent · ↻ while trueuser input→LLM↑↓ tool

→output

References Mihai Eric · The Emperor Has No Clothes: Claude Code in 200 Lines Geoffrey Huntley · fundamental skills and knowledge you must have in 2026 for SWE

And that array is the context window. ◆Every API call sends the entire array. ◆Each turn appends. ◆The model is stateless.

CONTEXT WINDOW · 200K · not to scalesystem prompt6.3k tool definitions9.5k CLAUDE.md2.5k user message0.4k assistant1.2k tool result3.1k unused context

The loop, in action. agent.py # this iteration takes the tool branchwhile True: user_input = get_input() response = llm.complete(user_input) if response.wants_tool: result = execute_tool(response.tool_call) response = llm.complete(result) print(response) Agent · log// agent ready · waiting for input

→user_input received“find all TODO comments in src/”

→LLM responds: wants_tool=truetool_call: bash(“grep -rn TODO src/”)

→executing tool · bash3 matches in auth.ts, api.ts, db.ts

→LLM responds with answer“Found 3 TODOs across the codebase”

→print(response) · loop iterates ↻

Context window · 200Ksystem prompt6.3k tool definitions9.5k CLAUDE.md2.5k skills0.9k user0.3k

assistant · tool_call0.5k

tool_result0.6k

assistant0.4k

~179k remaining · empty

The agent harness wraps the loop. Everything that isn’t the LLM — what tools exist, what context loads, when to stop.

Agent harness · Claude CodeAgent · ↻ while trueuser input→LLM↑↓ tool

→output

system prompt context mgmt skills & tools MCPs sub-agents plan mode session persistence permissions & hooks

the whole stackLoop.Array.Harness. An agent is a while-true loop appending to an array. The harness controls what’s in it.

If the context window is just an array, what goes in the array is everything.

The instruction ceiling is real. Frontier thinking models reliably follow ~150–200 instructions. Beyond that, even rules at the top get ignored. smaller models · exponential decay•frontier thinking · linear decay

Reference Dex Horthy · humanlayer.dev · arxiv:2507.11538

Smart zone, dumb zone. The window isn’t uniform. The first ~40%is where the model thinks clearly. Past that, attention frays — tool choice gets sloppy, instructions get dropped, the goal drifts. “The more context you use, the worse results you’ll get.” Dex Horthy / humanlayer.dev · Geoffrey Huntley

Context window · 200Ksmart zonedumb zone system prompt6.3k tool definitions9.5k CLAUDE.md2.5k skills0.9k ~180k remaining · fresh session

Reference Dex Horthy · escaping the Dumb Zone (#262)

The allocation problem. Static fills eat your usable space before the conversation starts. Static fills → smart zone shrinks.before the conversation even starts.

Context window · 200Ksmart zonedumb zone system prompt6.3k tool definitions9.5k CLAUDE.md2.5k skills0.9k ~180k remaining · ~60k smart available

The context rot problem. Nothing fails. Every call succeeds. It just fills up. Same window, same model → it rots.no errors. just volume.

Context window · 200Ksmart zonedumb zone system prompt6.3k tool definitions9.5k CLAUDE.md2.5k skills0.9k user · “implement feature X”0.3k assistant · read files0.5k tool_result · 4 files0.6k ~178k remaining · smart zone

Good context stays in the smart zone. Fresh session per task start clean, don't reuse a tired window Only what this task needs drop the MCPs and notes that aren't useful here Offload to disk save big stuff as files, keep short summaries in the window Send sub-agents for side quests let them explore, return one paragraph Leave room below the line finalizing work (tests, commits, lint) still has space Split big work across sessions when it won't fit one window, plan it, write the spec to disk, let multiple agents pick it up

Context window · 200Ksmart zonedumb zone system prompt · lean1.2k tool definitions · 4 tools2k spec.md · one task3k skills0.9k user · one clear goal0.3k assistant · tool_call0.5k tool_result1.5k assistant · tool_call0.5k tool_result2k assistant · “done”0.4k ~188k remaining · all above the line

Every session starts from zero. Context doesn’t engineer itself. Allocation, rot, compaction, recovery. Someone has to handle them.

Your harness wraps theirs. Anthropic ships the agent harness. You ship the layer...

A Dumb Harness: Fundamentals of running coding agents on a loop

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

It's Not Just X. It's Y

Show HN: GoPeek – open links in live mini browser windows without new tabs

Agent Memory: An Anatomy