Compaction in CC, Codex, and OpenCode

Compaction in CC, Codex, and Opencode | Lexifina FeaturesSecurityPricingBlogChangelogSign in

Blog/Research/Compaction

Compaction in CC, Codex, and Opencode Alan Yahya·June 26, 2026·3 min read

Compacting context allows agents to keep working for a long time without suffering from context rot. The premise is simple: for the current task, keep what matters, remove what does not (but index it so it can be looked up later), and give the model a clean view to continue working. Claude Code, Codex, and Opencode all take a different approach to this. What Compaction Needs To Do A good compaction system has to do five things: Notice when context is exceeding operational parameters for the given model. Pick a clear boundary. Replace expensive but recoverable data, like logs, file reads and tool output with a lookup artifact. Preserve what the agent still needs: goals, decisions, open questions, and relevant files. Apply the result to the runtime The compaction that the agent uses is not about creating a nice coherent summary. It’s about creating a usable continuation state, optimised for an LLM. Claude Code CC treats compaction as part of building the next model request. Before it asks the model to summarize anything, it tries cheaper moves first: trimming large tool outputs, replacing heavy data with references, reusing compact boundaries, and projecting a smaller view of history. CC actively tries to avoid summarising data where possible using a mechanical approach. The saves on tokens. But the tradeoff is complexity. Because CC keeps trying cheaper pre-call reductions, the next prompt can be shaped by a stack of reducers, reused boundaries, and compacted views. That makes it efficient, but harder to tell exactly which reducer changed what the model sees. Codex Codex treats compaction as a hard state change. Before compaction, the session has one active history. After compaction, it has a new one. This makes the runtime easier to reason about. Compaction has a trigger, produces a new state, records the transition, and future turns continue from that new state. The strength of this is in the lifecycle. This state transition lends itself to telemetry and optimisation. The risk is that too much weight can land on the summary step. If big recoverable payloads are still included in the compact request, Codex may summarize things that could have been reduced first. Opencode Opencode's current design is basically a summary plus a tail, but the tail is not arbitrary. It is selected against the remaining budget and turn boundaries, so the recent slice passed forward is exact and coherent. Nor does Opencode really replace the session history with a separate compacted artifact. The transcript remains the source of truth, and each model call is rebuilt as a projection of that state: the compaction request, a summary assistant message, the exact retained tail, and any newer messages. This works well because older history gets compressed while recent details stay precise. Opencode also does some cheap cleanup around compaction: it strips media, caps large tool outputs, skips old completed compactions, anchors the previous summary, and replaces compacted tool results with lightweight placeholders. In Summary CC emphasises avoiding unnecessary summaries. Codex emphasises making compaction explicit and durable. Opencode takes a middle path.

Previous Next

Compaction in CC, Codex, and OpenCode

Related Articles

US Government directive to suspend access to Fable 5 and Mythos 5

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

How to Earn a Billion Dollars