What's in a Codebase?

Brian Kihoon Lee

2026-03-23

Tagged: llms, strategy, software engineering

Does it ever make sense to rewrite your codebase from scratch?

For decades, the answer was an unambiguous no, ever since Joel Spolsky argued in 2000 that rewrites were the “single worst strategic mistake that any software company can make”.

In the era of coding agents, the cost of writing code has dropped dramatically, making it possible to rewrite your codebase from scratch every week, if you really wanted to. But “possible” and “makes sense” are not the same. In this essay I explore where the value of a codebase actually lies.

The compiler analogy

We’ve been here before - several times, actually. C codebases are roughly ten times shorter than the assembly they compile to, and the generated assembly is worth approximately nothing compared to the C codebase. Decades later, Python codebases are roughly ten times shorter than the equivalent C code, and few are weeping for the C codebases they replaced. A spec might be yet another ten times shorter than the Python code, with coding agents serving as the “compiler”.

At each level of compression, detail is necessarily lost (historically, the low-level implementation tricks required to extract maximally performant software). If you couldn’t tolerate that lossy compression, there was always the option of inlining assembly into C, or embedding C into Python. Today, coding agents fail to generate maximally simple code, often producing redundant copies of code, or tortuous data flows, instead of refactoring the underlying information architecture. Perhaps we’ll have to inline Python code into the spec.

The coding agent works well as a decompression algorithm because it contains humanity’s collective knowledge of different coding patterns, algorithms, and techniques. You can invoke that knowledge with a single word – if you happen to know the right word. To be effective at their jobs, agentic programmers of the future may have to learn an encyclopedia of programming patterns and techniques, along with when each one applies.

The compilation analogy extends even further - just as many build systems allow incremental recompilation of only the parts of your program that changed, you can imagine having an agent take a text diff of your updated spec and incrementally update an existing codebase, rather than rewriting it from scratch.
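
A minimal sketch of what the "diff" half of that loop could look like, assuming the spec is plain text whose sections start with `# ` headings. That layout, the function names, and the two toy specs are all hypothetical and only for illustration; the step where an agent acts on each diff is deliberately left out.

```python
import difflib


def split_sections(spec: str) -> dict[str, str]:
    """Split a plain-text spec into {heading: body}, treating '# ' lines as headings."""
    sections: dict[str, list[str]] = {}
    current = "(preamble)"  # hypothetical bucket for text before the first heading
    for line in spec.splitlines():
        if line.startswith("# "):
            current = line[2:].strip()
            sections.setdefault(current, [])
        else:
            sections.setdefault(current, []).append(line)
    return {name: "\n".join(body).strip() for name, body in sections.items()}


def changed_sections(old_spec: str, new_spec: str) -> list[str]:
    """Names of sections that were added, removed, or edited, in first-seen order."""
    old, new = split_sections(old_spec), split_sections(new_spec)
    names = list(dict.fromkeys([*old, *new]))  # ordered union of headings
    return [name for name in names if old.get(name) != new.get(name)]


def section_diff(old_spec: str, new_spec: str, name: str) -> str:
    """Unified diff for a single section, i.e. the only context an agent would need."""
    old = split_sections(old_spec).get(name, "").splitlines()
    new = split_sections(new_spec).get(name, "").splitlines()
    diff = difflib.unified_diff(old, new, f"old/{name}", f"new/{name}", lineterm="")
    return "\n".join(diff)


# Toy specs, invented for this example.
OLD_SPEC = "# Auth\nUse passwords.\n# Billing\nCharge monthly.\n"
NEW_SPEC = "# Auth\nUse passkeys.\n# Billing\nCharge monthly.\n# Export\nCSV only.\n"

for name in changed_sections(OLD_SPEC, NEW_SPEC):
    print(section_diff(OLD_SPEC, NEW_SPEC, name))
```

Like a build system's dependency graph, the hard part is not the diff itself but knowing which parts of the codebase depend on which section of the spec; this sketch says nothing about that mapping.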

Coding agents and specs

I’ve been using the word “spec” loosely, but what is a spec, actually?

One answer is an extensive test suite. We’ve seen a few examples of this already (vinext, chardet): given an exhaustive set of unit tests / API specs, an agent can rewrite the codebase, possibly in a completely different language or context. In response to these demos, some companies are considering pulling their unit tests from their open-sourced code – although I should note that an existing codebase can be fuzzed to regenerate a unit test suite, so you may as well pull the whole thing! SQLite is a notable outlier here - test code makes up about 99.8% of the project, and its most rigorous test harness, TH3, is proprietary even though the library’s source code is public.
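
To make the "fuzz it to regenerate a test suite" point concrete, here is a rough sketch of recording characterization tests from an existing function and replaying them against a rewrite. Everything named here (`legacy_parse_port`, `record_cases`, `check_cases`, `random_inputs`) is hypothetical, and a real fuzzer would generate inputs far more cleverly than this.

```python
import random
from typing import Any, Callable, Iterable, Iterator


def legacy_parse_port(text: str) -> int:
    """Stand-in for existing code whose behavior we want to capture."""
    port = int(text)
    if not 0 < port < 65536:
        raise ValueError("port out of range")
    return port


def observe(fn: Callable[[Any], Any], arg: Any) -> tuple[str, Any]:
    """Run fn once and summarize what happened: a return value or an exception type."""
    try:
        return ("ok", fn(arg))
    except Exception as exc:
        return ("raises", type(exc).__name__)


def record_cases(fn: Callable[[Any], Any], inputs: Iterable[Any]) -> list[tuple[Any, tuple[str, Any]]]:
    """Record (input, observed behavior) pairs; these pairs are the regenerated tests."""
    return [(arg, observe(fn, arg)) for arg in inputs]


def check_cases(fn: Callable[[Any], Any], cases: list[tuple[Any, tuple[str, Any]]]) -> list[Any]:
    """Replay recorded cases against another implementation; return inputs that differ."""
    return [arg for arg, expected in cases if observe(fn, arg) != expected]


def random_inputs(n: int, seed: int = 0) -> Iterator[str]:
    """Naive input generator: numeric strings around the valid range, plus some junk."""
    rng = random.Random(seed)
    for _ in range(n):
        yield rng.choice([str(rng.randint(-10, 70000)), "abc", ""])


# Record behavior from the existing code, then hold a "rewrite" to it.
cases = record_cases(legacy_parse_port, ["80", "0", "abc", *random_inputs(50)])
print(check_cases(legacy_parse_port, cases))  # the original always agrees with itself
print(check_cases(int, cases))                # a naive rewrite that drops the range check
```

Note that the recorded cases capture whatever the old code does, bugs included, which is exactly why they can stand in for a spec.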

One notable failure of this approach is Anthropic’s C compiler exercise, in which the agent succeeded in writing a C compiler that compiled Linux for several architectures (wow!), but due to a lack of clean internal abstractions, it wasn’t likely to compile anything else, and it had major performance shortcomings.

Perhaps what that attempt needed to complement the unit tests was a design doc, with key architectural decisions laid out. This would provide the core of the software, while the unit tests covered the periphery.

Still, we’re missing detail. What about comments, like ## This call is expensive - only invoke when X is true, or the wisdom embedded within historical commit messages? What about the bugfixes, feature requests, and performance fixes recorded in issue trackers or version release notes? Q&A knowledge bases, FAQs, and user-facing manuals contain info about user-facing edge cases and their current or desired resolution. Simply scraping this content would be futile - perhaps 1% would actually be valuable, and the rest would be obsolete, redundant with the spec, or mutually contradictory.

You could drop this level of detail from the spec and gain incredible feature velocity, but that would result in buggy, nonperformant software with only two nines of reliability. Maybe every developer in the world would use it anyway, who knows? (shrug)

Codebases coevolve with people

To expand the definition of “spec” even further: even treating the codebase itself as the spec still leaves the software underspecified, in many ways.

Codebases exist alongside people: the engineers, of course, but also the on-call, the end user, the support team, and so on.

Software that’s often used on the go will develop tolerance to flaky internet connections. Software that’s used intimately by a small...
