There Is No Reward Function for Meaning

There Is No Reward Function for Meaning | Aaron AngThere Is No Reward Function for Meaning 27 Jun, 2026 We Only Celebrate Progress Once A century ago, sending word to another continent meant waiting weeks for a ship to cross the ocean. Today, we complain when a video call lags for half a second. For some reason, we celebrate a breakthrough once, then it quietly becomes the baseline. Smartphones condensed cameras, maps, encyclopedias, music players, and libraries into devices that fit in our pockets, yet we’re quicker to notice a low battery than to appreciate all other aspects of convenience they bring to our lives. Once something remarkable becomes routine, we stop noticing it at all. I think we’ve fallen into the same pattern with AI. The “strawberry” meme is a good example: we made fun of language models when they confidently miscount the number of r ’s in the word strawberry. It became a simple, memorable way to argue that AI wasn’t really intelligent. At the same time, those same models could explain advanced mathematics, summarize research papers, write working software, and help millions of people learn new skills. Somehow the mistake became more remarkable than the capabilities. But the joke is on us. We normalize competence almost instantly and grow increasingly sensitive to failure. Once something consistently exceeds our expectations, we stop rewarding it for doing so. We simply raise the bar and wait for the next mistake. Intelligence Isn’t the Impressive Part After a year of working with coding agents, I’ve come to appreciate that their most interesting quality isn’t raw intelligence. It’s their willingness to iterate. They produce code that won’t compile, misread requirements, reach for the wrong approach. Then they read the error, revise, rerun the tests, and try again. Then again. Sometimes dozens of times before creating something of value. Watching the process has made me realize that progress often depends less on getting the first answer right than on continuously incorporating feedback. Humans aren’t always as good at this. Once we’ve invested time in an idea, it becomes difficult to abandon it, and our pride unconsciously folds hypotheses into identities. We defend decisions because they are ours rather than because they are correct. Machines don’t have that problem. They don’t care whether yesterday’s idea survives the afternoon; if the feedback says the approach isn’t working, they move on. There’s no ego to protect, and from the outside, it looks a lot like intellectual humility. The Importance of Having an Objective Working with coding agents has also changed how I think about intelligence itself. I’ve found that their performance depends surprisingly little on clever prompting and surprisingly much on the quality of the objective I give them. If I clearly define the problem, explain the important tradeoffs, enumerate the edge cases, and provide a reliable test harness, they can often complete ninety percent of the implementation with very little intervention. In those situations the bottleneck usually isn’t the model. It’s my ability to specify what success actually looks like. The remaining ten percent is almost never about correctness. It’s about judgment. Should this abstraction live here or somewhere else? Should the implementation optimize for readability or performance? Is documentation more valuable than self-explanatory code? Where should the single source of truth live? None of these questions have objectively correct answers. They are decisions shaped by experience, context, and instinct. That distinction has become one of the most important lessons AI has taught me. Coding agents thrive because software engineering provides unusually clear feedback: every compilation, test suite, benchmark, and code review tells the system whether it’s moving closer to or farther from the objective, and the objective itself is rarely in dispute. I don’t think this is unique to programming, either. Mathematics, engineering, and much of science share the same property. Success is measurable, and reality answers with a clear “yes” or “no.” Those are exactly the environments where optimization shines. But Life Doesn’t Work That Way Then I think about the questions that bother us and have a long-term impact on our lives. Whether to forgive someone who never apologized. Whether to leave a good job for one that might matter more. When, or whether, to have children. None of these are hard for lack of information. They are hard because there isn’t a universally agreed definition of success. There is no compiler error for purpose, no benchmark for wisdom, and no test suite that tells us whether we’re becoming the person we hoped to be. Instead, we navigate through experience, relationships, culture, memory, and intuition. Two thoughtful people can...

There Is No Reward Function for Meaning

Related Articles

Is AI ruining our skills? Early results are in – and they're not good

The Anatomy of an AI-Native Org

Apertus – Open Foundation Model for Sovereign AI

How to Earn a Billion Dollars

Italy's Meloni says Trump 'made up' story that she 'begged' him for photo at G7