My Mental Model of Learning

My mental model of learning | German Capuano

My current employer has a seemingly innocuous question in its job applications: "What’s your most memorable meal?" It's an opportunity for applicants to tell us about their personality. These days, most applicants lose points for obviously AI-generated answers. The following is one of the common variants.

"My most memorable meal was at ; someone had made a . Nothing about it was ——but ."

I tested the prompt and noticed how difficult it is to get something that sounds remotely human. Even after detailed prompts and multiple corrections, I barely got something passable. This issue lingered in my mind for a long time and eventually shaped my mental model of learning. It also gave me a habit. Now when I enter a new field, I try to capture my early ideas before expertise edits them out of me. That habit has helped me stay creative and produce output even when entering deep rabbit holes. Here I’ll do my best to explain why.

Knowledge collapses the search space

Here’s another anecdote. My PhD thesis advisor once told me that I was spending too much time reading papers, and that I was going to learn the bad ways to solve the problem. He also told me that many of the top researchers at Caltech couldn’t care less about what most people write. The advice sounded strange because research is supposed to build on what is already known.

Let’s assume my advisor was right. How is it possible to push the knowledge boundary forward without first reaching it? My working hypothesis is that knowledge doesn't simply give you new options. It also eliminates some of them. More precisely, knowledge constrains the distribution of ideas we are likely to produce.

When trying to solve a problem, there's a distribution of ideas or solutions that we can come up with. In the figure, that distribution is represented by the blue blob. Some of those ideas will produce good solutions, represented in green. A much smaller subset are the optimal solutions, represented in red. To make the figure cleaner, all the blobs are drawn in similar sizes. But the probability of a good idea is much smaller than the probability of a bad one. And the probability of an optimal idea is just a drop in the bucket. You can see this whenever you write. There are countless sentences you could put next, many that would be grammatical, fewer that would be good, and very few that would say exactly what you mean.

When we learn a skill or a way to solve problems, that knowledge biases us toward what we already know. Eventually it becomes our natural answer. In a way, we collapse the range of possible ideas into a constrained region. In the figure, the constraint is the dashed ellipse. It sits inside the green blob, which means we can now reliably produce good answers. But it also doesn't overlap with the red blob of better solutions, and it prevents us from reaching unusual possibilities. Thinking outside of that box becomes harder. We get more reliable answers at the cost of variation, and possibly creativity. The same thing happens in writing. Grammar and taste help us produce sentences that work, but they can also quietly censor the ideas that first sound wrong. This may be one reason why, as one article puts it, "as researchers age, they produce less disruptive work."

Generalization reopens the search space

Some people do learn almost everything in a field and still produce interesting research, often in the form of generalized theories. To make sense of this, I need one more oversimplification.

Consider a researcher who knows of two different approaches to solving a problem. These could be known methods from the same field, or ideas borrowed from different fields. In the figure, these methods are represented as two separate constraints. The researcher can draw solutions from either one. But after thinking about them and understanding the commonalities, they may be able to produce a single framework that covers both cases. The larger dashed blob in the figure is this generalization. It contains the ideas in the two constrained spaces, but it is not limited to them. It also lets the researcher try things in between, and sometimes extrapolate to new approaches. Abstracting the two methods relaxed their mental constraints and increased the range of accessible ideas. Maybe this higher level of thought can help them find better solutions than those previously known.

Knowledge also helps when we move into complex topics that involve many steps. Trying to solve all the steps at once is fragile. Roughly speaking, each unreliable step compounds the risk of failure. For example, if each of four steps independently has a 10% chance of success, the probability of reaching a good solution is 0.1^4, or 0.01%. Knowledge lets us turn hard and unreliable steps into easier and reliable ones, which allows us to tackle more difficult and important problems.
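To make the compounding concrete, here is a minimal sketch of the arithmetic. It assumes the steps succeed or fail independently, so the chance of getting through all of them is the product of the per-step chances. The function name `chain_success` and the 90% "reliable" figure are illustrative assumptions, not numbers from the essay; only the 10% and four-step case comes from the text.

```python
import math


def chain_success(step_probs):
    """Chance that every step succeeds, assuming the steps are independent."""
    return math.prod(step_probs)


# Four unreliable steps at 10% each (the example from the text).
unreliable = chain_success([0.10] * 4)

# Hypothetical comparison: the same four steps once knowledge has made
# each one 90% reliable (an assumed figure, chosen only for illustration).
reliable = chain_success([0.90] * 4)

print(f"four steps at 10% each: {unreliable:.2%}")
print(f"four steps at 90% each: {reliable:.2%}")
```

Under these assumptions the chain goes from roughly a 1-in-10,000 shot to succeeding about two times out of three, which is the sense in which reliable steps make longer problems tractable.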

So I arrive at the "uncontroversial" conclusion that knowledge is good, without...
