The Complete Guide to Caveman - AI For Developers
AI For Developers
SubscribeSign in
The Complete Guide to Caveman<br>How to Cut Claude Code Token Usage Without Losing a Single Brain Cell
AI For Developers<br>Jun 29, 2026
23
Share
Someone posted on Reddit earlier this year with the title: Taught Claude to talk like a caveman to use 75% less tokens. Most people assumed it was a joke. It got 10,000 upvotes. Half the comments were laughing. The other half were already installing it.<br>I installed it too. And after a few weeks of actual daily use, I think the joke is hiding something genuinely worth understanding — not just as a productivity trick, but as a signal about where AI-assisted development is heading.<br>Let me break it down properly.
What Caveman is and what it isn’t
Caveman is a plugin for Claude Code, built by a developer named Julius Brussee . The premise is disarmingly simple: it makes Claude respond in compressed, telegraphic language. No articles. No pleasantries. No Great question! Let me explain that for you. Just the answer, stripped to bone.<br>Here’s the same response to fix the null check on line 42 — before and after:<br>Normal Claude:<br>The issue you’re seeing on line 42 is that we’re not checking whether the user object exists before trying to access its properties. I’ll add a null guard there to make sure we handle that case safely. Here’s what I changed and why it should fix the problem you were experiencing...
Caveman mode:<br>L42: null guard added. user?.name — safe now. Done.
The code fix is identical. The logic is the same. What changed is the verbal packaging around it and that packaging is pure cost. Tokens you pay for, latency you wait through, context window you burn, all without getting a better answer.<br>What makes Caveman interesting is what it doesn’t touch: reasoning and thinking tokens. The model still thinks the same way. It just stops narrating the thinking out loud. Brain still big. Mouth small.<br>Thanks for reading! Subscribe for free to receive new posts and support my work.
Subscribe
Installing it
The standard way (Claude Code plugin)
Open your terminal and run two commands:<br>claude plugin marketplace add JuliusBrussee/caveman<br>claude plugin install caveman@caveman<br>Restart Claude Code and you’ll see a small [CAVEMAN] ⛏ badge in the statusline confirming it loaded. Done.<br>If the first command hangs: don’t kill it. It’s fetching from GitHub and can take 30–40 seconds on a slow connection. If it keeps failing, skip to the curl installer below, it’s more reliable.
Windows (without WSL2)
Use PowerShell:<br>irm https://raw.githubusercontent.com/JuliusBrussee/caveman/main/install.ps1 | iex<br>The universal installer (recommended if you use multiple agents)
If you work across Claude Code, Cursor, Codex, Gemini CLI, Cline, or Copilot, this one command auto-detects all of them and installs Caveman for each:<br># macOS / Linux / WSL<br>curl -fsSL https://raw.githubusercontent.com/JuliusBrussee/caveman/main/install.sh | bashIt skips agents you don’t have installed and is safe to re-run. Takes about 30 seconds. Needs Node ≥18. Full per-agent details are in the INSTALL.md.<br>No plugin system? Use CLAUDE.md
You can get most of the benefit without installing anything at all. Add these lines to your global CLAUDE.md file (or your project’s CLAUDE.md):<br>## Communication Style<br>Respond like a caveman. No articles, no filler words, no pleasantries. Short. Direct. Code speaks for itself.This works on any agent that reads context files — Cursor, Cline, Copilot. Less structured than the full plugin, but surprisingly effective.
Using it: the six modes
Once installed, you activate Caveman inside Claude Code with /caveman. There are six intensity levels, and switching between them mid-session is completely fine:<br>Command What it does When to use it /caveman lite Drops pleasantries and filler only. Full grammar stays. Exploring unfamiliar code, learning something new /caveman Default. Fragment sentences, articles dropped. The everyday workhorse for most developers /caveman full Same as default, just explicit. When you want to be sure it’s fully on /caveman ultra Maximum compression. Notes-like, heavy abbreviations. Deep-flow sessions where you know the domain cold /caveman wenyan Classical Chinese compression patterns — the most token-efficient written form humans ever developed. Mostly a curiosity; impractical for most English-language work /caveman off Back to normal Claude for the rest of the session. When you’re genuinely lost and need full explanations<br>One behavior worth knowing: Caveman pauses automatically if it detects you’re confused or asking the same question twice. It temporarily switches back to fuller explanations, then resumes compression once you’re back on track. You don’t manage this manually — it just works.<br>Matching the mode to the task
This is where most people go wrong. They crank it to ultra on day one, find the responses hard to parse, and abandon it.<br>Use lite when you’re debugging...