We Reined In AI Agents With pre-commit


A lot of teams now seem to rely on AI code review bots to review code written by AI. That does work, to a point, but there is something slightly absurd about paying for a second model to inspect the output of the first when older tools like pre-commit already did a good job of catching many classes of mistakes early, locally, and without expensive licenses.

We do use CodeRabbit on our pull requests, but we also wanted to catch code smells much earlier: in pre-commit hooks on developer machines, and again in GitHub pipelines before bad patterns have a chance to settle in.

So we added a guard in Merrilin to rein in exactly that behavior. It uses tree-sitter to reject fragile error-handling patterns before agent-written code gets committed. It does not just search for HTTPException or catch. It parses Python, TypeScript, and TSX into syntax trees and asks much more interesting questions:

- Did someone raise a raw HTTPException instead of a typed API error?
- Did they catch a database exception and then keep going without a rollback or savepoint?
- Did they write reader progress in a critical path without a visible recovery boundary?
- Did frontend code read error.response.data.detail directly instead of using the shared normalizer?
- Did someone add a .catch() that only logs to console and silently swallows the failure?

There is a certain wonder in watching a machine produce so much working code so quickly. There is also a certain exhaustion in watching it rediscover the exact same bad ideas at scale. This guard has been one of the most effective ways we have found to keep AI-assisted development inside the boundaries of our system design.

Why we built it

Merrilin is an AI reading companion. That means we have one non-negotiable invariant: do not break reading ever.

Optional systems can fail: AI can fail, telemetry can fail, sync can fail, analytics can fail. But opening a book, turning a page, saving progress locally, and resuming offline still need to work.

The problem is that AI agents are fantastic at reproducing tiny error-handling shortcuts that look harmless in isolation:

- raise HTTPException(...)
- except Exception: logger.exception(...)
- except IntegrityError: pass
- catch (error) { console.error(error) }
- throw new Error("something went wrong")

Individually, these are easy to rationalize. Collectively, they create outages, inconsistent client behavior, poisoned transactions, and impossible-to-centralize error semantics, and they are exactly the kind of thing an AI agent will keep doing unless you give it a hard boundary. We wanted something stricter than prompt instructions and more precise than regexes, so we put the rules in code.

Where it lives

The error-handling guard matters most here, but it sits inside a broader pre-commit setup that gives AI-generated diffs fewer places to hide. Our .pre-commit-config.yaml also enforces:

- protected branch safety with no-commit-to-branch
- basic hygiene checks like JSON/YAML/TOML validation, merge-conflict detection, symlink checks, AST validation, private-key detection, whitespace cleanup, and line-ending normalization
- Conventional Commit messages at commit-msg time
- Alembic migration naming and single-head checks
- backend Ruff linting and formatting
- Prettier for JS, TS, and Markdown
- ESLint for the web and admin apps

That matters because AI agents rarely fail in only one dimension. The same model that invents a broad except Exception will also happily leave formatting drift, hand-write invalid Alembic migrations, or produce inconsistent commit messages unless your repo pushes back.
The error-handling guard itself is wired into .pre-commit-config.yaml:

    - id: error-handling-patterns
      name: Guard centralized error handling patterns
      entry: uv run --project backend --extra dev python scripts/check_error_handling_patterns.py --changed-lines
      language: system
      files: ^(backend/app/.*\.py|apps/(web|mobile|admin)/src/.*\.(ts|tsx))$
      stages: [pre-commit]
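The hook entry above runs scripts/check_error_handling_patterns.py, whose contents are not shown here. To give a feel for what a syntax-tree check looks like compared with a text search, here is a minimal sketch that covers only two of the patterns for Python. It is not our actual script: it swaps in Python's standard ast module for tree-sitter so it runs with no dependencies, the function name find_error_handling_smells is hypothetical, and the changed_lines parameter is only a guess at what a flag like --changed-lines might do (report findings only on lines touched by the diff).

```python
import ast


def find_error_handling_smells(source, changed_lines=None):
    """Return sorted (line, message) pairs for two fragile patterns.

    Hypothetical helper for illustration; it uses the stdlib ast module
    rather than tree-sitter, and handles Python source only.
    """
    findings = []
    for node in ast.walk(ast.parse(source)):
        # Pattern 1: `raise HTTPException(...)` or `raise pkg.HTTPException(...)`.
        if isinstance(node, ast.Raise) and node.exc is not None:
            target = node.exc.func if isinstance(node.exc, ast.Call) else node.exc
            name = getattr(target, "id", None) or getattr(target, "attr", None)
            if name == "HTTPException":
                findings.append((node.lineno, "raw HTTPException raised"))
        # Pattern 2: an except handler whose entire body is `pass`.
        if isinstance(node, ast.ExceptHandler):
            if all(isinstance(stmt, ast.Pass) for stmt in node.body):
                findings.append((node.lineno, "exception swallowed with pass"))
    # Assumed behavior: optionally keep only findings on changed lines.
    if changed_lines is not None:
        findings = [item for item in findings if item[0] in changed_lines]
    return sorted(findings)


SAMPLE = '''\
def create_user(db, payload):
    try:
        db.add(payload)
    except IntegrityError:
        pass
    if payload is None:
        raise HTTPException(status_code=400, detail="bad payload")
'''

for line, message in find_error_handling_smells(SAMPLE):
    print(f"line {line}: {message}")
```

Because the check walks syntax nodes, a mention of HTTPException in a comment, a string, or an import does not trigger it, which is the kind of precision a regex struggles to provide. A real guard would need many more rules, including the rollback, savepoint, and frontend checks listed earlier.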

Taken together, these hooks keep the codebase clean, the schema history sane, and the commit history legible, while the tree-sitter guard keeps the agent from reintroducing reliability bugs you already learned not to ship.

The Alembic hooks deserve special mention. Since Feb 7, we have landed roughly 331 PRs in this repo. In that same stretch, we have touched backend/alembic/versions in 93 commits, and the folder currently contains 81 migration files. There is something exhilarating about shipping that fast. There is also the quiet fatigue of realizing your migration history has become a place where humans and agents can both make a mess with full confidence.

In that kind of environment, AI agents start getting overconfident. They see a pile of migrations, infer a human numbering convention that was never really meant to be one, and begin writing new files by hand with integer prefixes like...
