Tuningfork – LLM agent grounding rules derived from human reality-testing

GitHub - T-Chartrand/tuningfork: Grounding rules for LLM agents, derived from human reality-testing · GitHub

/" data-turbo-transient="true" />

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Clear

Search syntax tips

Provide feedback

--> We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

/;ref_cta:Sign up;ref_loc:header logged out"}" Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

T-Chartrand

tuningfork

Public

Notifications You must be signed in to change notification settings

Fork

Star

main

BranchesTags

Go to file

CodeOpen more actions menu

Folders and files NameNameLast commit message Last commit date Latest commit

History 15 Commits 15 Commits

docs

examples

src/tuningfork

tests

.gitignore

LICENSE

README.md

pyproject.toml

View all files

Repository files navigation

tuningfork

Grounding rules for LLM agents, derived from human reality-testing.

Humans who must routinely distinguish real perception from convincing internal fabrication have spent decades refining practical checks for it. Those checks turn out to map directly onto the agent hallucination problem — often more cleanly than the framings in the ML literature. tuningfork is that mapping, written down as nine rules and shipped as a small, dependency-free Python reference implementation.

The name comes from one of those human techniques: a physical tuning fork held to the ear interrupts auditory hallucination through an independent channel. It doesn't argue with the false signal — it breaks the state. That is the design principle of this entire library.

The core insight

A check terminates when the verifier sits outside the system being doubted.

A model re-reading its own output shares its own failure modes — it can fluently confirm its own fabrication. A grep, a parser, a checksum, an exit code cannot. One deterministic confirmation from an independent channel is final; a hundred same-model re-checks are not. Everything here follows from that: the environment is the source of truth, and the model's memory is a cache that may be stale.

The nine rules

Rule Phase One-liner

G0 Asymmetric Trust governs all Content can convict, but never acquit — trust flows from source-tracing only

G1 Verify-Before-Assert foresee A claim that could be tool-checked must be, before it's stated

G2 Closed-Loop Execution recognize Report observed results, never issued commands. Read-only observations are terminal

G3 Disagreement Triangulation recognize Tool beats memory; one independent check on surprises; one deterministic confirmation is final

G4 Negative-Space Probing foresee Probe for existence before relying on remembered entities; keep a catalog of known fabrication signatures

G5 Reproducibility Snapshot snap out After a correction, rebuild state from tool output only — nothing from the broken narrative carries over

G6 Cost-Tiered Budget continuous Tier verification by blast radius, decided before generation; suspiciously perfect claims get their tier raised

G7 Passive Independent Validators continuous Cheap deterministic monitors run on everything and never ask the generator's permission

G8 Source Re-attribution after the verdict A verified-false output is evidence about the generator — mine it; belief and action are decoupled

Full text with rationale: docs/framework.md · The story behind it: docs/essay.md

Quick start

pip install -e .

from tuningfork import (GroundedAgent, ValidatorBank, CitationValidator, PathValidator, JsonBlockValidator)

bank = ValidatorBank([ CitationValidator(valid_source_ids=["1", "2", "3"]), PathValidator(evidence_paths=tool_returned_paths), JsonBlockValidator(), ])

agent = GroundedAgent(generate=my_llm_callable, bank=bank) result = agent.run("Summarize sources [1]-[3] and list the config files involved.")

print(result.tier.rationale) # how the claim was priced before generation print(result.report.summary()) # what the independent channels observed print(result.trustworthy) # validators' verdict, not the model's

The harness permits exactly one regeneration pass on validator failure — fed the validator evidence, not an apology prompt. A second failure is reported as unresolved, because retrying the same channel is re-checking the check.

The child agent

v0.3.0 adds a small runnable agent with the overlay on: an...

Tuningfork – LLM agent grounding rules derived from human reality-testing

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

It's Not Just X. It's Y