Token-saviour – routing skill for AI agent tool selection (~70% fewer tokens)

skills/token-saviour/SKILL.md at main · vagkaratzas/skills · GitHub

//blob/show" data-turbo-transient="true" />

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Clear

Search syntax tips

Provide feedback

--> We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

//blob/show;ref_cta:Sign up;ref_loc:header logged out"}" Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

vagkaratzas

skills

Public

Notifications You must be signed in to change notification settings

Fork

Star

FilesExpand file tree

main

/SKILL.md

Copy path

Blame More file actions

Latest commit

History History History

151 lines (118 loc) · 8.13 KB

main

/SKILL.md

Copy path

Top

File metadata and controls Preview

Code

Blame

151 lines (118 loc) · 8.13 KB

Raw Copy raw file Download raw file

OutlineEdit and raw actions

name token-saviour

description Pick the most token-efficient tool for a coding task instead of reflexively reading whole files or dumping raw command output into context. Use this skill BEFORE you explore or explain a codebase, locate a symbol/definition/callers, trace a call path, map an architecture, plan a feature across layers, or run verbose commands (tests, builds, git, grep, directory listings) — and whenever context/token budget, cost, or "make this use fewer tokens" comes up. It routes the scenario to serena, graphify, rtk, caveman, or plain tools, with concrete commands and the combinations to use vs. avoid. Reach for it even when the user doesn't name a tool: if you're about to `cat`/Read several files to answer a question, that's the trigger.

token-saviour: spend tokens where they matter

Reading whole files to answer a narrow question is the single most wasteful thing an agent does. On a benchmark over a ~30-file Python app, swapping whole-file reads for semantic retrieval cut total tokens ~66% ; the other tools each own a narrower slice. This skill helps you reach for the right one before you blow the context budget — then get back to the actual task.

The mental model: token cost has four independent layers , and a different tool owns each. Match the tool to the layer the task actually stresses.

Layer What it is Tool Don't bother with

Code-read input Understanding code: symbols, callers, call paths, architecture serena or graphify rtk, caveman

Command-output input Verbose stdout: tests, builds, git, grep, listings rtk serena, graphify

Generated output Your own long, chatty replies caveman the input tools

Tiny/one-off work plain Read/Grep/Bash everything (overhead > benefit)

First, check availability. These are optional third-party tools. Run the relevant --help/--version once; if a tool isn't installed, fall back to the next-best option in its row (ultimately plain Read/Grep/Bash). Never pretend a tool ran — degrade gracefully.

Decision guide

Work top-down. The moment a row matches, use that tool and stop.

"Where is X / what calls X / how does A reach B / what breaks if I change X?" → graphify (a queryable code graph). It shines at navigation and impact: callers, call-path tracing, cross-module flow. Measured: tracing a 4-layer call path cost 65 tokens vs 1,633 reading the files (−89%).

"Explain this module/class, list its methods, summarize the architecture" — or you're about to edit code by symbol → serena (live LSP symbols + semantic edits). It shines at broad comprehension and editing: symbol overviews across many files, and refactors/renames that graphify can't do. Measured: explaining the whole architecture cost 243 tokens vs 3,250 (−85%); one class's methods, 55 vs 458 (−71%).

Running something noisy — test suite, build, linter, git, grep, a big directory listing → rtk (compresses command output). Measured: the test run −65% , the structure listing −38% . Its win scales with verbosity, so it's biggest on failing test dumps and long build logs.

About to write a long, prose-heavy answer (explanations, write-ups, status) → consider caveman (terse "caveman" reply style). It trims filler/hedging/pleasantries. Small on already-concise text and it can grow terse list/structured output by adding markdown — so use it for genuinely chatty output, not for short factual answers or anything where exact wording/order matters (warnings, irreversible steps, ordered procedures).

None of the above / a one-file, one-line lookup → just use plain Read/Grep/Bash. The tools have setup and call...

Token-saviour – routing skill for AI agent tool selection (~70% fewer tokens)

Related Articles

The Newest Instagram "Exploit" Is the Goofiest I've Seen

Apple WWDC 2026 Livestream

Claude Fable 5

US Government directive to suspend access to Fable 5 and Mythos 5

It's Not Just X. It's Y