I built an npm package to stop AI agents from burning through API credits

budget-agent

0.4.8 • Public • Published 12 minutes ago • 0 Dependencies • 0 Dependents • 6 Versions

Stop runaway AI agents from burning through your API credits. Track cost, tokens, runtime, and steps. Enforce hard budget limits for OpenAI, Anthropic, LangGraph, LangChain, OpenRouter, CrewAI, Mastra, AutoGen, and any LLM workflow.

budget-agent helps developers track AI agent costs, enforce token limits, set spending caps, monitor LLM usage, and prevent runaway OpenAI, Anthropic, and OpenRouter agents from exceeding budget. Works with every provider. Zero vendor lock-in.

Install

npm install budget-agent

Quick start

import { AgentBudget, BudgetError } from 'budget-agent';

const agent = new AgentBudget({
  apiKey: process.env.OPENROUTER_API_KEY,
  limits: {
    maxCostUSD: 0.10,
    maxSteps: 15,
    maxTotalTokens: 50_000,
    maxWallTimeMs: 30_000,
  },
});

const response = await agent.step({
  model: 'anthropic/claude-sonnet-4-5',
  messages: [{ role: 'user', content: 'Hello' }],
});

console.log(agent.getUsage());

Prevent runaway AI agents

Agent loops multiply LLM costs across every step. Without guardrails, a single loop can burn through your entire API budget in seconds. budget-agent checks your limits before and after each call to your provider, so a runaway loop stops as soon as a limit is reached.

const agent = new AgentBudget({
  apiKey: key,
  limits: { maxCostUSD: 0.05, maxSteps: 10 },
});

try {
  // Loop until a budget limit throws.
  while (true) {
    const res = await agent.step({ model, messages });
    messages.push(res.choices[0].message);
    messages.push({ role: 'user', content: 'Continue.' });
  }
} catch (err) {
  if (err instanceof BudgetError) {
    console.log('Agent stopped:', err.exceeded.reason);
  }
}
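To make the mechanism concrete, here is a minimal sketch of how a wrapper could turn an unbounded loop into a bounded one. It is not budget-agent's implementation: `SketchBudgetError`, `LoopGuard`, `runUntilStopped`, and the fixed per-step cost are hypothetical names and values chosen for illustration, and the LLM call is simulated.

```typescript
// Hypothetical illustration only; not taken from budget-agent's source.
type StopReason = 'cost' | 'steps';

class SketchBudgetError extends Error {
  readonly reason: StopReason;

  constructor(reason: StopReason) {
    super(`Budget exceeded: ${reason}`);
    this.name = 'SketchBudgetError';
    this.reason = reason;
    // Keeps `instanceof` working even when compiled to older targets.
    Object.setPrototypeOf(this, SketchBudgetError.prototype);
  }
}

class LoopGuard {
  private steps = 0;
  private costUSD = 0;
  private readonly maxSteps: number;
  private readonly maxCostUSD: number;

  constructor(maxSteps: number, maxCostUSD: number) {
    this.maxSteps = maxSteps;
    this.maxCostUSD = maxCostUSD;
  }

  // Throws instead of recording the step when either cap would be passed.
  recordStep(stepCostUSD: number): void {
    if (this.steps + 1 > this.maxSteps) throw new SketchBudgetError('steps');
    if (this.costUSD + stepCostUSD > this.maxCostUSD) throw new SketchBudgetError('cost');
    this.steps += 1;
    this.costUSD += stepCostUSD;
  }

  get stepCount(): number {
    return this.steps;
  }
}

// Simulates an agent loop in which every step costs the same amount.
function runUntilStopped(guard: LoopGuard, stepCostUSD: number): StopReason {
  try {
    while (true) {
      guard.recordStep(stepCostUSD);
    }
  } catch (err) {
    if (err instanceof SketchBudgetError) return err.reason;
    throw err; // unrelated errors still propagate
  }
}
```

In this sketch the only way out of `while (true)` is the thrown error, which is the same control flow the example above relies on.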

Track LLM costs in production

Get real-time visibility into every API call. See cost per step, total spend, token breakdown, and wall time.

const usage = agent.getUsage();
// {
//   steps: 12,
//   totalCostUSD: 0.0847,
//   totalInputTokens: 24300,
//   totalOutputTokens: 8200,
//   elapsedMs: 45200,
//   stepHistory: [...]
// }

agent.summary(); // formatted table in console
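A cost figure like the one above is, at bottom, token counts multiplied by per-token prices. The following is a minimal sketch of that arithmetic, assuming prices are quoted in USD per million tokens. The `ModelPrice` type, the `estimateStepCostUSD` function, and the example rates are hypothetical; they are not taken from budget-agent or from any provider's price list.

```typescript
// Hypothetical pricing shape: USD per one million tokens.
interface ModelPrice {
  inputPerMTokUSD: number;
  outputPerMTokUSD: number;
}

// Cost of one step = input tokens at the input rate + output tokens at the output rate.
function estimateStepCostUSD(
  inputTokens: number,
  outputTokens: number,
  price: ModelPrice,
): number {
  const microUSD =
    inputTokens * price.inputPerMTokUSD + outputTokens * price.outputPerMTokUSD;
  return microUSD / 1_000_000;
}

// Made-up rates, for illustration only.
const examplePrice: ModelPrice = { inputPerMTokUSD: 3, outputPerMTokUSD: 15 };
const exampleCost = estimateStepCostUSD(24_300, 8_200, examplePrice);
```

Because output tokens are often priced higher than input tokens, tracking the two counts separately matters for an accurate total.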

Set hard budget caps

Every limit is optional. Set only what you need.

limits: {
  maxCostUSD: 0.05,        // total USD across all steps
  maxSteps: 10,            // total LLM calls
  maxInputTokens: 50000,   // input tokens only
  maxOutputTokens: 10000,  // output tokens only
  maxTotalTokens: 60000,   // input + output combined
  maxWallTimeMs: 60000,    // wall clock time in ms
}
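One way to picture "every limit is optional" is a check that skips any cap left undefined. The sketch below assumes a limit counts as exceeded only when usage is strictly greater than the cap; that comparison, along with the `SketchLimits`, `SketchUsage`, and `firstExceededLimit` names, is an assumption for illustration and not budget-agent's documented behavior.

```typescript
// Field names mirror the limits listed above; everything else is hypothetical.
interface SketchLimits {
  maxCostUSD?: number;
  maxSteps?: number;
  maxInputTokens?: number;
  maxOutputTokens?: number;
  maxTotalTokens?: number;
  maxWallTimeMs?: number;
}

interface SketchUsage {
  costUSD: number;
  steps: number;
  inputTokens: number;
  outputTokens: number;
  elapsedMs: number;
}

type LimitName = keyof SketchLimits;

// Returns the first limit that is both configured and exceeded, or null.
function firstExceededLimit(limits: SketchLimits, usage: SketchUsage): LimitName | null {
  const observed: Record<LimitName, number> = {
    maxCostUSD: usage.costUSD,
    maxSteps: usage.steps,
    maxInputTokens: usage.inputTokens,
    maxOutputTokens: usage.outputTokens,
    maxTotalTokens: usage.inputTokens + usage.outputTokens,
    maxWallTimeMs: usage.elapsedMs,
  };
  for (const name of Object.keys(observed) as LimitName[]) {
    const cap = limits[name];
    // An undefined cap means "no limit", so it is skipped.
    if (cap !== undefined && observed[name] > cap) return name;
  }
  return null;
}
```

With only `maxSteps` configured, for example, a very expensive run is still allowed as long as the step count stays within the cap.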

Runtime limits for AI agents

Kill agents that run too long. Set wall time limits to prevent infinite loops from consuming compute and money.

const agent = new AgentBudget({
  apiKey: key,
  limits: {
    maxWallTimeMs: 30_000, // 30 second hard stop
    maxCostUSD: 1.00,
  },
});
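A wall-time limit can be sketched as a start timestamp plus a comparison against the current time. The `WallTimeLimit` class and its injectable `Clock` are hypothetical and only illustrate the idea; how budget-agent measures or enforces elapsed time is not specified here. Note that this sketch only answers "has the deadline passed?" when asked, so it would stop a loop between steps rather than interrupt a request already in flight.

```typescript
// A clock is any function returning milliseconds; injectable so tests need not sleep.
type Clock = () => number;

class WallTimeLimit {
  private readonly startedAt: number;
  private readonly maxWallTimeMs: number;
  private readonly now: Clock;

  constructor(maxWallTimeMs: number, now: Clock = () => Date.now()) {
    this.maxWallTimeMs = maxWallTimeMs;
    this.now = now;
    this.startedAt = now(); // the budget starts counting at construction
  }

  elapsedMs(): number {
    return this.now() - this.startedAt;
  }

  // Assumption: reaching the cap exactly counts as expired.
  isExpired(): boolean {
    return this.elapsedMs() >= this.maxWallTimeMs;
  }
}
```

A loop would call `isExpired()` before each step and stop once it returns true.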

Token usage tracking

Track input tokens, output tokens, and total tokens across every step. Know exactly where your budget goes.

agent.on('step:end', (e) => {
  console.log(`Step ${e.stepIndex}: ${e.inputTokens} in / ${e.outputTokens} out / $${e.costUSD}`);
});
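The per-step fields logged above can also be collected and summarized to see which step consumed the most budget. This is a small sketch under the assumption that each event carries the four fields used in the listener (`stepIndex`, `inputTokens`, `outputTokens`, `costUSD`); the `StepEndEvent` and `SpendReport` types and the `summarizeSteps` function are hypothetical helpers, not part of the package.

```typescript
// Shape assumed from the fields read in the 'step:end' listener above.
interface StepEndEvent {
  stepIndex: number;
  inputTokens: number;
  outputTokens: number;
  costUSD: number;
}

interface SpendReport {
  totalInputTokens: number;
  totalOutputTokens: number;
  totalCostUSD: number;
  mostExpensiveStep: number | null; // null when there are no steps
}

// Sums token counts and cost, and remembers the index of the priciest step.
function summarizeSteps(events: StepEndEvent[]): SpendReport {
  const report: SpendReport = {
    totalInputTokens: 0,
    totalOutputTokens: 0,
    totalCostUSD: 0,
    mostExpensiveStep: null,
  };
  let highestCost = -Infinity;
  for (const e of events) {
    report.totalInputTokens += e.inputTokens;
    report.totalOutputTokens += e.outputTokens;
    report.totalCostUSD += e.costUSD;
    if (e.costUSD > highestCost) {
      highestCost = e.costUSD;
      report.mostExpensiveStep = e.stepIndex;
    }
  }
  return report;
}
```

In practice the listener would push each event into an array, and `summarizeSteps` would run once the agent finishes.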

Agent guardrails

Pre-flight checks estimate output cost before the API call. Post-step checks record actual spend. If a limit is hit, the step rolls back and you can retry cleanly.

try {
  await agent.step({ model, messages });
} catch (err) {
  if (err instanceof BudgetError) {
    err.exceeded.reason; // 'cost' | 'steps' | 'totalTokens' | 'wallTime'
    err.exceeded.usage;  // full snapshot at cutoff
  }
}
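The pre-flight, post-step, and rollback sequence described above can be sketched with a small ledger. This is an illustration under stated assumptions, not the package's code: `SketchLedger`, `StepOutcome`, and `attemptStep` are hypothetical, the provider call is simulated by a function that returns the actual cost, and only a cost cap is modeled.

```typescript
interface SketchLedger {
  steps: number;
  costUSD: number;
}

interface StepOutcome {
  committed: boolean;
  blockedAt: 'preflight' | 'postStep' | null;
  ledger: SketchLedger; // state the caller should keep using
}

function attemptStep(
  ledger: SketchLedger,
  maxCostUSD: number,
  estimatedCostUSD: number,
  runCall: () => number, // simulated provider call; returns the actual cost
): StepOutcome {
  // Pre-flight: refuse before calling the provider if the estimate cannot fit.
  if (ledger.costUSD + estimatedCostUSD > maxCostUSD) {
    return { committed: false, blockedAt: 'preflight', ledger };
  }

  const snapshot: SketchLedger = { ...ledger };
  const updated: SketchLedger = {
    steps: ledger.steps + 1,
    costUSD: ledger.costUSD + runCall(),
  };

  // Post-step: the actual cost may exceed the estimate, so check again.
  if (updated.costUSD > maxCostUSD) {
    // Roll back to the snapshot so tracked state matches the last good step.
    return { committed: false, blockedAt: 'postStep', ledger: snapshot };
  }
  return { committed: true, blockedAt: null, ledger: updated };
}
```

In this sketch, rolling back only restores the tracked state; it cannot undo whatever the simulated call already consumed, which is why the pre-flight estimate does most of the protective work.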

OpenAI cost tracking

Use budget-agent with the OpenAI SDK to track GPT-4, GPT-4o, and GPT-3.5 costs in real time.

import { AgentBudget } from 'budget-agent';
import OpenAI from 'openai';

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

Anthropic budget limits

Set spending caps on Claude Opus, Sonnet, and Haiku. Track token usage and enforce cost limits for Anthropic models.

const agent = new AgentBudget({
  apiKey: process.env.ANTHROPIC_API_KEY,
  limits: { maxCostUSD: 0.25, maxSteps: 20 },
  executor: async (request) => {
    const response = await fetch('https://api.anthropic.com/v1/messages', {
      method: 'POST',
      headers: {
        'x-api-key': process.env.ANTHROPIC_API_KEY!,
        'anthropic-version': '2023-06-01',
        'content-type': 'application/json',
      },
      body: JSON.stringify({
        model: request.model,
        messages: request.messages,
        max_tokens: 1024,
      }),
    });
    const data = await response.json();
    return {
      model: data.model,
      usage: {
        prompt_tokens: data.usage?.input_tokens ?? 0,
        completion_tokens:...
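The snippet above is cut off in the source, so the rest of the executor is not reproduced here. As a separate, standalone illustration of the translation it starts to perform, the sketch below maps Anthropic's `input_tokens` and `output_tokens` usage fields onto OpenAI-style `prompt_tokens` and `completion_tokens`. The `toOpenAIStyleUsage` helper and the `total_tokens` field are assumptions about what a custom executor might return, not the package's documented contract.

```typescript
// Usage fields as returned by Anthropic's Messages API (may be absent on errors).
interface AnthropicUsage {
  input_tokens?: number;
  output_tokens?: number;
}

// OpenAI-style usage shape; whether budget-agent needs total_tokens is an assumption.
interface OpenAIStyleUsage {
  prompt_tokens: number;
  completion_tokens: number;
  total_tokens: number;
}

// Hypothetical helper: missing counts fall back to 0 so tracking never sees undefined.
function toOpenAIStyleUsage(usage: AnthropicUsage | undefined): OpenAIStyleUsage {
  const prompt = usage?.input_tokens ?? 0;
  const completion = usage?.output_tokens ?? 0;
  return {
    prompt_tokens: prompt,
    completion_tokens: completion,
    total_tokens: prompt + completion,
  };
}
```

Keeping the mapping in one small function makes it easy to test separately from the network call.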
