Automatic LLM routing that optimizes cost and speed

terezatizkova1 pts0 comments

Factory Router<br>Log In

Contact Sales

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>Router

Frontier performance at lower cost<br>Automatic model selection for every Droid session. Factory Router picks the right model for each task, maintains frontier performance, and cuts cost by up to 25%.<br>$ droid --model router "refactor auth middleware"

Refactor auth middleware to use JWT validationDroid is routing…

Auto-ModelAutoMCP (3)Skills (12)

router-classifierclassifier · ~2s<br>Reads the first user message, recent tool calls and repo signals, then emits a scalar quality probability for each model.<br>SignalWeightComputingScore<br>message0.300.84<br>recent tools0.200.62<br>repo size0.150.77<br>language mix0.200.91<br>difficulty0.150.88<br>Final Score0.80

candidate scoringthreshold 0.70<br>sorted cheapest → most expensivequality_threshold<br>Kimi K2.6Moonshot$0.81<br>MiniMax-M2.7MiniMax$$0.88<br>Claude Opus 4.7Anthropic$$$0.95

Kimi K2.6<br>streaming<br>›Reading src/auth/middleware.ts...<br>›Found legacy session cookie validation<br>›Replacing with JWT verify (RS256)<br>›Generated 7 tests covering edge cases<br>›PR #418 opened — ready for review

promptclassifyscorerun

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>The problem

AI coding costs are rising across organizations.<br>Enterprise AI costs are climbing, and a bigger token bill does not mean more work is getting done. To avoid losing on performance, engineers usually default to the most performant model for all tasks. Simple questions, mechanical refactors, documentation updates, small bug fixes, and search-heavy investigations end up on the same premium path as work that truly needs frontier performance. Budgets get exhausted without a clear increase in organization-level output.

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>For engineers

Stop choosing a model for every task.<br>Today you pick a model per task and lean on the most expensive one to be safe. With Factory Router you choose once and it picks the best model for each session.

01Select Factory Router.+

02Factory Router picks the right model per session.+

03It recovers if a model struggles.+

Same prompts. Different cost.<br>Without RoutingAlways Claude Opus 4.7<br>“reset my password”Claude Opus 4.7$0.00<br>“add a copyright header”Claude Opus 4.7$0.00<br>“design a caching layer”Claude Opus 4.7$0.00

Total session cost$0.00

With Factory RouterRouted per task<br>“reset my password”Kimi K2.6$0.00<br>“add a copyright header”MiniMax-M2.7$0.00<br>“design a caching layer”Kimi K2.6$0.00

Total session cost$0.00

Savings on identical work0%

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>Benchmarks

On our enterprise engineering benchmarks.<br>Compared with Claude Opus 4.7, Factory Router maintains frontier performance at lower cost per session. At enterprise scale, those savings apply across every Droid session, with spend tied to the work being done rather than a blanket default to the most expensive model.<br>Read the announcementArrow Right Icon

TERMINAL-BENCH 2PASS RATE · vs OPUS 4.70%of Claude Opus 4.7 pass rateCOST PER SESSION · vs OPUS 4.70%lowerFactory Router runs at 80% of Opus costCost per successful run · 80.5% of OpusLEGACY-BENCHPASS RATE · vs OPUS 4.70%of Claude Opus 4.7 pass rateCOST PER SESSION · vs OPUS 4.70%lowerFactory Router runs at 75% of Opus costCost per successful run · 78.0% of OpusReported relative to Claude Opus 4.7 · cost measured as full-session cost · averaged across multiple runs

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>Reliability

Reliability you can count on.<br>When a provider degrades, rate limits hit, or capacity gets constrained, your sessions keep going. Factory Router routes across models, providers, and capacity to deliver 99.9%+ request reliability.

Claude Opus 4.7Bedrock· degraded

reroute<br>Claude Opus 4.7Vertex· healthy

Provider failover<br>If a provider path degrades, Factory Router keeps the session running on the same model through a healthy provider.

Dedicated TPM<br>Enterprise customers get reserved throughput for critical work instead of relying only on shared public capacity.

Rich frontier models<br>Factory Router keeps frontier models available as they come online, so high-complexity work gets the strongest model class.

US-hosted open-source models<br>Route eligible work to US-hosted open-source models when you need cost-efficient or controlled options.

[data-slot='badge-icon']]:border-transparent [&>[data-slot='badge-icon']]:bg-accent-300 [&_*]:text-base-500 gap-3"><br>For admins

Routing that reflects how your organization works.<br>Routing guidance brings your team's context into Factory Router, so automatic model selection reflects how work actually happens inside your organization. The same...

opus router model icon data slot

Related Articles