Synth beats Fable 5 on deep research

ljlolel1 pts1 comments

Synth beats Fable 5: introducing Iris, Zeus, and Prometheus | TrustedRouter

We're hiring<br>We're looking for PhD researchers to join the team and work on exciting frontier problems.<br>Get in touch &rarr;

&larr; TrustedRouter blog<br>Synth beats Fable 5: introducing Iris, Zeus, and Prometheus

2026-06-24 &middot; TrustedRouter-Fusion-Draco on GitHub

TrustedRouter.com:Synth beats Fable 5 at a fraction of the costDRACO deep-research score vs estimated cost to run all 100 tasks · cheaper → right · Iris / Prometheus / Zeus trace the frontier40455055606570$0$10$25$50$100$180$250estimated cost to run 100 DRACO tasks (cheaper →)DRACO scoreFable 5Opus 4.8GPT-5.5Gemini 3.1 ProDeepSeek V4 ProKimi K2.6Gemini 3 FlashIris 1.062.6 · ~$20Prometheus 1.069.2 · ~$34Zeus 1.073.4 · ~$180Prometheus matches frontier quality at open-model cost. Zeus is the ceiling. Iris is the cheapest way in.TrustedRouter.com

Synth — TrustedRouter's multi-model fusion, where a panel of models each answers a question, a judge weighs the answers, and a synthesizer writes the final one — now ships as three named presets. They share one fusion engine: a Kimi K2.6 judge and a GLM 5.2 synthesizer, the pairing our judge-and-synthesizer tests put on top. What changes between them is the panel. One model id each, the whole thing inside the attested gateway.

presetmodel idpanelDRACOest. $ / 100 tasksIris 1.0 trustedrouter/irisbudget62.6~$20Prometheus 1.0 trustedrouter/prometheusall open-weights69.2 ~$34Zeus 1.0 trustedrouter/zeuscommercial frontier73.4 ~$180<br>The three presets are the efficient frontier. Plot DRACO deep-research score against what it costs to run the whole 100-task benchmark and the three trace the upper-left edge — every standalone model, open or frontier, sits below them. Fable 5, the model OpenRouter built its best fusion on, scores 65.3 for an estimated $250 a run; Prometheus scores 69.2 for about $34. Synth beats Fable 5 by four points at roughly a seventh of the cost.

Prometheus is the one most people should reach for. Its panel is all open-weights — MiniMax M3, Kimi K2.6, GLM 5.2, Gemma 4, DeepSeek V4 Pro — so nothing in it is closed or priced like the frontier, and it still lands within four points of the best score we have ever measured while clearing every frontier solo: Opus 4.8 at 60.7, GPT-5.5 at 63.0, Fable 5 at 65.3. Near-frontier deep research at open-model cost.

Zeus is the ceiling. Put the commercial frontier on the panel, keep the same open-model judge and synthesizer, and Synth reaches 73.4 — the state of the art on DRACO, above OpenRouter's published best. It runs about five times the cost of Prometheus, so it is the preset for when the answer matters more than the bill. Iris is the cheapest way in — a small budget panel, the same judge and synthesizer, 62.6 for about $20, above any single budget model.

Pick by id — trustedrouter/iris, trustedrouter/prometheus, or trustedrouter/zeus — on the same OpenAI-compatible API, with the panel, judge, and synthesis all running inside the attested gateway. Send the cheap prompts to Iris, the everyday hard ones to Prometheus, and the few that have to be right to Zeus.

A note on the numbers. Scores are DRACO, graded the way the rest of this series is. The cost figures are estimates for a full 100-task run, derived from measured token usage and public per-token pricing and anchored to two we have published (an open model around $9 a run, Fable 5 around $250) — read them as order-of-magnitude, not invoices. The eval harness and per-task scores are public. Try Synth &rarr;

More posts<br>Models<br>PrometheusBench

Sign in

Choose a sign in method.

G Continue with Google

= Continue with GitHub

M Continue with MetaMask

trustedrouter model synth fable prometheus frontier

Related Articles