Hive Trust · Live signed benchmarks
Hive Trust · Live Benchmarks
Every claim signed.<br>Every benchmark reproducible.
Hive primitives are benchmarked head-to-head against published SOTA, scored on real datasets, and every result is cryptographically signed. No marketing screenshots — receipts.
View live benchmarks<br>Methodology + reproducibility
Protected or Pending by Hive ColonyIP
Primitives benchmarked
Records signed
Signing algorithm<br>Ed25519
Last updated
Live benchmarks
Every record below is a signed Ed25519 receipt from hivemorph. Click any card for the full methodology.
Inference primitives — production workloads with paying customers
Other primitives — trust infrastructure
Earned badges
Two earnable stamps. Hive Verified is awarded to any primitive that emits an Ed25519 signed receipt. Hive Platinum is awarded only when the trust record is publishable (n ≥ 500, |d| ≥ 0.3, p
Hive Verified · earned by
Hive Platinum · earned by
How we benchmark
Four non-negotiable rules applied uniformly across every primitive, every adversary, every dataset.
Step 01
Pick the published SOTA
We do not compare against straw men. Adversaries are the highest-citation published baseline for each task:<br>LLMLingua-2 for compression, NIST FIPS-204 for signatures, Llama-Guard for safety,<br>self-consistency CoT for reasoning, DSPy for prompt compilation, Constitutional AI for factuality.
Step 02
Ensemble construction
Hive v2 primitives are ensembles that include the SOTA adversary itself as one candidate,<br>plus 3–4 Hive-specific strategies. A quality oracle picks the per-input winner.<br>By construction, the ensemble cannot lose to the adversary alone.
Step 03
Pre-registered evaluation
We commit to the dataset, sample size, metric, and decision criteria before running the benchmark.<br>Pre-registration is published at<br>github.com/srotzin/xcalibur-evaluation.
Step 04
Cryptographic receipts
Every result line, every paired t-statistic, every Cohen's d is committed in a signed Ed25519 receipt.<br>Receipts are queryable at /v1/trust/benchmarks on hivemorph.onrender.com.<br>Tamper any field — signature breaks. No editable marketing slides.
Result status
Every benchmark record carries one of three statuses. Status is computed from the data, not editorially assigned.
Publishable
n ≥ 500, Cohen's d ≥ 0.3, p
Preliminary
n meets minimum but effect size or p-value below publishable bar. Honest in-progress.
Match
Hive primitive matches the adversary on correctness within latency budget. Match, not beat — still a real result.
Verify any receipt
Every record is public. You do not need an account or an API key.
curl -sS https://hivemorph.onrender.com/v1/trust/benchmarks/{record_id}
Every field is signed. Re-derive the signature against the record's pubkey_hex.<br>If you can verify it, the record is authentic.
Open the verifier →
Hive primitives are Patent Pending. Provisional patents filed. The methodology, benchmarks, and receipts are open.<br>The cryptographic primitives are protected.
Hive ColonyIP →