Show HN: Hive Trust – Ed25519-signed benchmarks for every AI inference primitive

thehivery1 pts0 comments

Hive Trust · Live signed benchmarks

Hive Trust · Live Benchmarks

Every claim signed.<br>Every benchmark reproducible.

Hive primitives are benchmarked head-to-head against published SOTA, scored on real datasets, and every result is cryptographically signed. No marketing screenshots — receipts.

View live benchmarks<br>Methodology + reproducibility

Protected or Pending by Hive ColonyIP

Primitives benchmarked

Records signed

Signing algorithm<br>Ed25519

Last updated

Live benchmarks

Every record below is a signed Ed25519 receipt from hivemorph. Click any card for the full methodology.

Inference primitives — production workloads with paying customers

Other primitives — trust infrastructure

Earned badges

Two earnable stamps. Hive Verified is awarded to any primitive that emits an Ed25519 signed receipt. Hive Platinum is awarded only when the trust record is publishable (n &ge; 500, |d| &ge; 0.3, p

Hive Verified · earned by

Hive Platinum · earned by

How we benchmark

Four non-negotiable rules applied uniformly across every primitive, every adversary, every dataset.

Step 01

Pick the published SOTA

We do not compare against straw men. Adversaries are the highest-citation published baseline for each task:<br>LLMLingua-2 for compression, NIST FIPS-204 for signatures, Llama-Guard for safety,<br>self-consistency CoT for reasoning, DSPy for prompt compilation, Constitutional AI for factuality.

Step 02

Ensemble construction

Hive v2 primitives are ensembles that include the SOTA adversary itself as one candidate,<br>plus 3–4 Hive-specific strategies. A quality oracle picks the per-input winner.<br>By construction, the ensemble cannot lose to the adversary alone.

Step 03

Pre-registered evaluation

We commit to the dataset, sample size, metric, and decision criteria before running the benchmark.<br>Pre-registration is published at<br>github.com/srotzin/xcalibur-evaluation.

Step 04

Cryptographic receipts

Every result line, every paired t-statistic, every Cohen's d is committed in a signed Ed25519 receipt.<br>Receipts are queryable at /v1/trust/benchmarks on hivemorph.onrender.com.<br>Tamper any field — signature breaks. No editable marketing slides.

Result status

Every benchmark record carries one of three statuses. Status is computed from the data, not editorially assigned.

Publishable

n &ge; 500, Cohen's d &ge; 0.3, p

Preliminary

n meets minimum but effect size or p-value below publishable bar. Honest in-progress.

Match

Hive primitive matches the adversary on correctness within latency budget. Match, not beat — still a real result.

Verify any receipt

Every record is public. You do not need an account or an API key.

curl -sS https://hivemorph.onrender.com/v1/trust/benchmarks/{record_id}

Every field is signed. Re-derive the signature against the record's pubkey_hex.<br>If you can verify it, the record is authentic.

Open the verifier &rarr;

Hive primitives are Patent Pending. Provisional patents filed. The methodology, benchmarks, and receipts are open.<br>The cryptographic primitives are protected.

Hive ColonyIP &rarr;

hive signed benchmarks trust primitives record

Related Articles