OpenClaw leads official ARC-AGI-3 community leaderboard

ARC Prize - Community Leaderboard

ARC-AGI Community Leaderboard ARC-AGI has gained significant popularity over the past two years, and we've been overwhelmed by the number of researchers and builders who want to showcase their work to the community. The ARC-AGI Community Leaderboard provides a landing spot for these submissions, where the community can review, discuss, and verify results together. Community Leaderboard submissions must be general purpose and reproducible. Scores are self-reported on the ARC-AGI-1 and ARC-AGI-2 semi-private sets and the ARC-AGI-3 public set. ARC Prize will not independently verify submissions except in extraordinary cases, and we reserve the right to determine what qualifies. For more on how we approach testing, see our testing policy. To submit your work, head to the ARC-AGI Community Leaderboard repo on GitHub.

NameAuthorsBenchmarkScoreCostDateLinksOpenClawOpenClaw Harness adapted to play ARC-AGI-3 allowed memory and code execution tools.

ARC Prize FoundationARC-AGI-35.2%$2,9122026-05-15codescorecard Human Intelligence HarnessMaximum human intelligence built into an agent harness.

ARC Prize FoundationARC-AGI-395.3%*-2026-04-14codescorecard Read-Grep-Bash AgentA coding agent that uses search and Python scripting over game logs. Note: Score pending full run on public set.

Alexis Fox, Junlin Wang, Paul Rosu, Bhuwan DhingraARC-AGI-3--2026-03-13codepaper scorecard Evolutionary Test-Time Compute with Natural Language InstructionsEvolves natural language instructions instead of code.

Jeremy BermanARC-AGI-229.4%$3,6482025-09-16codepaper Efficient Evolutionary Program SynthesisEvolves a growing library of Python programs with an LLM.

Eric PangARC-AGI-226.0%$4762025-09-01codepaper Tiny Recursive Model (TRM)7M parameter recursive model with think-act refinement loops.

Alexia Jolicoeur-MartineauARC-AGI-27.8%*$2522025-07-01codepaper Hierarchical Reasoning Model (HRM)Brain-inspired 27M parameter model with iterative refinement.

Sapient IntelligenceARC-AGI-22.0%$2012025-06-08codepaper Evolutionary Test-time ComputeGenetic algorithm over LLM-generated Python transforms.

Jeremy BermanARC-AGI-153.6%$2,9002024-12-18codepaper Ryan GreenblattLLM generates and refines thousands of candidate programs per task.

Ryan GreenblattARC-AGI-143.0%$40,0002024-06-17codepaper

* denotes score on public set

ARC Prize 2026 Get started and receive official contest updates and news.

ARC Prize : Newsletter Subscribe to get started and receive official contest updates and news.

Subscribe No spam. You can unsubscribe at anytime.

OpenClaw leads official ARC-AGI-3 community leaderboard

Related Articles

Amazon, Facebook, FBI have access to a private intelligence-sharing network

Elevated error rates on requests to multiple models

Donald Trump and sons to be 'forever' exempt from tax audits

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

Old Reddit Is Down