BenchPress · Predict LLM Scores
Predict any LLM's score on any benchmark.
Pick a model. Pick a benchmark.
01. Choose a model
02. Choose a benchmark
Score
Select a model and benchmark above.
Leaderboard
On this benchmark.
Full (incl. predicted) · Observed only
Resources
Paper, code, and data.
Use the code to reproduce the paper, or download the score matrix behind the predictor.
Paper code · Dataset · arXiv
Contribute
Have more scores?
Report benchmark scores for a model. Include the model, benchmark, score, evaluation setting, effort, and source; we will review the provenance of each report before adding the scores to the matrix.
Report scores