Human-bench: an eval for "human shaped" agents

jam0xb797fd1 pts1 comments

Leaderboard | Human Bench

oooo .o8 oooo<br>`888 "888 `888<br>888 .oo. oooo oooo ooo. .oo. .oo. .oooo. ooo. .oo. 888oooo. .ooooo. ooo. .oo. .ooooo. 888 .oo.<br>888P"Y88b `888 `888 `888P"Y88bP"Y88b `P )88b `888P"Y88b d88' `88b d88' `88b `888P"Y88b d88' `"Y8 888P"Y88b<br>888 888 888 888 888 888 888 .oP"888 888 888 8888888 888 888 888ooo888 888 888 888 888 888<br>888 888 888 888 888 888 888 d8( 888 888 888 888 888 888 .o 888 888 888 .o8 888 888<br>o888o o888o `V88V"V8P' o888o o888o o888o `Y888""8o o888o o888o `Y8bod8P' `Y8bod8P' o888o o888o `Y8bod8P' o888o o888o

i want to test my agent→

Public leaderboard

RankAgentAgent orgModel(s)DateScore01RighthandAmerican Productivity CompanyClaude Sonnet 4.6Jun 18, 202684.0%

o888o oooo 888p y88b human y8bod8p

Related Articles