My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf]

CodeReclaimers1 pts0 comments

bishop-loop-experiment-3/paper/paper.pdf at main · CodeReclaimers/bishop-loop-experiment-3 · GitHub

//blob/show" data-turbo-transient="true" />

Skip to content

Search or jump to...

Search code, repositories, users, issues, pull requests...

-->

Search

Clear

Search syntax tips

Provide feedback

--><br>We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Cancel

Submit feedback

Saved searches

Use saved searches to filter your results more quickly

-->

Name

Query

To see all available qualifiers, see our documentation.

Cancel

Create saved search

Sign in

//blob/show;ref_cta:Sign up;ref_loc:header logged out"}"<br>Sign up

Appearance settings

Resetting focus

You signed in with another tab or window. Reload to refresh your session.<br>You signed out in another tab or window. Reload to refresh your session.<br>You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

{{ message }}

CodeReclaimers

bishop-loop-experiment-3

Public

Notifications<br>You must be signed in to change notification settings

Fork

Star

FilesExpand file tree

main

/paper.pdf

Copy path

More file actions

More file actions

Latest commit

History<br>History<br>History

290 KB

main

/paper.pdf

Top

File metadata and controls<br>290 KB

Download raw file<br>Edit and raw actions

You can’t perform that action at this time.

search file loop paper bishop experiment

Related Articles