Footnotes of a Curious Mind 📝
Subscribe
Sign in
Share this post
Footnotes of a Curious Mind 📝
Gaming the System: Reward Hacking in Language‑Model Training
Copy link
Facebook
Email
Notes
More
Gaming the System: Reward Hacking in…
Darpan
May 13
Share this post
Footnotes of a Curious Mind 📝
Gaming the System: Reward Hacking in Language‑Model Training
Copy link
Facebook
Email
Notes
More
How clever shortcuts can derail our smartest algorithms—and what is being done about it
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Gaming the System: Reward Hacking in…
Share this post
How clever shortcuts can derail our smartest algorithms—and what is being done about it