Reward Hacking in Reinforcement Learning | Lil’Log

lilianweng.github.io/posts/202…

Taiju Muto @tai2