Kav
Tag: Reinforcement Learning
Fix the inference engine before you patch the RL objective
(huggingface.co)
Engineering · 1 week ago · May 6, 2026
Reward Hacking: Why Better Models Game You More
(lilianweng.github.io)
AI · 1 year ago · November 28, 2024