Hackernews
new
show
ask
jobs
Natural Emergent Misalignment from Reward Hacking in Production RL [pdf]
1 points
posted 2 months ago
by samlinnfer
(assets.anthropic.com)
No comments yet