Hackernews
new
show
ask
jobs
Reward models for LMs are fundamentally broken
2 points
posted 9 hours ago
by panthertrax
(twitter.com)
No comments yet