Reward models for LMs are fundamentally broken

2 points, posted 9 hours ago
by panthertrax

No comments yet