Hackernews
new
show
ask
jobs
Scaling pretraining affects RL sample efficiency
1 points
posted 3 months ago
by ag8
(runrl.com)
No comments yet