Hacker News
Scaling pretraining affects RL sample efficiency
1 point
posted a day ago
by ag8
(runrl.com)
No comments yet