Hackernews
new
show
ask
jobs
Scaling Reinforcement Learning for Trillion-Scale Thinking Model
3 points
posted 18 hours ago
by mountainview
(arxiv.org)
No comments yet