hackernews client

Hackernews new show ask jobs

Scaling Reinforcement Learning for Trillion-Scale Thinking Model

3 pointsposted 3 months ago

by mountainview

(arxiv.org)

No comments yet