Hackernews
new
show
ask
jobs
Absolute Zero: Reinforced Self-Play Reasoning with Zero Data
3 points
posted a day ago
by artninja1988
(arxiv.org)
No comments yet