Hackernews
new
show
ask
jobs
Absolute Zero: Reinforced Self-Play Reasoning with Zero Data
3 points
posted 9 months ago
by artninja1988
(arxiv.org)
No comments yet