Hackernews
new
show
ask
jobs
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost
1 points
posted 12 hours ago
by matt_d
(arxiv.org)
No comments yet