Hacker News
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
1 point
posted 12 hours ago
by simonpure
(arxiv.org)
No comments yet