Hackernews
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
1 point
posted 2 months ago
by simonpure
(arxiv.org)
No comments yet