JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

1 point, posted 12 hours ago
by simonpure