rurban5 months agoThis is about the R1 reinforcement learning paper from January 2025, not something new.