Hackernews
new
show
ask
jobs
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
4 points
posted 6 hours ago
by rntn
(nature.com)
No comments yet