Hackernews
new
show
ask
jobs
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
3 points
posted 5 months ago
by giuliomagnifico
(nature.com)
No comments yet