Hackernews
Supervised Fine Tuning on Curated Data Is Reinforcement Learning
3 points, posted 13 hours ago by saijajin (independentresearch.ai)
1 comment
user
13 hours ago
[deleted]