Supervised Fine Tuning on Curated Data Is Reinforcement Learning

3 pointsposted 7 months ago
by saijajin

1 Comments

user

7 months ago

[deleted]