Reinforcement Learning from Human Feedback (RLHF) in Notebooks

72 points, posted 7 months ago by ash_at_hny

3 Comments