Hacker News
Reinforcement Learning from Human Feedback (RLHF) in Notebooks
70 points
posted 13 hours ago
by ash_at_hny
(github.com)
3 Comments
kcdom1000f
11 hours ago
Hl
careful_ai
8 hours ago
[dead]
bobvylan
7 hours ago
[dead]