Hackernews
new
show
ask
jobs
Reinforcement Learning from Human Feedback (RLHF) in Notebooks
72 points
posted 7 months ago
by ash_at_hny
(github.com)
3 Comments
kcdom1000f
7 months ago
Hl
careful_ai
7 months ago
[dead]
bobvylan
7 months ago
[dead]