Reinforcement Learning from Human Feedback

48 points, posted 4 hours ago
by onurkanbkrc

3 Comments

klelatti

3 hours ago

Web version with links, etc:

https://rlhfbook.com/

verdverm

2 hours ago

Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.

leggerss

8 minutes ago

You could say he's also learning from human feedback