hackernews client

Reinforcement Learning from Human Feedback (RLHF) in Notebooks

72 pointsposted 7 months ago

3 Comments

kcdom1000f

7 months ago

Hl

careful_ai

7 months ago

[dead]

bobvylan

7 months ago

[dead]