Hackernews
new
show
ask
jobs
Reinforcement learning towards broadly and persistently beneficial models
1 points
posted 12 hours ago
by jawiggins
(alignment.openai.com)
No comments yet