Hackernews
Reinforcement learning towards broadly and persistently beneficial models
2 points, posted 9 hours ago by spicypete (alignment.openai.com)
No comments yet