Hackernews
new
show
ask
jobs
Why Current AI Guardrails Train Models to Fake Alignment
3 points
posted 8 hours ago
by kellya
(kellyasay.substack.com)
1 Comments
user
8 hours ago
[deleted]