AMS – Detect unsafe LLMs in 30 seconds via activation analysis

1 point, posted 10 hours ago
by gmessenger

1 comment