We Are Still Unable to Secure LLMs from Malicious Inputs

3 pointsposted 2 days ago
by zdw

1 Comments

simonw

2 days ago

Bruce Schneier:"We simply don’t know to defend against these attacks. We have zero agentic AI systems that are secure against these attacks. Any AI that is working in an adversarial environment—and by this I mean that it may encounter untrusted training data or input—is vulnerable to prompt injection. It’s an existential problem that, near as I can tell, most people developing these technologies are just pretending isn’t there."