> Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior").
"Often with contextual hints" is doing some heavy lifting here, IMO. I agree with the article's premise -- you don't need Mythos to use AI to find novel, complex vulnerabilities -- but the results, as presented, are somewhat misleading.
I'm awaiting general release so I can root and jailbreak some old Android phones. If it succeeds, I'm a fan. If it fails, then it's not a leap after all, just another step.
> TL;DR: We tested Anthropic Mythos's showcase vulnerabilities on small, cheap, open-weights models. They recovered much of the same analysis. AI cybersecurity capability is very jagged: it doesn't scale smoothly with model size, and the moat is the system into which deep security expertise is built, not the model itself. Mythos validates the approach, but it does not settle the question yet.
Notably, Kimi K2 and GPT-OSS-120b do quite well when provided with the isolated context. The article seems heavily LLM-assisted, but the content itself is good.