How fast is autonomous AI cyber capability advancing?

3 pointsposted 6 hours ago
by dcre

1 Comments

dcre

6 hours ago

A new Mythos checkpoint improves significantly on the previous one (and beats GPT-5.5-Cyber) on this benchmark.