BaxBench: Can LLMs Generate Secure and Correct Back Ends?

2 pointsposted 15 hours ago
by chillax

1 Comments

lucasluitjes

15 hours ago

I've seen LLMs generate plenty of wildly insecure code, but the percentage of insecure solutions out of the solutions that are functional, is even higher than I expected.

Also, I'm curious how the average coder would fare on this benchmark.