BaxBench: Can LLMs Generate Secure and Correct Back Ends?

2 points, posted 7 months ago
by chillax

1 Comment

lucasluitjes

7 months ago

I've seen LLMs generate plenty of wildly insecure code, but the percentage of functional solutions that are also insecure is even higher than I expected.

Also, I'm curious how the average coder would fare on this benchmark.