hackernews client

lucasluitjes

7 months ago

I've seen LLMs generate plenty of wildly insecure code, but the percentage of insecure solutions out of the solutions that are functional, is even higher than I expected.

Also, I'm curious how the average coder would fare on this benchmark.

BaxBench: Can LLMs Generate Secure and Correct Back Ends?

1 Comments

lucasluitjes