Show HN: Datetime-bench: which datetime formats LLMs get right (and wrong)

2 points, posted 11 hours ago
by diwank

1 comment

ishita159

10 hours ago

surprised to see Gemini > Sonnet 4.6 > Opus 4.6

why do you think Sonnet does better than Opus on this?