5555watch
5 minutes ago
My guess is that with the new UI they've either mistakenly or deliberately reduced the thinking effort or the thinking budget, even though the app still says "extended thinking". It also seems that they're (mistakenly or deliberately) now routing some queries to the Flash model even when Pro/Thinking is selected.
It was super bad right after launch, yet AI Studio and API calls were on par with the earlier experience. Despite "extended thinking" being selected, for the first time in a very long time the LLM output completely incorrect, non-working code for a super simple problem (schema matching), which it can solve easily (and did through the API). So it's definitely not a coincidence.
I'm hoping that it's temporary, because otherwise we're back to GPT-5.0 levels of bad, and that will kill coding usage.