PiRho3141
an hour ago
This is where open source models are important.
The latest deepseek v4 pro model is 2-5x cheaper than Claude Sonnet 4.6. Cursor's Compose 2.5 that was just recently released is 6x cheaper than Sonnet.
The state of the art models are going to get better and more expensive and smaller models are going to get cheaper.
There will be a point where the intelligence of both the cheap and state of the art models are indistinguishable by humans like it is indistinguishable for me to understand the difference the difference between Terrance Tao and my university math professor.
I don't always need the smartest and most expensive models. I will need it every once in awhile and will gladly pay that price if I had to. What I do need is the model that will solve the current problem I have in a reasonable amount of time.
clhodapp
an hour ago
I know it comes off as pedantic to point this out but: Those are open weight models not open source models.
Closed weight models are the equivalent of SaaS. Open weight models are the equivalent of binary driver blobs or Windows software. We don't really have actual open source LLMs, which would need to publicly release their training data and technique so you could train a similar model yourself, or use their work as a baseline for your own model.
This distinction matters because an actual open source LLM would be extremely important from an ecosystem point of view, if someone ever actually released one.
yogthos
3 minutes ago
There are absolutely fully open source models. These are not frontier models, but they very much do exist. OLMo is one of the models explicitly mentioned as having passed the OSI's validation phase. Pythia was also validated by the OSI as meeting its requirements for an open-source AI system. Lucie-7B is a multilingual model is one of the first LLM compliant with the OSI AI definition. Its creators explicitly state that the training dataset, data preparation code, and model weights are all publicly available under open licenses.
gruez
34 minutes ago
>The latest deepseek v4 pro model is 2-5x cheaper than Claude Sonnet 4.6. Cursor's Compose 2.5 that was just recently released is 6x cheaper than Sonnet.
It's ironic how in a thread about "AI subsidies" that people don't think free model releases from AI don't count as subsidies. Whatever AI winter that would cause AI companies to stop subsiding tokens, would probably cause other AI labs to stop doing free model releases. They might not be able to un-release the current crop of open models, but assuming proprietary model development still happens, they'll quickly go obsolete.
zozbot234
28 minutes ago
The currently-released models don't really go away. Even if they collectively only release a new model every few years for the sake of influence and public image, that's plenty enough to keep the competitive aspect going.
gruez
21 minutes ago
>Even if they collectively only release a new model every few years for the sake of influence and public image, that's plenty enough to keep the competitive aspect going.
This is unpersuasive. Why would AI companies (American or Chinese) stop subsidizing tokens, but keep doing open model releases? At least for the former you can argue it's a lead generation tool for enterprise contracts (eg. hobbyist uses claude code personal plan, then asks the company to buy claude code enterprise, which are billed at API rates), but what's the business case for doing open model releases? You might get some mindshare, but are also arming your competitors in the process. Moreover what makes you think the model releases will be at all competitive to frontier models? Google released gemma 4 a few weeks ago to acclaim, but it's in no way competitive to even GPT-5.4 or Opus 4.6.
greenmilk
an hour ago
> The state of the art models are going to get better and more expensive and smaller models are going to get cheaper.
Why do you think this will be true?
Right now I see the major US labs betting on gaining an advantage from having way more compute, and I see Chinese labs competing with one another in a resource-scarce environment, so they place much more emphasis on compute-efficiency.
But the supply chains that feed into the massive data center growth in the US are strained; there are energy, memory, and logistical bottlenecks to name a few.
In the medium-long run, compute capacity will not grow exponentially forever. Somehow it has for decades, but there can be no infinite exponential growth, and that point may be when the planet really starts to cook itself.
Maybe the US labs will become more compute-constrained, and then have to compete on efficiency.
Or maybe things change fundamentally in some other way I'm not thinking of.
nightski
an hour ago
The labs have a perverse incentive to make things as expensive compute wise as possible. The only thing keeping this somewhat in check is competition, but it's intentionally being gatekept by locking up the supply of computing infrastructure. With 3 players it's pretty easy to collude even if indirectly. They can't burn trillions forever. Nvidia's 75% profit margins are not sustainable forever.
Things will normalize, but it will take time.
gruez
31 minutes ago
>The labs have a perverse incentive to make things as expensive compute wise as possible. The only thing keeping this somewhat in check is competition, but it's intentionally being gatekept by locking up the supply of computing infrastructure. With 3 players it's pretty easy to collude even if indirectly.
By all accounts the AI capex boom is justified up by actual usage, rather than some nefarious plan for "locking up the supply of computing infrastructure". Just look at people complaining about claude availability and anthropic adding various load-shedding measures a few months ago.
gruez
33 minutes ago
>so they place much more emphasis on compute-efficiency.
Maybe on training, but on inference they use more tokens than comparable western models.
https://artificialanalysis.ai/?output-tokens=intelligence-vs...
squidbeak
an hour ago
Deepseek V4 Flash is far cheaper still, and a better model to compare to Sonnet 4.6. I'm finding it a reliable workhorse.
anonzzzies
an hour ago
Yep, people who never used it say it is not good.
sometimelurker
41 minutes ago
sorry to nitpick (I totally agree with what ur saying btw, I run Ministral-3b on my hardware as my go-to bc I don't usually need the "smartest and most expensive models")
> This is where open source models are important
open-weights, the training data isn't public
jplusequalt
34 minutes ago
>The latest deepseek v4 pro model is 2-5x cheaper than Claude Sonnet 4.6. Cursor's Compose 2.5 that was just recently released is 6x cheaper than Sonnet.
The only way you're running Deepseek V4 with comparable quality/performance is through OpenRouter, at which point you're still susceptible to being price gouged in the future, or by spending >$20k on hardware.