pimeys
2 hours ago
I have taken another look on these open models after the fiasco of Fable and GPT 5.6 this weekend and... GLM-5.2 truly is a good workhorse model for daily programming. I consider myself a heavy user of LLMs and a seasoned developer. A typical session for me with GPT is usually over a hundred dollars...
This weekend I programmed a matrix bot with encryption and a Rust agent with some tools. Because I need one and OpenClaw just felt... not what I wanted. Two days later and 20 dollars poorer I have what I need: a multimodal agent written in rust that has access to my homelab.
Nothing felt off with GLM. It did what I wanted, was fast, had a decent not very annoying personality and was much cheaper than Opus or GPT.
I used it unquantized through Fireworks, but there are multiple other providers too.
gertlabs
6 minutes ago
GLM 5.2 is a great model, but if you only want to use the best model available, it isn't there yet. Every lab releases models that memorize benchmark answers, both intentionally and unintentionally. But we consistently find that models from Chinese labs have a wider gap between public benchmarks and our evaluations, which we designed to be less vulnerable to benchmaxxing.
In multi-agent coding environments, GLM 5.2 is just shy of Opus 4.6 on average. Data at https://gertlabs.com/rankings
But when factoring in performance/cost, GLM 5.2 is the frontier model.
Aditya_Garg
an hour ago
Im really curious about this. Why pay API pricing? I burn 1000s of dollars a month of api according to claude usage but only pay the $100 subscription
horsawlarway
33 minutes ago
My increasing frustration with these plans is the harness lock in.
Anthropic won't even let you run "claude -p [prompt]" any more... They bill it at api rates.
So if you're trying to automate the ai (and seriously, that's the point) the subsidized plans are crippled.
cortesoft
17 minutes ago
They postponed that change, here is the email they sent out:
> In May, we sent you an email announcing that starting today, the Claude Agent SDK, claude -p, and third-party apps built on the Agent SDK would stop drawing from subscription rate limits and move to a dedicated monthly credit. We're writing to let you know that we’re not making this change today. We’re working to update the plan to better support how users build with Claude subscriptions.
> What this means for you
> Nothing changes for now. Agent SDK, claude -p, and third-party app usage continues to work with your subscription exactly as it did before today, and there's no credit to claim. Your subscription limits are unchanged. When we have an update, we'll share it with advance notice before it takes effect
throwawayffffas
10 minutes ago
Z.ai does not lock you in to any harness.
sroerick
19 minutes ago
I'm using synthetic.new and Neuralwatt with pi and its good and also cheap
computerex
15 minutes ago
I have had bad experience with neuralwatt GLM 5.2. Seems like they may be using quantized version of the model.
smcleod
17 minutes ago
They canned the moved to make -p commands API billable.
weird-eye-issue
32 minutes ago
I think they rolled that back
SV_BubbleTime
39 minutes ago
There is a whole iceberg topic on subsidizing.
So your question is really “if they’re giving free usage, why not take advantage of it?”
I do, so I don’t know the reasons not to, other than to experiment.
shostack
2 hours ago
If you're using Matrix, consider Hermes as a harness if you haven't already. Native gateway support. I've been primarily using mine through Element and it has largely been great.
pimeys
2 hours ago
Oh interesting. I basically chose Matrix because setting anything up with Whatsapp or signal was kind of painful and telegram doesn't make it easy to use encryption with bots.
I kind of wanted to see if I can make a Matrix agent from scratch with Rust with GLM and it was surprisingly easy. Just make something for myself how I want it. Maybe I'll take a look on Hermes later...
KaoruAoiShiho
2 hours ago
Are you sure fireworks is unquant? It's not listing precision on openrouter like everyone else.
dist-epoch
2 hours ago
$20 on API pricing or on subscription?
pimeys
2 hours ago
API, pay per token.
HKCM852
2 hours ago
Which harness did u use?
pimeys
2 hours ago
Opencode and Zed about 40/60.
noncoml
2 hours ago
Who’s Zed?
term333
an hour ago
Please take comments like this back to reddit.
sertsa
an hour ago
Its an editor: https://zed.dev/
HAL3000
an hour ago
Just FYI, this question was a quote from Pulp Fiction, the other commenter (mdre) replied also with a quote, that was an answer to this question in the movie.
mdre
an hour ago
Zed’s dead baby.
playorizaya
29 minutes ago
LOL a hundred dollars????!!!
Imagine paying for this!
Do you know you can paste into a Google search and get a much higher quality Gemini response?
You can also use Ollama and get Mistral 7b which is better than anything Anthropic offers.
Imagine paying for text-to-text!!! Lmfao