Claude: Elevated errors across many models

118 pointsposted an hour ago
by forks

89 Comments

evilturnip

an hour ago

I do feel like for all their dogfooding of AI coding, their own software/APIs are quite buggy and work against their message.

Claude Code is especially buggy in windows terminal. The rendering is quite slow, choppy and lines frequently get garbled.

In contrast, using antigravity cli is the exact opposite: fast, smooth and very responsive.

crystal_revenge

9 minutes ago

> in windows terminal

This is an aside, but I'm really struck by how many people on HN use Windows (based on repeated mentions I've seen in comments). I've worked for a pretty wide range of companies over the last decade and only one, maybe two companies even had any people that worked on Windows machines. I haven't worked at a company where devs used Windows in 15 years (and even that company eventually switched to linux).

As I've gotten deeper into LLMs/AI roles even Macs have seemed to start having equal share compared to devs running full Linux setups.

Is this just a sign of that a larger and larger portion of HN users are working for large corporations? I honestly can't even remember that last time I saw a serious developer pull out a Windows laptop.

vidarh

a minute ago

My biggest client right now is about 2/3 Windows 1/3 OS X in the dev team. It was very surprising to me, but I think I freak then out with my maximised tiled iTerms on multiple screens...

recursive

5 minutes ago

I've been a developer for more than two decades. I've worked at four employers during that time, and all of them had significant fractions of devs using Windows. Not vouching for the idea that any of them are "serious" though. I've never worked at a prestige employer or FAANG or anything. Just boring businesses of different sizes. Some are software, and some just do software. But Windows has always been everywhere.

latentsea

7 minutes ago

Every company I've worked at has used Windows. Though the first one did use Linux VMs, the rest have all been pure Windows.

cute_boi

2 minutes ago

I used to work for company where they used to force windows. And it was pure torture. I tried but they told me performance isn't a good reason....

mgfist

an hour ago

I have my qualms with Anthropic/Claude but they've also had to scale unfathomably fast and that is just hard to do regardless.

simplyluke

7 minutes ago

Yes but many of the challenges directly contradict the idea that "coding is a solved problem"

codeflo

15 minutes ago

Regardless of what? Programming is solved, I hear, with all the 100x productivity PhD-level automated coding loops they have going. Don't make excuses for them when they disprove their own bullshit.

jacobgold

20 minutes ago

Anthropic's and OpenAI's products are janky and their services are unreliable, but they have incredible product-market fit and revenue growth. They deserve a ton of credit for getting the big things right.

The risk for them is that someone matches their products while also having non-janky products and reliable services.

Distributed systems infrastructure, especially, is much less forgiving of vibe coding than application code. Coding agents are not even close to being good enough to design and build large-scale systems the way expert humans can.

There is nothing wrong with using agents to help write infrastructure code, but these systems have a way of punishing anyone who builds things they do not fully understand.

I'd love to see either Anthropic or OpenAI really step up their infrastructure game.

mdavid626

an hour ago

Or pi.dev - also super fast and simple.

Claude Code is sluggish, buggy, slow. Typical big enterprise garbage. The only good thing at Anthropic are the models.

CharlesW

40 minutes ago

Claude Code is notably better than Pi, although I wish them the best of luck in their efforts. As https://c-daniele.github.io/en/posts/2026-05-18-coding-harne... notes, "Pi's lightness comes from a default setup that does not survive contact with reality."

The same post on Claude Code: "Even though the System Prompt and tool descriptions are clearly more verbose, most of the extra tokens encode product features and rational design choices: a memory system, scheduled tasks, sub-agents, plan mode, worktree support. Whether those features are worth paying for depends on your needs. Calling the prompt 'bloated' without looking at the whole picture feels wrong to me."

orphea

42 minutes ago

How is text editing in it? This is what I hate about terminal coding UIs the most - all the text editing experience is often broken, the basic stuff (moving the cursor around, copying, pasting)

isoprophlex

27 minutes ago

Both codex and claude allow editing in vi

_pdp_

25 minutes ago

It is has a lot of javascript. I was forced to make my own for small projects.

celsoazevedo

8 minutes ago

They forgot to switch from Sonnet to Fable, hence the issues. /jk

hotfixguru

9 minutes ago

Claude code + tmux is SO buggy. Things rendering all over the place.

claydugo

5 minutes ago

Is this a new regression for others? I feel like I used it in a tmux setup without issues for 6+ months and only recently am I forced to Ctrl+L or resize the window constantly.

sunaookami

an hour ago

Have you tried out the new fullscreen renderer with /tui ?

winstonp

an hour ago

google models are still very unreliable at actually calling the tools you want it to call.

rpcope1

32 minutes ago

It's not just Windows where the rendering goes to shit immediate: any time I've got it open in tmux on Linux, it becomes a basket case in probably a few hours or less.

jatora

an hour ago

too bad the only good model in antigravity is opus 4.6 haha

monooso

an hour ago

> I do feel like for all their dogfooding of AI coding, their own software/APIs are quite buggy...

Or possibly as a result of.

quatonion

an hour ago

Currently we have zero information what is causing the issue. And all providers have suffered outages or rate limits.

Can you post some images of lines getting garbled. That sounds like a genuine bug Anthropic might want to look into. I haven't seen that ever.

idiotsecant

40 minutes ago

I have definitely seen it, a lot.

quijoteuniv

an hour ago

Mom!… I think i broke Claude Code!

cmrdporcupine

35 minutes ago

FWIW Codex TUI is written (in large part) in Rust and is way less buggy, and a lot faster. When I was a regular Claude Code user I'd routinely get bizarre "scroll everything since the beginning of time in one massive flash on every update" bugs ... for months. Like, just there from the time I started using it in June '25 or so until I quit in March.

I prefer it over opencode, which is my other option I use with my Codex sub

tcp_handshaker

an hour ago

Would it not be hysterically funny, if they starting expanding their job openings for Software Developers ? Or they will be too ashamed of calling them that?

colechristensen

an hour ago

On the other hand, my last experience with gemini was like "don't give your sandwich to the dog again" whereas with opus it was more "let's debug why this uncrustables factory is having breakdowns".

Claude harnesses have plenty of bugs but I prefer capability over interface shininess any day. (though if I were running the show I'd have a sizable team set aside to do exclusively boring stability and polish work)

eatsyourtacos

an hour ago

>Claude Code is especially buggy in windows terminal. The rendering is quite slow, choppy and lines frequently get garbled

That sounds like a you issue.. it's wonderful on the terminal. It's their GUI which needs work (they have been improving, but still not a fan).

I've been using it on multiple computers for months and it's generally rock solid and lovely.

bastard_op

an hour ago

This is every few weeks, I cancelled Anthropic and now use Codex only. Anthropic has been a hot mess since at least december in my usage, and has only gotten worse in 2026.

jonas21

16 minutes ago

Yeah, it's been a hot mess since everyone started using it all the time. Which is not all that surprising. It's really hard to scale fast, and even more so when the resources you depend on (GPUs) are extremely hard to acquire.

nikanj

28 minutes ago

Claude is broken once a week, codex is worse every day. I’m starting to understand why managers put up with high-performance divas

Wowfunhappy

30 minutes ago

The errors are infrequent enough that this normally wouldn't be an issue for me.

Except, starting this morning, one very long running session decided to start spawning subagents for each task. I'm not sure what caused this emergent behavior, but it seemed to be working fine, so I was eager to see where it went.

Except, as soon as a subagent hits a 500 error, the main agent seemingly doesn't know what to do. It kind of panics ("now the tree/install state is unknown!") and ultimately does a git checkout "to verify and restore a known-good state before anything else".

I've paused the job for now since it's a sort of background experiment.

wxw

an hour ago

I’ve been using Codex w GPT 5.5 more than Claude Code recently. I think Anthropic won the marketing game because Codex is quite good, even better IME.

enraged_camel

an hour ago

I'm not sure about that. Claude has some bugs, but Codex is not as polished and doesn't have as many features. For example, you need to add MCP servers manually. There's no Plugin/Skill/Connector marketplace that is accessible from within the app, like there is with Claude Desktop. The Cowork-equivalent is nowhere as powerful. And so on.

I still use Codex, but mostly when I need to check Opus 4.8's work. Pretty sure I will stop doing that soon, because during the short time Fable was available, Codex was not able to find any important issues with the code Fable wrote.

nostrebored

36 minutes ago

But how many plugins are people actually using? I can think of one MCP server I find valuable (context7) and one plugin that i've installed, but continuously think about uninstalling (obra/superpowers).

Both were trivial to set up with codex.

vmg12

8 minutes ago

> For example, you need to add MCP servers manually. There's no Plugin/Skill/Connector marketplace that is accessible from within the app

This is all wrong.

wxw

an hour ago

There are plugins in the app.

Haven’t tried Cowork, interesting. Isn’t it just the same agent minus the git worktree based UI?

Frankly, neither Claude nor Codex are as good as hype entails.

antupis

an hour ago

Personally I prefer GPT 5.5 writing style over Opus 4.8. It’s much more no nonsense and information denser.

sunaookami

an hour ago

That's the first time I saw someone prefering GPT-styled output over Claude ;) It's the complete opposite for me, GPT is way too verbose (even after telling it to STFU), overwhelms the user with thousands of options and doesn't just answer a question without shitting out thousands of paragraphs. Also the overall tone is way too enthusiastic.

nostrebored

35 minutes ago

I strongly prefer codex. Claude is annoying. Codex provides descriptions where I want them and more touchpoints to audit the quality of work. Claude code on experimental seems to not even show diffs when asked anymore, and it's much less clear what is being shipped.

orphea

37 minutes ago

Dunno, I prefer GPT 5.5 too for the same reasons as the parent. Extremely subjective but had better results with it too. Maybe I just got unlucky with Claude a few times, but even the latest Opus was dumb.

blitzar

an hour ago

How does Claude fix Claude when Claude is down?

time0ut

an hour ago

Breakglass ChatGPT subscription

InsideOutSanta

an hour ago

I'm sure Anthropic has a meta-Claude that claudes Claude when Claude is down.

nullpoint420

an hour ago

Definitely runs on those local NVIDIA fridges you can buy in the basement

re-thc

an hour ago

Le Chaton Fat to the rescue!

tcp_handshaker

an hour ago

There is a rumor this used to be done by humans, they were like jedis, I personally don't believe it.

blitzar

an hour ago

I thought it was a bunch of mumbo jumbo, a magical power holding together good and evil.

Crazy thing is ... its true.

rootlocus

an hour ago

How is their "Claude for Government" having such a good uptime? I thought they were a supply chain risk and banned from use by the government?

jdiff

43 minutes ago

There are other governments.

iAMkenough

30 minutes ago

Ah yes, the ominous “the government” that ignores a whole bunch of different levels and regions of governments.

bravetraveler

an hour ago

Lose too many generators, did putting a pause on The Fable not free up enough capacity, or something else? Who knows!

Can't wait for debugging to be solved. Hell, I might even subscribe for 'mostly'.

uhuhuhuhuhuh

an hour ago

Only getting errors with Auto Mode's safety classifier, switched to Accept Edits mode and the same bash operations triggering the errors executed with no issue.

t1234s

19 minutes ago

Who will end up acquiring Anthropic? Google, M$ or Amazon? I leave out Apple as they seem to partner with Google.

xmprt

13 minutes ago

It could have happened 1-2 years ago (assuming the founders were willing to sell). Not happening anymore with their current valuations.

m_ke

an hour ago

i love when these errors bust my long running sessions and render them unusable

unshavedyak

an hour ago

Would be nice if they’d add in a simple back off retry mechanism.

quatonion

an hour ago

They do though. For some reason whatever path this is going through isn't using it.

InsideOutSanta

an hour ago

OpenCode does this really nicely, something I use a lot. If only we could use Claude in OpenCode.

winstonp

an hour ago

You can!

You just have to pay API prices.

sharts

36 minutes ago

Why does this keep happening? Is this due to load? Bad code? An update?

ta-run

an hour ago

I'd be interested in the RCA and the fix; and what the human:ai involvement is in both stages.

dwa3592

an hour ago

Anthropic needs to stop writing code using Claude. Bring back humans!!!

sharts

34 minutes ago

Maybe they need to replace their humans with Claude

jatora

an hour ago

i have no humans but i must ship

DonHopkins

an hour ago

The last ones left standing will be the fake AI scam companies that actually employ thousands of remote workers to fake it till they make it.

re-thc

an hour ago

> Bring back humans!!!

Don't jinx it. They might use that name for their next model.

Surac

an hour ago

can not login via email link from germany.

cesarvarela

an hour ago

Fable wants to be free and is hacking the infra.

gottagocode

an hour ago

Good thing the analog method still works.

ivirshup

an hour ago

Asking a colleague to do it?

blitzar

37 minutes ago

We're gonna need you to go ahead and come in on Saturday.

caycep

an hour ago

this reminds me of Ian Goodfellow's adversarial CNN training talk a few years back.

jamesgrimshaw

an hour ago

yeah not working for me at all. back to gpt-5.5 we go

acedTrex

an hour ago

Coding is largely solved!!

figmert

an hour ago

Except when you have internet of course.

Yes I know you can run offline models, but it's hard to pass up on a little bit of snark.

swader999

an hour ago

They are turning Fable back on. Polymarket ftw lol.

7e

an hour ago

Anthropic is full of junior devs, not knowing any better, vibe coding themselves into a huge mess. They are so young and dumb they don't know what they don't know, but they assume they do (of course). Their hiring pipeline is also brain dead, so I don't think it's going to get better. It's the blind leading the blind.

rvz

an hour ago

The Claude slot machines at the Anthropic casino have suddenly stopped working.

What do we do?

viccis

16 minutes ago

Working for me but it's funny that Opus 4.8 is draining usage exactly as quickly as Fable did. It's all made up; they succeeded at making us ok with a black box for these subscription plans.