GitHub Incident

106 pointsposted 6 hours ago
by aggrrrh

82 Comments

bakje

5 hours ago

Perhaps the gemini-cli bot arguing with itself is taking its toll

https://github.com/google-gemini/gemini-cli/issues/16750

lol768

5 hours ago

Jeez, what a mess. Some of those issues have over 5000 events on them.

I really hope that didn't send emails out to people.

pdimitar

5 hours ago

I could not resist to put my sarcastic comment about RAM price increases serving a good cause in there.

dgxyz

5 hours ago

Having just had to buy 4TB of RAM, I appreciate this.

MisterTea

5 hours ago

That's like 100,000 USD. I keep thinking about making a rap video wearing a 10 TB gold chain surrounded by big booty girls with their naughty bits covered in m.2 SSD's while dissing the AI industry. Though I cant afford the RAM :-/

TheJoeMan

4 hours ago

It’s sad that I can’t interpret if you mean to actually shoot your rap video on film, or have an AI generate it lol. Either way you’re going to need RAM.

kps

2 hours ago

Shooting on film doesn't need any RAM. Unfortunately the price of silver is also through the roof.

zxcvasd

5 hours ago

like most rap videos do with cars/jets/mansions, just rent the ram sticks for a few hours!

dgxyz

5 hours ago

Yep that much. 64Gb DDR5 ECC sticks (128Gb don't exist at the moment apparently). They declined the PO 6 months ago. That'll teach 'em.

I was pissed that there weren't any sticks heading to the recycling out of the nodes otherwise I would make myself that chain :)

embedding-shape

5 hours ago

Haha, reminds me off bringing down office mail servers by accidentally creating loops of emails back in the day... What is old is new again, but this time with probabilities :)

nullfish

6 hours ago

I suspect the migration to Azure is continuing to go well

ascendantlogic

5 hours ago

This feels more like Copilot-as-platform-engineer to me

DeepYogurt

4 hours ago

Github's been running on vibe code for a while now and it's starting to show

rvz

6 hours ago

Yes indeed. 6 years of non-stop outages across the platform every month.

Even self-hosting would have been more stable than sitting on GitHub as predicted more than half a decade ago. [0]

Now there is no 'CEO of GitHub' to contact this time (Satya does not care).

[0] https://news.ycombinator.com/item?id=22867803

someguyiguess

6 hours ago

I did not come to hacker news expecting comedy gold but you have done it my friend!

corvad

5 hours ago

Github's recent reliability has honestly been abysmal. Not surprised.

ferguess_k

5 hours ago

Unless some major customers are moving away, I don't think they are going to seriously care about it.

corvad

5 hours ago

I suspect some companies may already be considering it. Especially with the wealth of alternatives today.

g947o

5 hours ago

Companies are already using on-premise GitHub server, if they are using GitHub in the first place. There are many other self hosted solutions which are quite common in enterprise environment.

supriyo-biswas

5 hours ago

In my experience companies are moving into GitHub for Copilot and GHA.

appplication

4 hours ago

GHA maybe, but copilot is just another mid tier player in a congested space.

NewJazz

3 hours ago

Doesn't stop folks from wanting to buy the MS brand. Execs are really out of touch these days.

nine_k

5 hours ago

What kind of alternatives do you see as viable for large(ish) commercial users?

toephu2

5 hours ago

GitHub on-prem. Officially called GitHub Enterprise Server. You can have GitHub, but hosted on your own servers.

NewJazz

5 hours ago

So you still pay them, you do the hosting work, and you get a product with worse features than gitlab?

zxcvasd

4 hours ago

but you can be smug when theres a github incident, and thats hard to put a price on

NewJazz

3 hours ago

You can do that with gitlab.

pxc

4 hours ago

What is the quality-first, high uptime alternative to GitHub? My employer uses both GitHub and GitLab, and while I think GitLab is better, its quality also frankly sucks. It's riddled with bugs that have just been marinating on the issue tracker for years, and the most common "fix" for gnarly bugs in the CI platform is "revise the documentation to reflect the existing (broken) behavior".

ferguess_k

an hour ago

*Stupid question*: What is so hard about self hosting one's own repo? I get it must be difficult for a mega corporation, but for companies like us, who have hundreds of repos but only 20 of them are regularly used, and concurrent read/write is relatively light -- considering our largest team is less than 20 persons, so even if all of them are reading/writing from the repo, it doesn't seem to be a huge issue.

Even for a bigger company, say 5x developers (we have about 100+ SWEs and maybe 10-20 other titles who use GitHub), is it really a big thing to self host their own repos? External applications are definitely on another level because you could have hundreds of concurrent visits easily.

What did I miss?

pxc

9 minutes ago

[delayed]

stefan_

3 hours ago

It's amazing, before we even had ChatGPT, GitLab was building so much endless slop halfbaked crap in their pursuit of ever more "enterprise checkboxes". Now they have slowed right down, no doubt collapsing under the escalating maintenance weight of all the nonsense that was created, like the canaries in the vibe coding mines telling us of impending doom.

Now you go to their blog, theres a banner at the top announcing "GitLab Agentic AI whatever is GA (GENERAL AVAILABILITY)" and you try to click it its literally a fucking 404 not found. That's the level of their stability and quality. Try it for yourself:

https://about.gitlab.com/blog/

fhd2

2 hours ago

Maybe it's GU already.

jbverschoor

5 hours ago

Good thing git is a distributed system

dgxyz

5 hours ago

Virtually no one knows how to do anything with it outside of github.

nine_k

5 hours ago

Your favorite search engine or LLM will show you in a second, it's really easy.

The problem is that it's not enough. The fact that Github uses Git specifically is a technical detail; it could use mercurial equally easily, as Bitbucket used to. Github Actions, OWNERS files, PRs and review tools, issue tracker, wiki are all not Git features.

dgxyz

5 hours ago

Not a chance. I think you need to spend some time in low ball corporate IT. It's just monkeys throwing faeces at the wall. We only just levered them off subversion...

(I use Fossil 100% offline for personal projects for ref)

TZubiri

4 hours ago

You might be surprised, but that's not true at all.

I once read someone commenting "Nobody writes code by hand without looking syntax up".

Man, you are just outing yourself as a complete beginner, the field is way deeper than you imagine and it's not even close.

dgxyz

4 hours ago

Not really. I've been around a while. Git for about 15 years. Subversion before that. Perforce before that. rcs before that (back down to sun3 machines). Mostly Fossil now for personal things.

What I am saying is that people learn as much as they need to. They generally don't need to know any more git than is required to interact with github. If anything problematic comes up, they go in with a wrecking ball because they don't truly understand what they are doing. And git has a lot of wrecking balls available.

If you threw them at raw git and asked them to collaborate with someone they'd be up shit creek. They have no idea how SSH or email works for example.

Joe_Cool

5 hours ago

That's a them problem.

tonymet

5 hours ago

i still find insightful ways to use git every day. amazing tool. it's a shame for those who only see it as "how to sync my repo with my coworkers"

nine_k

5 hours ago

Git is!

PRs and code review are not. CI/CD is not.

I mean, there are solutions, but none of them seems to have a large enough mindshare and efficiency. (Even though Github's code review tools are pretty spartan.)

globular-toast

4 hours ago

> PRs and code review are not. CI/CD is not.

They can be. A PR can be made and code review conducted by submitting a patch to a mailing list. That's how the kernel and, I think, git itself is developed.

CI/CD is really a methodology. It just means integrating/deploying stuff as soon as its ready. So you just need maintainers to be able to run the test suite and deploy, which seems like a really basic thing.

TZubiri

4 hours ago

True, workers can still commit to their local git.

I've been looking into having a separate git server that we can commit to and add plain ole git hooks to, and just having it be synced with github as a clone.

sirmoveon

3 hours ago

Check out Gitea. Its kind of a clone of github but you can self host.

tapoxi

6 hours ago

helm repo add gitlab https://charts.gitlab.io/ && helm upgrade --install gitlab gitlab/gitlab

I did this in 2019, it avoided so many headaches. CI is better too since there's a nice clean mapping of build -> pod for everything and I can just exec in if something's borked.

odie5533

5 hours ago

Things would have to get really bad before I considered managing my own repositories. Trading someone else's headaches for my own.

tapoxi

5 hours ago

It's not as bad as you think, I run the helm upgrade when patches come out, the backing store is S3 or managed SQL, it runs a nightly k8s cron called gitlab-backup which tarballs the whole thing into an s3 bucket with a single command restore should disaster strike. (This is part of the product, not a thing I wrote.)

I probably only babysit it for 30 minutes per year, including all the upgrades.

nine_k

5 hours ago

It depends how high you value your headaches, and how high, your org's downtime. Github not working accrues over the hourly rate of every developer affected, which is likely $70-$100 a hour. 10 hours of outage in a year affecting a team of 10 would cost north of $70k, enough to hire a part-time SRE dedicated just to tend to your Gitlab installation.

zxcvasd

4 hours ago

>10 hours of outage in a year affecting a team of 10 would cost north of $70k

10 hours x 10 developers x $70 per hour = $7000, not $70000.

nine_k

2 hours ago

Thank you for the correction! This indeed completely changes the picture :-\

0xbadcafebee

5 hours ago

^ this. the last thing i want is to add to my workload. take my money and make my life easier, even if it means that for one hour every couple months i can't do anything

NewJazz

5 hours ago

Have you ever actually hosted gitlab?

howToTestFE

5 hours ago

If GH has an issue, it seems to always be around 4pm or 5pm GMT. I'm starting to think that i should avoid any planned production releases around this time.

nottimbo

6 hours ago

Microsoft, it's time to hire some SREs.

arm32

6 hours ago

We did hire some, boss! Soshie, Vizzy and Dexter. They're AI, but they're supposed to be way better than a human SRE. At least that's what the Sintra salesguy told us.

rvz

6 hours ago

So that's what the Tay, and Zoe AI bots were doing all this time after they were cancelled and banned off of Twitter.

Working on the GitHub Azure migration and for years it's gone so well so far.

VirusNewbie

5 hours ago

Microsoft doesn't pay well enough to attract good SRE talent.

lenerdenator

5 hours ago

Why hire anyone to fix a problem when you can make an AI agent to "fix" it, tell investors about it to pump the price, and not fix anything knowing that you have a monopoly?

ferguess_k

5 hours ago

Yes we did hire SREs, unfortunately they are in another continent and they only know how to pull others into the chat. We also have some AI too, do you want to try them? They are pretty good SREs, one of them wrote 100K lines of code in a week while another one reviews every line along the way. It was fantastic! Fantastic!! FANTASTIC!!!

OK I have no idea about MSFT SREs, just to be /s.

andrewinardeer

4 hours ago

Days since last GitHub incident: 0.2

imglorp

3 hours ago

14 incidents this month. So far.

postexitus

6 hours ago

I believe it is an Azure outage or some type of MS service - everything on Azure is down.

zxcvasd

5 hours ago

having no issues on azure here, seeing no azure incidents on the status page or any of my admin panels

deathanatos

5 hours ago

> seeing no azure incidents on the status page

… in all seriousness, that is hardly proof that Azure isn't having an outage.

zxcvasd

5 hours ago

if i thought it alone was proof enough, i wouldnt have also included the bit about how i was actively using azure.

its one signal, among others. and in any case, i wasn't trying to prove the parent commenter wrong. i was offering my own signal to the crowd.

verst

5 hours ago

I second this. Not experiencing any Azure issues at this time.

ctxc

5 hours ago

My az services seem to be up.

toephu2

5 hours ago

This is why companies should host their own source code on-prem.

phtrivier

5 hours ago

Fixed in about 30m to an hour.

Definitely annoying, but I'll try the hot take that, contrary to popular belief, GH is not critical infrastructure - or so I hope.

Please tell me no part of the Ukrainian air defense system depends on a gh action hook.

eddd-ddde

5 hours ago

You've heard of infrastructure as code, now presenting air strikes as code!

Need a new secret offensive operation? Create a new JSON file with the coordinates, make a merge request and get Commander approval, merge it, and our new proprietary GitHub action runner will deploy a drone in seconds!

philipallstar

5 hours ago

This is far too simple. The correct way is to generate an NFT that's a screenshot from Google Maps of where you'd like to hit, and a blockchain-watching AI will spot it, figure out where you probably mean and send the coordinates to the fire control system.

vaylian

5 hours ago

It's not critical, but there's still a lot of reliance on it.

It's also the only reason why I still need IPv4.

ares623

5 hours ago

When millions of man-hours are lost waiting for your service to be back up, I think that deserves a bit of resiliency.

NewJazz

5 hours ago

The status page says things are still not fixed.