Cyan488
6 hours ago
> "The tool itself worked properly and functioned as intended; however due to a bug in a separate code path, the system did not properly verify that the email address provided by the individual requesting a password reset matched the email address associated with that user’s Instagram account," said Meta in its breach notice.
I'm not sure "worked properly" and "as intended" accurately describe this situation.
vb-8448
5 hours ago
In italian we say "l'operazione è riuscita perfettamente, ma il paziente è morto" -> "the surgery was a complete success, but the patient died"
embedding-shape
5 hours ago
Both this and what Meta said reminds me of "Clarke and Dawe - The Front Fell Off" (https://www.youtube.com/watch?v=3m5qxZm_JqM)
I also can't believe the people who were involved with writing this response from Meta, didn't realize how obviously bad it sounds. It's like there is no humans working and writing there anymore.
vb-8448
4 hours ago
> It's like there is no humans working and writing there anymore.
Don't know if AI is to blame, but I've used to see these kinds of nonsense post-mortems even in the pre-llm era, and it's always due to some internal fighting ongoing between various departments.
Arainach
15 minutes ago
> like there is no humans working and writing there anymore
Meta has never been a place for people with empathy to thrive or succeed. They literally enabled a genocide. Despite being warned by internal employees, profits were more important.
rothfuss
5 hours ago
I was reminded of the Murray Walker quote. “There's nothing wrong with the car except it's on fire”
lelandfe
4 hours ago
My dad says, "But other than that, Mrs. Lincoln, how was the play?"
(Usually said jocularly when everyone is at their most upset, e.g. a vacation ruined)
RRWagner
4 hours ago
A friend said at one of those moments, "And other than that, how was the play Mrs Lincoln?" And the 3rd person replied, "I don't know, I've never seen the play 'Mrs Lincoln'"
maxall4
an hour ago
“The strait of Hormuz is open so long as Iran does not fire missiles at ships.”
prinny_
4 hours ago
Does it matter if the response is tone deaf or simply misguided? I am a bit nihilistic here, but in one week absolutely nobody will be talking about this. Are the affected individuals going to abandon instagram? Are people going to reduce their usage out of concern for the safety of their accounts? Nothing will happen, hence there is no need for actual humans writing a good, well intended response.
vb-8448
4 hours ago
> Does it matter if the response is tone deaf or simply misguided?
I agree with you that in a week nobody will be talking any more, but I'm pretty sure it's a GDPR data breach, and they can have some trouble within EU.
Yeah, they probably don't give a fu.. about EU, but if the response doesn't matter at all why did they spend time on it?
raffael_de
5 hours ago
"operation successful, patient dead."
ChuckMcM
3 hours ago
Read that as "worked as written" and "we disclaim any consequential or incidental damages and do not warrant this software."
I continue to believe we could fix a lot of things in the US if we updated the UCC[1] to disallow 'disclaiming liability on software used in a product.'
[1] Universal Commercial Code -- https://www.law.cornell.edu/ucc
jjmarr
3 hours ago
I've always wanted to expose myself to unlimited legal liability by distributing open source software.
Terr_
2 hours ago
That seems like a false-dichotomy between two extremes when there's all sorts of space in the middle... It's also assuming developer-to-developer tools would have the same rules and exposure as in service-to-consumer.
If I sell a physical motor (let alone plans for one) I'll have some liability for things like it Not Exploding. If someone buys a dozen of those motors to assemble a tragically unsafe "rollercoaster" of their own design and construction, I'm almost certainly not responsible for any terrifying decapitations.
In other words, most of the world already does not rely on the issuance of "Get Out Of Infinite Liability Free" cards.
ChuckMcM
2 hours ago
Exactly this. (and it is a false dichotomy to argue infinite liability).
To Terr_'s point, if you were publishing open source you would also publish exactly the things you intended it to be used for and anything else would violate your warranty (possibly implied) that it does what the documentation says it does.
There is a huge amount of tort law that covers exactly when it becomes a problem for you the creator vs you the user in your own project. And that liability is also based on once you know something bad could happen you make an effort to notify people[1].
[1] https://www.cpsc.gov/Newsroom/News-Releases/2026/Clorox-Agre...
Ajedi32
2 hours ago
Software can be copied infinitely, so even $1 of liability is effectively infinite since an unlimited number of people can potentially use it and sue you when it blows up.
Nobody's going to be distributing software on the internet for free if the cost of insurance alone precludes that.
ChuckMcM
an hour ago
This is not how liability works, anywhere. So I write a piece of code that "makes your screen do cool things" and it causes the power supply to fail on those screens. Someone reports that bug to me and I check it out and say "Oh, shit it does break power supplies." Then I immediately put a notice on and in the code that says "WARNING: This code will break the power supply of your montitor." And I put that warning in the repo. And if there is a Discord or a mailing list I tell everyone "Hey, this is important, if you run this code it can break your monitor."
Guess what, I'm not liable for the damage. Why? Because I immediately responded once I knew that it could, I made a good effort to warn people who might already have the code of the risk, and I made it clear in the code that this risk is there.
Ever wonder why you get a booklet of warnings when you buy a product with even really stupid things like "Don't clean with gasoline" warnings? That's because once you have discharged your duty to warn you are not longer liable in what happens if someone ignores your warning.
The flip side is also true, you cannot say in your product both "Hey this product does these cool things" and "We don't warrant the product to actually do anything." This is especially true if there is money involved (like your user paid your some $ for the product.) There is always an implied warranty that the thing will do what you says it will do, which exists as long as the user has heeded all your warnings.
aleqs
an hour ago
There's a pattern I noticed, especially on this site, where people claim various VC/ad/tech dark patterns, enshitification, privacy violations, dishonest marketing, etc MUST be allowed, otherwise open source or 'the internet' will face some sort of existential risk.
No bro - open source and the internet existed long before SV tech parasitism did and will exist long after.
ChuckMcM
an hour ago
I don't disagree, that pattern exists, but it is essentially true. Just not in the way the folks saying it is true understand it. If the "VC/ad/tech dark patterns, enshitification, privacy violations, dishonest marketing, Etc." wasn't allowed then their job might not exist. That can be true. What is missed is that if there is value in the thing, then it will exist.
When I reflect back to someone making this argument by saying, "So your argument is that you make your living as a pick pocket, but if pick pocketing is made to be illegal, you won't be able to make a living." Which of course would only be true if they only thing they could do was 'be a pick pocket'. Its a very common rhetorical technique to argue that the status quo cannot be changed. All the arguments that "you'll put all coal miners out of business if you require only green energy" And yet the people, the miners themselves, will likely be fine. The firms might not, but there are other firms that could exist.
This isn't a new problem, or one specific to this web site, although it does get disproportionately hit because so many technology companies saw what Google started in the 2000's and said, "Man there is soooo many ways to get money for this." rather than, "Is this a reasonable way to make money? Sure it is 'perfectly legal' but is it right? Is it moral?" The type of person who thinks that something is "Only illegal if you get caught" is neither moral nor particularly concerned about what is right. And we got a lot of that type.
aleqs
30 minutes ago
Very well put.
ncallaway
2 hours ago
Would that be software used in a product? I don't think that would qualify?
nkrisc
6 hours ago
The tool worked correctly and as intended, but due to a bug it did not work correctly nor as intended.
thih9
5 hours ago
To be fair, that quote in the original article could have more context. By "The tool" they meant "AI-assisted support tool"[1]; perhaps they meant that the issue was not an AI hallucination inherent of the tool, but a fixable bug.
[1]: https://www.documentcloud.org/documents/28202858-meta-ai-ag-...
nvme0n1p1
5 hours ago
In that case, the statement is so meaningless as to be useless. Why should we care how Meta splits up their microservices? The tool still failed. They just want to redefine the "tool" as something else, anything else, to avoid having to admit something negative about their precious AI.
> The LLM correctly generated tokens according to user input, however due to a bug in a separate code path, the system did not properly verify the email address
> Nginx correctly handled the user requests according to the HTTP standard, however due to a bug in a separate code path, the system did not properly verify the email address
csallen
4 hours ago
I mean, I think many of us are curious and enjoy hearing more details about how and where bugs like this occur. What's wrong with that?
nvme0n1p1
4 hours ago
I'd love to read a proper technical post-mortem, but this obviously isn't it. It's a carefully-worded statement from a lawyer meant to minimize liability and reputational damage to the company.
albedoa
2 hours ago
There is nothing wrong with that, and nobody is saying there is. In fact, it is exactly what is being requested here!
theptip
5 hours ago
Sounds like they are saying the agent did not malfunction, and this vuln could have been triggered by a human support agent too.
mikeocool
an hour ago
Kind of interesting that LLMs are basically being sold as having “human-like” reasoning capabilities, but in this case when “obamawhitehouse” asked to have it’s password reset sent to bob12345667@gmail.com the LLM didn’t question it and just triggered the process that happened to have a bug.
Humans support agents certainly fall prey to social engineering all the time, but I can’t think of a case where it was done on this scale so easily.
trehalose
5 hours ago
It probably could have been, but how likely is that compared to with the AI agent? I'd assume (and I'm ready to look like an idiot if I'm wrong) that the humans are trained to send the verification code to the email address on file, rather than any address the client asks them to. I'd certainly assume most of them are more afraid of the consequences than the AI is.
theptip
an hour ago
For sure. Social engineering attacks on human support staff are common and well known, but the skill floor is non-trivial; you need to actually be able to convince a human of your ruse.
Having a support agent likely made it easier to enumerate the vuln, and certainly made it easier to scale out exploitation once it was discovered.
dd8601fn
an hour ago
I think they’re blaming a tool function so as not to admit the overall agent process was shit.
But it’s irrelevant, outside of PR. We know at least THREE bad components to this process and they were constituent parts.
TZubiri
5 hours ago
I get the joke, but it's a relevant nuance that the new code, the chatbot, did not have 'the bug'. I still think that the mistake and head that should roll should be the one that published the chatbot.
But it's important to acknowledge that there was a 'bug' in an underlying tool and not in the chatbot, and still PIP/fire those responsible for publishing the chatbot and exposed an otherwise internal tool to the public, and not those that introduced the 'bug' to an internal tool.
nico
6 hours ago
That sounds a lot like the justifications Claude and ChatGPT give when confronted about something they did wrong, or when asked to provide a customer support response about software issues
dmoose
5 hours ago
I've lost track of the number of times Claude has basically said "it was like that when i got here" in the face of a clearly bogus choice and easily disproved explanation.
Chu4eeno
2 hours ago
You need to hit the retry/regenerate button more, it's there for a reason.
While the "stochastic parrots" thing is a bit overblown, IME most LLMs tend to surprisingly different responses even without changing the context, especially if they're hallucinating or doing something "wrong".
AlienRobot
4 hours ago
They should add a feature called "auto-really" that just automatically says "really?" after the chatbot answers a question to check if it's going to 180 upon this tiniest bit of scrutinity.
lou1306
3 hours ago
You joke but this is almost literally what Chain-of-Thought does, at least in the early days. They basically just added "Wait," to the model's output and fed it back to the model iirc
AlienRobot
an hour ago
This can't be a trillion dollar industry...
MattDaEskimo
a minute ago
Calling it a "bug in a separate code path" is so strange.
Are they indicating that their internal services were always compromised, it just took some AI with tool calling to discover it?
RobRivera
6 hours ago
Oh it was a downstream dependency. The tool worked, it was the downstream dependency. Glory to Arstotszka
moffkalast
3 hours ago
Tool so great, downstream dependency not required! Right?
Cpoll
6 hours ago
The argument here is that the AI is a glorified input page. The input field asks for your username and email and sends it to a backend function. Such an input page is working as intended.
The problem is when the backend function doesn't verify that the email matches the username.
dgoldstein0
5 hours ago
Why on earth would the backend function even take an email?
Or perhaps said different: use the submitted info to identify the account; send any sensitive messages (recovery codes, password resets whatever) to only the contact info on file. If the chat bot can send such email it should do so via an API that sends only to contact info on file for the associated account and not to an email that's provided by the bot.
duskwuff
5 hours ago
> Why on earth would the backend function even take an email?
In principle, it could be designed to do so to handle cases where a new email address has been confirmed out of band, e.g. for an account representing a company or a political office. But that's a relatively unusual situation, not something you'd want to be available to every user writing in. (Even if you had an all-human support department, this sort of functionality would only be available to a select few agents.)
jgalt212
6 hours ago
Fair enough. Never trust client-submitted browser form, but always trust LLM-submitted form.
lou1306
3 hours ago
If the backend function was so poorly coded to allow such a gargantuan security hole, then it is an even worse problem. Basically Meta is throwing its own engineers under the bus so that its AI chatbot can save face. Scary stuff.
Unless the backend was _also_ vibe-coded, in which case it is still an AI problem.
ofjcihen
6 hours ago
Maybe they’re communicating exactly what it sounds like and are just owning up to being complete morons?
tyleo
37 minutes ago
Error: Success!
ludwik
6 hours ago
I like to dunk on Meta as much as the next guy, but I think this makes sense: deterministic verification like this is not, and should never be, the LLM’s job. The tools it has access to should enforce the permissions layer, ensuring that the LLM can never perform actions the user themselves should not be allowed to perform. In this case, the tool failed to do that.
TZubiri
5 hours ago
>deterministic verification like this is not, and should never be, the LLM’s job.
But when humans handled it, this was not as much as a problem. That is, the humans did the job, because they recognized the need to do that job.
Sure sometimes accounts could get recovered if a human was tricked, but evidently it was easier to trick the LLM in masse than humans.
ajross
5 hours ago
> But when humans handled it, this was not as much as a problem.
In fact it's arguably a feature. The ability of support staff to short-circuit nitpicky rules when there's an obvious external validation happening (e.g. you're on the phone with a user who's presenting ID in real time and correlating it with previous use of the account, etc...) makes for better data quality and happier customers.
Obviously, yes, you can then human-engineer an authentication breach. But that was very difficult, because people are "common-sense careful" in a way we haven't been able to tease out of AI yet.
az226
2 hours ago
I'm sure. It was not working properly nor as intended.
stephenhuey
3 hours ago
What was that mantra? Something about broken software is what they aim for?
totetsu
6 hours ago
Then ‘ The tool itself’ was not appropriate to the job in the first place
teaearlgraycold
3 hours ago
> The tool itself worked properly and functioned as intended
The author of the post is close to the author of the AI code on the org chart
> however due to a bug in a separate code path, the system did not properly verify
The author of the post is far from the author of this "code path" on the org chart
laweijfmvo
6 hours ago
so how long was the bug there? was there a way to access it before/without the support agent? it feels like Meta will throw anything under the bus to redirect blame from the AI, because that would be the end of their $600B (depending on “which number you want to go with”) experiment
tomkarho
5 hours ago
How very Wernher von Braun of them.
dboreham
3 hours ago
There should have been a test case for this. There wasn't because most shops don't actually test their product. They do some test theater such as unit testing.
cynicalsecurity
3 hours ago
This-is-fine.jpg
saltyoldman
4 hours ago
Isn't that exactly what they said when Cambridge Analytics data gathering happened?
TZubiri
5 hours ago
Of course.
What I gather is that this internal tool was used by human support agents, and it was their responsibility to verify the email adresses and general validity of a claim.
But when implementing AGI TM that was overseen, maybe the oversight in the separate code path was a 'bug', but the mistake was making the chatbot obviously, if the separate code path had a bug, then it had become ossified into a feature, and it was internal, not exposed to the public.
This is an external communication, to save face sure, but if this is the internal excuse, it would be absolutely the wrong RCA and it reads as if the one who made the mistake is not admitting they made their mistake. Which to be honest, just making the mistake is enough to get fired, but not admitting it is enough to get ultra fired.