minimaxir
an hour ago
Back in 2019, it was more fair to have caution around the larger GPT-2 models since robust text generation (by 2019 standards) was a complete unknown. For something like Mythos in 2026, where now the social implications of better LLMs are more understood, it's more fair to call it (EDIT: specifically, the declaration of its danger) a marketing gimmick.
Qhemlomo
an hour ago
How is this a gimmick?
It changes my whole profession on a level i couldn't even imagine how we would 'solve' software engineering.
malfist
9 minutes ago
> It changes my whole profession on a level i couldn't even imagine
I assure, it doesn't.
juleiie
41 minutes ago
For starters it makes you able to bypass having to go on Reddit to find incomplete trace of solution to some niche problem and acts as a sophisticated (but sometimes wrong) search engine. This already is worth every penny and improved my mental health immensely.
throwaway85825
29 minutes ago
Fortunately you still get the reddit experience with AI.
https://www.reddit.com/r/ChatGPT/comments/1hxa3kj/ai_reached...
realusername
an hour ago
We still didn't "solve" software engineering, try to give Claude code access to your friends or family and see what they do with it.
Qhemlomo
an hour ago
My partner wrote an android app which was doing what she wanted to do. She did this experiment 5 month ago and she did this in one day.
My wife has 0 knowledge how any of this works.
That was shocking to see.
Progress is not stoping and Fable proves that.
stanmancan
an hour ago
You can scaffold out a simple app pretty easily. Anything large or complex things break down. If you don’t know what you’re doing you end up leaking secrets like the dozens of examples we’ve seen so far.
Qhemlomo
43 minutes ago
You know what the problem is in software engineering? A LOT of people have no clue what good software engineering is.
I was working in a company before which used md5 in 2015! Databases on the internet with a 5 character password. No tests.
A person i know would have broken the whole production DB if i wouldn't have stoped the PR.
Another ex-collegue thought its okay to 'encrypt' with a basic shift cyper creditcard data.
I don't think any of these companies care that much
stanmancan
5 minutes ago
Yes the same applies to junior and inexperienced developers.
StableAlkyne
36 minutes ago
You could always do this, though.
Before gen code killed the freelance business model, there were hoards of people on Upwork/Fiverr willing to fuck other freelancers over and underpay themselves to make whatever barely-working slop you wanted.
Hell, before managers got the idea of AI layoffs, they had been off-shoring to low-quality code sweatshops for years. That was supposed to kill software engineering in the States 20 years ago. And it was just as frustrating (if not moreso) to get them to actually fulfill the project requirements.
realusername
16 minutes ago
I'd say creating a project is 5% of the job and maintaining it over time 95% of it.
It's true that they can start amazing projects without guidance but then the real work begins.
oathvz
an hour ago
This is a natural follow up question -- what kind of an escalation or message should frontier labs/companies publish to be seen as genuine and not marketing gimmick?
minimaxir
an hour ago
It's fine for the labs to publish model safety cards and stagger releases/limit it to a narrow test group as they are already doing, but saying they're doing it "because the models could be dangerous" comes off as unnecessary as best.
aesthesia
17 minutes ago
One of the main purposes of model cards, from the beginning, has been to outline the ways that a model could be harmful or dangerous, and mitigations that can be or have been taken to reduce those risks. How do you expect labs to publish model cards without talking about this rationale?
enraged_camel
an hour ago
Unnecessary based on what exactly? Your vibes?
killerstorm
an hour ago
Yeah, I'm sure Anthropic loves people switching to Codex. Brilliant marketing.