hackernews client

Survey: a third of senior developers say over half their code is AI-generated

213 pointsposted 4 days ago

386 Comments

Rochus

3 days ago

The article claims, that senior developers with over 10 years of experience are more than twice as likely to heavily rely on AI tools compared to their junior counterparts. No p-values or statistical significance tests are reported in either The Register article or Fastly's original blog post.

I have over 30 years of experience and recently used Claude Opus 4.1 (via browser and claude.ai) to generate an ECMA-335 and an LLVM code generator for a compiler, and a Qt adapter for the Mono soft debugging protocol. Each task resulted in 2-3kLOC of C++.

The Claude experience was mixed; there is a high probability that the system doesn't respond or just quickly shows an overloaded message and does nothing. If it generates code, I quckly run in some output limitation and have to manually press "continue", and then often the result gets scrambled (i.e. the order of the generated code fragments gets mixed up, which requires another round with Claude to fix).

After this process, the resulting code then compiled immediately, which impressed me. But it is full of omissions and logical errors. I am still testing and correcting. All in all, I can't say at this point that Claude has really taken any work off my hands. In order to understand the code and assess the correctness of the intermediate results, I need to know exactly how to implement the problem myself. And you have to test everything in detail and do a lot of redesigning and correcting. Some implementations are just stubs, and even after several attempts, there was still no implementation.

In my opinion, what is currently available (via my $20 subscription) is impressive, but it neither replaces experience nor does it really save time.

So yes, now I'm one of the 30% seniors who used AI tools, but I didn't really benefit from them in these specific tasks. Not surprisingly, also the original blog states, that nearly 30% of senior developers report "editing AI output enough to offset most of the time savings". So not really a success so far. But all in all I'm still impressed.

epolanski

3 days ago

Imho your post summarizes 90% of the posts I see about AI coding on HN: not understanding the tools, not understanding their strenghts and weaknesses, not being good at prompting or context management yet forming strong(ish) opinions.

If you don't know what they are good at and how to use them of course you may end up with mixed results and yes, you may waste time.

That's a criticism I have also towards AI super enthusiasts (especially vibe coders, albeit you won't find much here), they often confuse the fact that LLMs often one shot 80% of the solutions with the idea that LLMs are 80% there, whereas the Pareto principle well applies to software development where it's the hardest 20% that's gonna prove difficult.

Rochus

3 days ago

I'm pretty good at prompting and I successfully use Perplexity (mostly with Claude Sonnet 4) to develop concepts, sometimes with the same session expanded over several days. I think the user interface is much superior over Claude.ai. My hope was that the newer Claude Opus 4.1 would be much better in solving complicated coding tasks, which doesn't seem to be the case. For this I had to subscribe to claude.ai. Actually I didn't see much difference in performance, but a much worse UI and availability experience. When it comes to developing a complex topic in a factual dialogue, Claude Sonnet Thinking seems to me to be even more suitable than Claude Opus.

epolanski

3 days ago

I'll be more detailed in my second reply.

1) Your original post asks a lot if not too much out the LLM, the expectation you have is too big, to the point that to get anywhere near decent results you need a super detailed prompt (if not several spec documents) and your conclusion stands true: it might be faster to just do it manually. That's the state of LLMs as of today. Your post neither hints at such detailed and laborious prompting nor seem to recognize you've asked it too much, displaying that you are not very comfortable with the limitations of the tool. You're still exploring what it can and what it can't do. But that also implies you're yet not an expert.

2) The second takeaway that you're not yet as comfortable with the tools as you think you are is clearly context management. 2/3k locs of code are way too much. It's a massive amount of output to hope for good results (this also ties with the quality of the prompt, with the guidelines and code practices provided, etc, etc).

3 days ago

Would you agree with the claim that emacs/vim is an inadequate tool, since it has such a high learning curve?

Prior to LLMs, my impression was "high learning curve, high results" was a pretty popular sweet-spot with a large portion of the tech crowd. It seems weird how much LLMs seem to be an exception to this.

gammarator

3 days ago

Emacs and vim have complex interfaces that have been stable for decades. Seems like every new flavor of LLM requires learning its warts and blind spots from scratch.

cztomsik

2 days ago

The situation has improved a little bit over the last few months but LLMs are still only barely usable in languages like C/C++/Zig - and it's not about prompting. I would say that LLMs are usable for JS/Python and while the code is not always what I'd write myself, it can be used and improved later (unless you are working on perf-sensitive JS app, then it's useless again).

3 days ago

So, I am paying $20, for a glorified code generator, that may or may not be correct, to write a small function that I can do for free, and be confident about the correctness, if I have not been lazy to implement a test for it.

If you point out, with test it's also the same with any AI tool available, but to come to that result, I have to continuously prompt it till it gives me the desired output, while I may be able to do it in 2/3 iterations.

Reading documentation always made me little bit knowledgeable than before, while prompting the LLM, gives me nothing of knowledge.

And, I also have to decide which LLM would be good for the task at hand, and most of them will not be free (unless I use a local, but that will also use GPU, and add an energy cost)

I may be nitpicking, but I see too many holes with this approach

3 days ago

Certainly, So I use an IDE, IntelliJ Ultimate to be precise.

None of the use-cases you mention requires LLM. Just available as IDE functionalities.

IntelliJ has LLM based auto complete, with which I am okay, But it still wrong too many times. Works extremely well with Rust. Their non-llm autocomplete is also superb, which uses ML for suggesting closest, relevant match, IIRC.

It also makes refactoring a breeze, I know what it's going to do exactly.

Also, it can handle database refactoring to a certain capacity! And for that it does not require LLM, so no nondeterministic behavior.

Also, the IDE have its own way of doing http requests, and it's really nice! But, I can use their live template to do autocomplete any boilerplate code. It only requires setting once. No need to fiddle with prompts.

mattacular

3 days ago

> The best way I've found is to have it write small functions, and then I tell it to compose them together.

Pretty much how I code without AI, except making my brain break the problem down into small functions and expressing them in code rather than as a chat.

Rochus

3 days ago

It always asks before running commands unless you whitelist them. I have whitelisted running testsuites and linters, for example so it can iterate on those corners with minimal interaction. I have had to learn to let it go ahead and make small obvious mistakes rather than intervene immediately because the linters and tests will catch them and Claude will diagnose the failure and fix them at that point.

Anyway I took a small toy project and used that to get a feel for claude-code. In my experience using the /init command to create CLAUDE.md (or asking Claude to interview you to create it) is vital for consistent behavior.

I haven't had good "vibe" experiences yet. Mostly I know what I want to do and just basically delegate implementation. Some things that have worked well for me is to ask Claude to propose a few ways to improve or implement a feature. It's come up with a few things I hadn't thought of that way.

Anyway, claude-code was very good at slowly and incrementally earning my trust. I resisted trying it because I expected it would just run hogwild doing bewildering things, but that's not what it does. It tends to be a bit of an asskisser in it's communication style in a way that would annoy me if it were a real person. But I've managed to look past that.

kace91

3 days ago

On Claude you specifically accept any attempt to use a terminal command (optionally whitelisting) so there’s no risk that it will push force something or whatever. You can also whitelist with granularity, for example to enable it to use git to view git logs but not commit.

You can just let it work, see what’s uncommitted after it’s over, and get rid of the result if you don’t like it.

kelnos

3 days ago

> I have no confidence yet in allowing AI direct (write) access to my repositories.

You don't need to give it write access to your repositories, just to a source tree.

boesboes

3 days ago

I've been trying it for a couple of months, I can't recommend it either tbh. It's frustrating as hell to work with: super inconsistent, very bad at following its own instructions, wasteful and generally unreliable.

3 days ago

I hate to play the "you're holding it wrong" card, but when I started, I had more or less the same experience. Eventually you start to learn how to better talk to it in order to get better results.

Something I've found useful with Claude Code is that it works a lot better if I give it many small tasks to perform to eventually get the big thing done, rather than just dumping the big thing in its lap. You can do this interactively (prompt, output, prompt, output, prompt, output...) or by writing a big markdown file with the steps to build it laid out.

JeremyNT

3 days ago

While this matches my experience, it's worth mentioning thar the act of breaking a task up into the correct chunk size and describing it in English is itself a non trivial task which can be more time consuming than simply writing the actual code.

theshrike79

3 days ago

This, to me, feels like you're complaining to the 45 year old builder that they should be using a hammer instead of a nail gun.

I know how to nail a nail, I've nailed so many nails that I can't remember them all.

My job is to build a house efficiently, not nail nails. Anything that can make me more efficient at it is a net positive.

Now I've saved 2 hours in the framing process by using a nail gun, I have 2 extra hours to do things that need my experience. Maybe spot the contractor using a nail plate in the wrong way or help the apprentice on their hammering technique.

darkwater

3 days ago

IMO it's different. That's why I brought the e-bike similitude: climbing even mild mountains or hills with your own legs will actually make your legs, heart and lungs stronger in the process. So you get both the wonderful views (building the house or delivering the software) but also you get improved health (keeping your mind trained on both high level thinking and low level implementation vs high level only). We might say that using a hammer constantly will develop more your muscles, but in carpentry there are still plenty of manual work that will develop your muscles anyway. (and we still don't have bricks laying machines)

LinXitoW

3 days ago

Ironically, e-bikes, at least in the EU, are having the exact opposite effect. More people that don't normally ride bikes are using e-bikes to get about. The motor functions not as a replacement, but as a force multiplier. It also makes "experimenting" easier, because the motor can make up for any mistakes or wrong turns.

Caveat: In the EU, an e-bike REQUIRES some physical effort any time for the motor to run. Throttles are illegal.

aleph_minus_one

3 days ago

> Ironically, e-bikes, at least in the EU, are having the exact opposite effect. More people that don't normally ride bikes are using e-bikes to get about.

At least in Germany people rather joke that the moment e-bikes became popular, people began to realize that they suddenly became too unathletic to be capable of pedaling a bicycle. I know of no person who uses an e-bike who did not ride an ordinary bicycle before.

> In the EU, an e-bike REQUIRES some physical effort any time for the motor to run.

The motor must shut off when 25 km/h is reached - which is basically the speed that a trained cyclist can easily attain. So because of this red tape stuff, e-bikes are considered to be useless and expensive by cyclists who are not couch potatoes.

theshrike79

2 days ago

But they still cannot assist 100%, there needs to be effort from the user.

Otherwise they would be considered e-scooters and would have different rules and regulations applied.

darkwater

2 days ago

Yes, you pedal with almost zero effort when flat and with a little more effort when uphill. Obviously if you are going to go up the Tourmalet you will run out of battery pretty soon, but that's not the context most e-bikers use them.

sillyfluke

2 days ago

In that it fits the LLM situation quite well. LLMs remove the anxieties around coding for newbies at scale better than they make indisputable productivity gains for senior developers, similar to how e-bikes help with newbies more than cyclists.

darkwater

3 days ago

I know that's what many people, especially elder one, say but this is still a hill I will die on :) They are mostly used to go in mostly flat roads, like some slow-speed motorcycle that needs some low effort. The ones using them outside paved road, using them as multipliers, are the ones that already did mountain biking when they were younger and they want to continue doing it at a higher level their age would permit without (which it's perfectly fine!).

beaugunderson

a day ago

this is a persistent myth about eMTBs... they are still great for exercise! here's a study showing you get 94% of the heart rate on an eMTB compared to the same route on a non-eMTB: https://www.americantrails.org/resources/pedal-assist-mounta...

closewith

3 days ago

> So you get both the wonderful views (building the house or delivering the software) but also you get improved health (keeping your mind trained on both high level thinking and low level implementation vs high level only).

The vast majority of developers aren't summitting beautiful mountains of code, but are instead are sifting through endless corporate slop.

> We might say that using a hammer constantly will develop more your muscles, but in carpentry there are still plenty of manual work that will develop your muscles anyway.

The trades destroy human bodies over time and lead to awful health outcomes.

Most developers will and should take any opportunity to reduce cognitive load, and will instead spend their limited cognitive abilities on things that matter: family, sport, art, literature, civics.

Very few developers are vocational. If that is you and your job is your identity, then that's good for you. But don't fall into the trap of thinking that's a normal or desirable situation for others.

AlecSchueler

3 days ago

> The vast majority of developers aren't summitting beautiful mountains

I'm not sure you're approaching this metaphor the right away. The point is that coding manually is great cognitive exercise which keeps the mind sharp for doing the beautiful stuff.

> The trades destroy human bodies over time and leads to awful health outcomes.

Again, you're maybe being too literal and missing the point. No one is destroying their minds by coding. Exercise is good.

johnisgood

3 days ago

I am using LLMs, too, and I do not consider myself thinking less. I still have to be part of the whole process, incl. architectural process among other things that require my knowledge and my thinking.

AlecSchueler

3 days ago

I use them too and actually agree with you that the cognitive load is somewhat comparable. I was only pointing out what seemed like an abuse of the metaphor.

closewith

3 days ago

> I'm not sure you're approaching this metaphor the right away. The point is that coding manually is great cognitive exercise which keeps the mind sharp for doing the beautiful stuff.

No, I'm challenging the metaphor. Working the trades isn't exercise - it's a grind that wears people out.

> Again, you're maybe being too literal and missing the point. No one is destroying their minds by coding. Exercise is good.

We actually have good evidence that the effects of heavy cognitive load are detrimental to both the brain and mental health. We know that overwork and stress are extremely damaging to both.

So reducing cognitive load in the workplace is an unambiguous good, and protects the brain and mind for the important parts of life, which are not in front of a screen.

AlecSchueler

3 days ago

> We actually have good evidence that the effects of heavy cognitive load are detrimental to both the brain and mental health. We know that overwork and stress are extremely damaging to both.

I don't think this is fair either, you're comparing "overwork and stress" to "work." It's like saying we have evidence that extreme physical stress is detrimental ergo it's "unambiguously" healthier to drive than to walk.

Maybe you could share your good evidence so we can see if normal coding tasks would fall under the umbrella of overwork and stress?

closewith

3 days ago

We have plentiful evidence and studies on the effect of even moderate day-long cognitive work has on cognitive ability and on the effect of stress.

This is so well-founded that I do not have to provide individual sources - it is the current global accepted reality. I wouldn't provide sources for the effect of CO2 emissions on the climate or gravity, either.

However, the opposite is not true. If you have evidence that routine coding itself improves adult brain health or cognitive ability, please share RCTs or large longitudinal studies showing net cognitive gains under typical workloads.

3 days ago

Exactly, not every programmer is an artesan who hones their craft on and off the clock.

Some people just write code for 8 hours, go home and never think about it on their free time.

theshrike79

3 days ago

But LLMs can make the soul crushing part so much easier.

I need to add a FooController to an existing application, to store FooModels to the database. The controller needs the basic CRUD endpoints, etc.

I can spend a day doing it (crushing my soul) or I can just tell any Agentic LLM to do it and no something that doesn't crush my soul, like talk with the customer about how the FooModels will be used after storing.

"But it'll produce bad code!"

No it doesn't. It knows _exactly_ how to do a basic CRUD HTTP API controller in C#. It's not an art form, it's just rote typing and adding Attributes to functions.

Because it's an Agentic LLM, it'll go look at another controller and copy its structure (which is not standard globally, but this project has specific static attributes in every call).

Then I review the code, maybe add a few comments for actual humans, commit and push.

My soul remains uncrushed, client is happy when I delivered the feature on time and I have half the day off for other smaller tasks that would become technical debt otherwise.

darkwater

3 days ago

It completely depends on how you use AI. If you turn off your brain entirely and just coast as much as possible, then yeah your comment would apply.

But I think of work as essentially two things - creative activity and toil. I simply use AI for toil, and let my brain focus on creativity and problem solving.

Writing my 100,000th for loop is not going to preserve my brain.

cpursley

3 days ago

This is why I recommend the functional programming map pattern ;)

watwut

3 days ago

> are you all aware that this basically means you are exercising less your brain day in and day out, and in the end you will forget how to do things?

IDE did the same thing and we found other ways to exercise our brains. This one is really something unreasonable to worry about.

ehnto

3 days ago

I have been coding and athletic training for about as long as eachother, your anecdote works. However just like in physical training, you should really spare your energy for stuff that you enjoy and actually progresses you.

By using LLMs to do some of the stuff I have long gotten over, I have a bit more mental energy to tackle new problems I wouldn't have previously.

As well LLMs just aren't actually that competent yet, so it's not like devs are completely hands off. Since I barely do boilerplate as I work across legacy projects, there's no way Claude Code today is writing half my code.

fragmede

3 days ago

Do you sit there multiplying two digit numbers in your head for fun, for the practice, to keep operating at peek mental capacity on weekends? In the name of operating at peek mental capacity, that seems like the most logical thing to do. Just wake up at 6 am Saturday morning and start multiplying numbers on your head.

If you don't wanna use AI, that's entirely up to you, but "you're gonna forget how to program if you use AI and then whatever are you going to do if the AI is down" reeks of motivated reasoning.

jvanderbot

3 days ago

My current pattern is to manually craft during the first half of the day when I enjoy that, and during the second half when I'd be normally burnt on hard thought and not quite up for another coffee, pomodoro, theanine deep dive, I can start tackling tests, exploratory data analysis, or small bugs, and these tasks are 50% or more LLM.

So yeah, 30%-50% seems right, but it's not like I lost any part of my job that I love.

darkwater

3 days ago

That's a good approach, I like it and probably adopt it as well. I'm not dogmatically against LLMs, I just need we should think about possible consequences, and not thinking about them like a holy grail of everything.

rr808

3 days ago

You're right, but usually at the end of the day I'm completely mentally exhausted and dont want to talk to anyone. Its something I've realized is a big problem in my life. I'm actively trying to reduce mental load to leave stuff for other hobbies and social activities.

wiz21c

3 days ago

you are obviously not past 50 years old. At that age, eventhough I "train" (as you imply it) a lot (I code about 50 hours a week, 40 at work, the rest for my own projects; not of them easy, my job is writing numerical code for simulation and my pet project is writinga very accurate emulator (which implies physics modelling, tons of research, etc)). I can definitely feel I'm not as productive as before (before I could sustain 60 hours a week) eventhough I do feel I'm using my brain to the maximum...

So yeah, past a certain age, you'll be happy to reduce your mental load. No question about it. And I feel quite relieved when Claud writes this classic algorith I have understood long ago and don't want to re-activate in my brain. And I feel quite disappointed when Claude misses the point and I have to code review it...

cmrdporcupine

4 days ago

Strangely I've found myself more exhausted at the end of the week and I think it's because of the constant supervision necessary to stop Claude from colouring outside the lines when I don't watch it like a hawk.

Also I tend to get more done at a time, it makes it easier to get started on "gruntwork" tasks that I would have procrastinated on. Which in turn can lead to burnout quite quickly.

I think in the end it's just as much "work", just a different kind of work and with more quantity as a result.

daveguy

4 days ago

Now we just need an AI that helps with more quality.

cmrdporcupine

3 days ago

No, what I want better is tooling that makes supervising and getting insight into what it's doing a superior experience.

A far more interactive coding "agent" that makes sure it walks through every change it makes with you, and doesn't just rush through tasks. That helps team members come up to speed on a repository by working through it with them.

matwood

3 days ago

> Strangely I've found myself more exhausted at the end of the week and I think it's because of the constant supervision necessary to stop Claude from colouring outside the lines when I don't watch it like a hawk.

Welcome to management. Computers and code are easy. People and people wannabes like LLMs are a pain.

cschneid

4 days ago

I find AI is most useful at the ancillary extra stuff. Things that I'd never get to myself. Little scripts of course, but more like "it'd be nice to rename this entire feature / db table / tests to better match the words that the business has started to use to discuss it".

In the past, that much nitpicky detail just wouldn't have gotten done, my time would have been spent on actual features. But what I just described was a 30 minute background thing in claude code. Worked 95%, and needed just one reminder tweak to make it deployable.

The actual work I do is too deep in business knowledge to be AI coded directly, but I do use it to write tests to cover various edge cases, trace current usage of existing code, and so on. I also find AI code reviews really useful to catch 'dumb errors' - nil errors, type mismatches, style mismatch with existing code, and so on. It's in addition to human code reviews, but easy to run on every PR.

blks

3 days ago

Wow, 30 minutes to rename functions and tests? I wonder how much energy and water that llm wasted for something that any lsp supporting editor can do in a second.

lovestaco

3 days ago

Nice, which reviewer do you use? I was using CodeRabbi now switched to LiveReview.

theonething

4 days ago

> it alleviates a lot of mental energy

For me, this is the biggest benefit of AI coding. And it's energy saved that I can use to focus on higher level problems e.g. architecture thereby increasing my productivity.

nicodjimenez

4 days ago

What kind of codebases do you work on if you don't mind me asking?

I've found a huge boost from using AI to deal with APIs (databases, k8s, aws, ...) but less so on large codebases that needed conceptual improvements. But at worst, i'm getting more than 10% benefit, just cause the AI's can read files so quickly and answer questions and propose reasonable ideas.

Sl1mb0

4 days ago

How are you quantifying that 10% ?

mnky9800n

3 days ago

Do you feel like you lack mental energy at 40? I do not feel any different from 30. I think the main difference between 30 and 40 is I am much more efficient at doing things and therefore am able to do more.

danielvaughn

3 days ago

I do, but I’m not sure it’s age related. Ever since 2020, it feels like my physical and mental energy has been on a downward trajectory. I feel like I’ve lost several IQ points. What’s interesting is that I’ve heard the same from a lot of people. Not sure what the root cause is, but I do need to take better care of myself.

actionfromafar

3 days ago

Don't you have to keep dismissing incorrect auto-complete? For me I have a particular idea in mind, and I find auto-complete to be incredibly annoying.

It breaks flow. It has no idea my intention, but very eagerly provides suggestions I have to stop and swat away.

apt-apt-apt-apt

4 days ago

Yeah, [autocomplete: I totally agree]

it's so [great to have auto-complete]

annoying to constantly [have to type]

have tons of text dumped into your text area. Sometimes it looks plausibly right, but with subtle little issues. And you have to carefully analyze whatever it output for correctness (like constant code review).

shiandow

4 days ago

That we're trying to replace entry/mediocre/expert level code writing with entry/mediocre/expert level code reading is one of the strangest aspects of this whole AI paradigm.

There's literally no way I can see that resulting in better quality, so either that is not what is happening or we're in for a rude awakening at some point.

oblio

4 days ago

https://www.joelonsoftware.com/2000/05/26/reading-code-is-li...

https://www.joelonsoftware.com/2000/04/06/things-you-should-... (read the bold text in the middle of the article)

These articles are 25 years old.

gnerd00

4 days ago

there is no "we" or at least not sufficiently differentiated. Another layer is inserted into .. everything? think MSFT Teams. Your manager's manager is being empowered.. you become a truck driver who must stay on the route, on schedule or be replaced.

jcgrillo

4 days ago

This is how you know the "AI" proselytizers are completely full of shit. They're trying to bend the narrative with a totally unrealistic scenario where reading and reviewing code is somehow "more efficient" than writing it. This is only true if you

sramam

4 days ago

I think there is a difference between type system or Language Server completions and AI generated completion.

When the AI tab completion fills in full functions based on the function definition you have half typed, or completes a full test case the moment you start type - mock data values and all, that just feels mind-reading magical.

spockz

4 days ago

I haven’t tried it lately but how well are these models in generating property based tests?

another_twist

4 days ago

Tab completions simply hit the bottleneck problem. I dont want to press tab on every line it makes no sense. I would rather AI generate a function block and then integrate it back. Saves me the typing hassle and I can focus in design and business logic.

socalgal2

3 days ago

Can you be specific about what you're asking it to write?

I'm pretty confidient that I couldn't get it to implement a element in a web browser. I'm talking about C++ in WebKit or Chromium, not a custom element in HTML/JS. Let's say browsers wanted a new data-table element that natively implemented a scrolling window such that you registered events to supply elements and it asked for only the portion that were visible. I feel like those code bases are too complex for LLMs to add this (could certainly be wrong).

In any case, more concrete examples would help these discussions.

I can give a concrete example: I just asked ChatGPT (should have asked something more integrated into my editor), to give me JavaScript to directly parse meta data out of an MP4 file in JS in the browser. It gave my typescript but told me I should look at mp4box.

I spent 15-20 minutes getting a mp4box example setup (probably should have asked for that too). Only to find that I think mp4box does not get me the data I wanted.

So I went back to ChatGPT and asked for JavaScript because I was hacking in something like jsfiddle. It gave me that and it worked!

I then said I wanted the title and comments meta data as that was missing from what it gave me. That worked too, first try.

spunker540

4 days ago

I’m not yet up to half (because my corporate code base is a mess that doesn’t lend itself well to AI)

But your approach sounds familiar to me. I find sometimes it may be slower and lower quality to use AI, but it requires less mental bandwidth from me, which is sometimes a worthwhile trade off.

wordofx

3 days ago

> because my corporate code base is a mess that doesn’t lend itself well to AI

What language? I picked up an old JS project that had several developers fail over weeks to upgrade to newer versions of react. But I got it done in a day by using AI to generate a ton of unit tests then loop an upgrade / test / build. Was 9 years out of date and it’s running in prod now with less errors than before.

Also upgraded rails 4 app to rails 8 over a few days.

Done other apps too. None of these are small. Found a few memory leaks in a C++ app that our senior “experts” who have spent 20 years doing c++ couldn’t find.

victorbjorklund

3 days ago

100%. My instructions are often pretty exact "i want a genserver that fetches the data every X sec and updates Y if new data"

cpursley

3 days ago

Yeah, Claude has gotten much better with Elixir. I think too many people expect one-shot miracles, but you have to guide it along like you would a junior, including directing it towards the proper context.

npn

3 days ago

I typed less using AI, so it is a net win I guess

cc62cf4a4f20

3 days ago

I've been coding professionally for almost 30 years. I use Claude Code heavily, even for larger features. To get that to work half-decent, you have to take on a PM/Tech-lead role, you're no longer a senior engineer.

For large pieces of work, I will iterate with CC to generate a feature spec. It's usually pretty good at getting you most of the way there first shot and then either have it tweak things or manually do so.

Implementation is having CC first generate a plan, and iterating with it on the plan - a bit like mentoring a junior, except CC won't remember anything after a little while. Once you get the plan in place, then CC is generally pretty good at getting through code and tests, etc. You'll still have to review it after for all the reasons others have mentioned, but in my experience, it'll get through it way faster than I would on my own.

To parallelize some of the work, I often have Visual Studio Code open to monitor what's happening while it's working so I can redirect early if necessary. It also allows me to get a head start on the code review.

I will admit that I spent a lot of time iterating on my way of working to get to where I am, and I don't feel at all done (CC has workflows and subagents to help with common tasks that I haven't fully explored yet). I think the big thing is that tools like CC allow us to work in new ways but we need to shift our mindset and invest time in learning how to use these tools.

angled

3 days ago

I’ve arrived at something similar:

* brainstorm all the ideas, get Claude to write docs + code for all them, and then throw away the code

* ask it to develop architecture and design principles based on the contents of those docs

* get it to write a concise config spec doc that incorporates all the features, respects the architecture and design as appropriate

* iterate over that for a while until I get it into a state I like

* ask it to write an implementation plan for the config spec

* babysit it as I ask it to implement phase by phase of the implementation plan while adhering to the config spec

It’s a bit slower to than what I’d hoped originally, but it’s a lot better in terms of end result and gives me more opportunity to verify tests, tweak implementation, briefly segue or explore enhancements, etc.

paulcole

3 days ago

> To get that to work half-decent, you have to take on a PM/Tech-lead role, you're no longer a senior engineer.

But you’re saying it can be half-decent?

The problem is that about 75% of HN commenters have their identities tightly wound up in being a (genuflect) senior engineer and putting down PM/tech-lead type roles.

They’ll do anything to avoid losing that identity including writing non-stop about how bad AI code is. There’s an Upton Sinclair quote that fits the situation quite nicely.

tbrake

3 days ago

Kinda but kinda not.

I'd agree that 75% you speak of is generally hostile to the mere concept of PMs, but that's usually from a misapplication of PMs as proxy-bosses for absentee product owners/directors who don't want to talk to nerds - flow interruptions, beancounting perceived as useless, pointless ceremonies, even more pointless(er) meetings etc, and the further defiling of the definition of "agile".

But a deep conceptual product and roadmap understanding that helps one steer Claude Code is invaluable for both devs and PMs, and I don't think most of that 75% would begrudge that quality in a PM

INTPenis

4 days ago

I wouldn't say I'm old, but I suddenly fell into the coding agent rabbit hole when I had to write some Python automations against Google APIs.

Found myself having 3-4 different sites open for documentation, context switching between 3 different libraries. It was a lot to take in.

So I said, why not give AI a whirl. It helped me a lot! And since then I have published at least 6 different projects with the help of AI.

It refactors stuff for me, it writes boilerplate for me, most importantly it's great at context switching between different topics. My work is pretty broadly around DevOps, automation, system integration, so the topics can be very wide range.

So no I don't mind it at all, but I'm not old. The most important lesson I learned is that you never trust the AI. I can't tell you how often it has hallucinated things for me. It makes up entire libraries or modules that don't even exist.

It's a very good tool if you already know the topic you have it work on.

But it also hit me that I might be training my replacement. Every time I correct its mistakes I "teach" the database how to become a better AI and eventually it won't even need me. Thankfully I'm very old and will have retired by then.

baq

4 days ago

I love the split personality vibe here.

JadeNB

4 days ago

Or perhaps the commenter just aged a lot while writing the post.

johnfn

4 days ago

First line: "I wouldn't say I'm old"

4 days ago

Google OAuth2 refresh tokens are definitely singe use.

another_twist

4 days ago

Atleast not documented here https://developers.google.com/identity/protocols/oauth2#5.-r.... They have a limit on the number of tokens but not on number of uses per token.

lpapez

4 days ago

This article goes completely against my experience so far.

I teach at an internship program and the main problem with interns since 2023 has been their over reliance on AI tools. I feel like I have to teach them to stop using AI for everything and think through the problem so that they don't get stuck.

Meanwhile many of the seniors around me are stuck in their ways, refusing to adopt interactive debuggers to replace their printf() debug habits, let alone AI tooling...

lordnacho

4 days ago

> Meanwhile many of the seniors around me are stuck in their ways, refusing to adopt interactive debuggers to replace their printf() debug habits, let alone AI tooling...

When I was new to the business, I used interactive debugging a lot. The more experienced I got, the less I used it. printf() is surprisingly useful, especially if you upgrade it a little bit to a log-level aware framework. Then you can leave your debugging lines in the code and switch it on or off with loglevel = TRACE or INFO, something like that.

shmerl

4 days ago

I kind of had the opposite experience. I used to rely mostly on printfs and etc. but started using debugger more.

printf doesn't improve going up and down the call stacks in the debugger to analyze their chain (you'd have to spam debug printfs all around you expect this chain to happen to replace the debugger which would waste time). debugger is really powerful if you use it more than superficially.

boredtofears

3 days ago

> you'd have to spam debug printfs all around you expect this chain to happen to replace the debugger which would waste time

It's not wasting time, it's narrowing in on the things you know you need to look for and hiding everything else. With a debugger you have to do this step mentally every time you look at the debugger output.

shmerl

3 days ago

Trying to guess what that chain is and putting printf's all around that path feels like doing a poor simulation of what debugger can do out of the box and unlike us - precisely. So I'd say it's exactly the opposite.

If you only care about some specific spot, then sure - printf is enough, but you also need to recompile things every time you add a new one or change debug related details, while debugger can do it re-running things without recompilation. So if anything, printf method can take more time.

Also, in debugger you can reproduce printf using REPL.

ambicapter

4 days ago

> printf() is surprisingly useful, especially if you upgrade it a little bit to a log-level aware framework.

What do you mean by this? Do you mean using a logging framework instead of printf()?

4 days ago

And no one has ever written a buggy unit test.

ch4s3

3 days ago

That’s what the code is for.

brongondwana

3 days ago

This is why I trigger a segfault which dumps core at the spot where I had the printf when the conditions aren't what I want, so I can then open the debugger on the core (obviously: not if I have a copy of the input which can recreate it, if so then a debugger with a conditional breakpoint at the same spot is even better)

4 days ago

Interactive debuggers and printf() are both completely valid and have separate use-cases with some overlap. If you're trying to use, or trying to get people to use, exclusively one, you've got some things to think about.

jennyholzer

3 days ago

printf() users in this thread are very proud of their own ignorance

marssaxman

3 days ago

I almost exclusively debug via printf / logging, and I am so stupendously ignorant that I have even written and published a multi-platform interactive debugger, to go with the compiler I also wrote (one of several, but this one was the most successful). Make of that what you will, I suppose.

marssaxman

4 days ago

That's funny. I remember using interactive debuggers all the time back in the '90s, but it's been a long time since I've bothered. Logging, reading, and thinking is just... easier.

TheRoque

4 days ago

Really ? I find myself thinking the opposite. My program always runs in debug mode, and when there's some issue I put a breakpoint, trigger it, and boom I can check what is wrong. I don't need to stop the program, insert a new line to print what i _guess_ is wrong, restart the program from scratch etc.

Properly debugging my stack is probably one of the first things I setup because I find it way less tedious. Like, for example, if you have an issue in a huge Object or Array, will you actually print all the content, paste it somewhere else and search through the logs ? And by the way, most debuggers also have ability to setup a log points anyways, without having to restart your program. Genuinely curious to know how writing extra lines and having to restart makes things easier.

Of course I'm not saying that I never débug with logs, sometimes it's require or even more efficient, but it's often my second choice.

marssaxman

3 days ago

I imagine that it depends on the kind of software you are working on. I found debuggers helpful back when I was working on interactive GUI programs which had a lot of complex state, built in classic OOP style with lots of objects pointing at each other, but I have not done that sort of thing in a long time. In part, that's because I got seriously into functional programming, which left me with a much more rigorous approach to state transitions; but then I shifted from conventional software into embedded firmware, where you can sometimes stop the microcontroller and use a debugger via JTAG, but it's usually easier to just stream data out over a serial port, or hook an LED up to a spare pin and make it blink. The world doesn't stop even if you want the software to stop, so the utility of a debugger is limited.

After that I went to work for Google, building distributed software running across many machines in a datacenter, and I have no idea how you would hook up a debugger even if you wanted to. It's all logs all the time, there.

By the time that was over, I was thoroughly accustomed to logging, and attaching a debugger had come to seem like a nuisance. Since then I've mostly worked on compilers, or ML pipelines, or both: pure data-processing engines, with no interactivity. If I'm fixing a bug, I'm certainly also writing a regression test about it, which lends itself to a logging-based workflow. I don't mind popping into gdb if that's what would most directly answer my question, but that only happens a couple of times a year.

TheRoque

3 days ago

Thanks for the detailed answer. Indeed I also worked on embedded a long time ago, and the debugger was often the last resort because the debug pins weren't always accessible.

LandR

3 days ago

Also conditional breakpoints. I.e. break on this line if foo==5

I couldn't imagine going back to print statement based debugging. Would be a massive waste of time.

globular-toast

3 days ago

Yeah, I remember learning to use gdb when I was beginning in the early 2000s. I totally thought I was "levelling up" as a programmer and to be honest felt kinda badass with all those windows open in Emacs. But I've found that the number of times I actually resorted to using the debugger has been so small I don't remain fluent in it's use. What am I supposed to do? Write more bugs? On the other hand, I'm always ready to read, think and put in some print/log statements.

jacquesm

4 days ago

The right tool for the right job. If someone gets the job done with printf() then that would be good enough for me.

Interactive debuggers are a great way to waste a ton of time and get absolutely nowhere. They do have their uses but those are not all that common. The biggest usecase for me for GDB has been to inspect stacktraces, having a good mental model of the software you are working on is usually enough to tell you exactly what went wrong if you know where it went wrong.

Lots of people spend way too much time debugging code instead of thinking about it before writing.

Oh, and testing >> debugging.

another_twist

4 days ago

Nitpicking a bit here but theres nothing wrong with printf debugging. Its immensely helpful to debug concurrent programs where stopping one part would mess up the state and maybe even avoid the bug you were trying to reproduce.

As for tooling, I really love AI coding. My workflow is pasting interfaces in ChatGPT and then just copy pasting stuff back. I usually write the glue code by hand. I also define the test cases and have AI take over those laborious bits. I love solving problems and I genuinely hate typing :)

quantiq

3 days ago

Yeah, I'm not putting stock in this at all. The methodology of this survey seems dubious.

jennyholzer

3 days ago

30% of the articles on this forum are covert LLM advertising

wulfstan

3 days ago

Good, it's not just me.

I guess it's not a massive surprise given it's an HN forum and a reasonable percentage of HN candidates are doing LLM/AI stuff in recent cohorts, but it still means I have to apply a very big filter every time I open an article and people wax lyrical about how amazing Claude-GPT-super-codez is and how it has made them twice the engineer they were yesterday at the bargain price of $200 a month...

May it all die in a fire very soon. Butlerian jihad now.

jennyholzer

3 days ago

oblio

4 days ago

Most decent debuggers have condițional breakpoints.

jcparkyn

3 days ago

Not to mention tracepoints (logging breakpoints), which are functionally the same as printf but don't require compiling/restarting.

scarface_74

3 days ago

And have since the mid 1990s at least…

izacus

4 days ago

So how many developers in that survey are those?

They surveyed 791 developers (:D) and "a third of senior developers" do that. That's... generiously, what... 20 people?

It's amazing how everyone can massage numbers when they're trying to sell something.

thegrim33

4 days ago

The other thing they do is conveniently not mention all the negative stuff about AI that the source article mentions, they only report on the portion of content from the source that's in any way positive of AI.

And of course, its an article based on a source article based on a survey (of a single company), with the source article written by a "content marketing manager", and the raw data of the survey isn't released/published, only some marketing summary of what the results (supposedly) were. Very trustworthy.

kamaal

3 days ago

>It's amazing how everyone can massage numbers when they're trying to sell something.

More like massage words.

Like one fairly smart person told me- Doing math with words is pointless.

user

4 days ago

[deleted]

encody

3 days ago

I think I'm using it wrong.

I'm using Zed as my editor, and maybe 18 months ago I upgraded my system. I didn't miss the AI autocomplete at the time, so I didn't bother to set it up again. However, around two weeks ago I figured I'd give it another go.

I set up GitHub Copilot in Zed and... it's horrible. It seems like most of its suggestions are completely misguided and incorrect, usually just duplicating the code immediately above or below the cursor location while updating the name of a single identifier to match some perceived pattern. Not remotely close to what I'd consider useful; I'm definitely a faster & better programmer without it.

I also tried setting up some local models on ollama: I kept getting random tokens inserted that seemed to be markup from the model output that Zed didn't know how to parse. (On mobile rn, will post sample output when I am back at work if I remember to.)

Is paying Anthropic an arm and a leg for the privilege of granting them first-party access to train on my user data really the competitive move as a modern developer?

P.S. I know Zed has their own AI (and it seems like it should be really good!), but when they first introduced it, I tried it out and it immediately consumed the entire free tier's worth of credits in just a few minutes of normal coding: suggestions are proactively generated and count against your account credit even if not accepted, so I didn't really feel like I'd gotten a good sense of the tool by the time the trial ran out. Even if it's really good, it burns through credits extremely fast.

LinXitoW

3 days ago

I've used the 200 bucks Claude Code account, and while that's pretty great, it's definitely not "fire and forget". It often requires a completely different skill set. You basically need to develop a feeling for what the agent can do automatically and what needs extra prompting.

If you want a perfect investment imho, get Supermaven. It's autocomplete is 99% perfect.

DecoySalamander

2 days ago

For autocompletion I suggest giving Zed's own offering another try. It's guesses somewhat correctly at least half the time and is very convenient when it get's what you're trying to do durin a refactor and suggest valid multiline changes. It's still rather annoying when it replaces regular autocompletion for import paths with it's plausible sounding guesses. Before that I was using Cody, but I don't think that they have free tier any more.

> immediately consumed the entire free tier's worth of credits in just a few minutes of normal coding

Happened to me too when I've tried to do a single edit with connected gemini pro. It somehow ran out of free quota even before finishing it.

__jonas

3 days ago

I don’t think you should look at the Copilot autocomplete as anything other than autocomplete, as far as I know they use worse but faster models for that.

I’m not big on AI but I get occasional use out of Copilot with the chat / agent mode and Claude Sonnet 4.

freehorse

3 days ago

> suggestions are proactively generated and count against your account credit even if not accepted

This is no longer the case. They only count if you accept them.

Ime their model is not great, but better than copilot (ime copilot is just too slow and disruptive).

Also the options you have for tab-complete, afaik, are zed/zeta, guthub copilot, supermaven, and local models. I dont think there are other providers right now, glad if I am wrong in that.

marcyb5st

4 days ago

In terms of LOCs maybe, in terms of importance I think is much less. At least that's how I use LLMs.

While I understand that <Enter model here> might produce the meaty bits as well, I believe that having a truck factor of basically 0 (since no-one REALLY understands the code) is a recipe for a disaster and I dare say long term maintainability of a code base.

I feel that you need to have someone in any team that needs to have that level of understanding to fix non trivial issues.

However, by all means, I use the LLM to create all the scaffolding, test fixtures, ... because that is mental energy that I can use elsewhere.

epicureanideal

4 days ago

Agreed. If I use an LLM to generate fairly exhaustive unit tests of a trivial function just because I can, that doesn’t mean those lines are as useful as core complex business logic that it would almost certainly make subtle mistakes in.

andsoitis

4 days ago

> If I … generate fairly exhaustive unit tests of a trivial function

… then you are not a senior software engineer

triyambakam

4 days ago

Neither are you if that's your understanding of a senior engineer

mgh95

4 days ago

I think the parent commentors point was that it is nearly trivial to generate variations on unit tests in most (if not all) unit test frameworks. For example:

Java: https://docs.parasoft.com/display/JTEST20232/Creating+a+Para...

C# (nunit, but xunit has this too): https://docs.nunit.org/articles/nunit/technical-notes/usage/...

Python: https://docs.pytest.org/en/stable/example/parametrize.html

cpp: https://google.github.io/googletest/advanced.html

A belief that the ability of LLMs to generate parameterizations is intrinsically helpful to a degree which cannot be trivially achieved in most mainstream programming languages/test frameworks may be an indicator that an individual has not achieved a substantial depth of experience.

com2kid

4 days ago

The useful part is generating the mocks. The various auto mocking frameworks are so hit or miss I end up having to manually make mocks which is time consuming and boring. LLMs help out dramatically and save literally hours of boring error prone work.

mgh95

4 days ago

Why mock at all? Spend the time making integration tests fast. There is little reason a database, queue, etc. can't be set up in a per-test group basis and be made fast. Reliable software is built upon (mostly) reliable foundations.

com2kid

3 days ago

Because if part of my tests involve calling an OpenAI endpoint, I don't want to pay .01 cent every time I run my tests.

Because my tests shouldn't fail when a 3rd party dependency is down.

Because I want to be able to fake failure conditions from my dependencies.

Because unit tests have value and mocks make unit tests fast and useful.

Even my integration tests have some mocks in them, especially for any services that have usage based pricing.

But in general I'm going to mock out things that I want to simulate failure states for, and since I'm paranoid, I generally want to simulate failure states for everything.

End to End tests are where everything is real.

mgh95

3 days ago

> Because if part of my tests involve calling an OpenAI endpoint, I don't want to pay .01 cent every time I run my tests.

This is a good time to think to yourself: do I need these dependencies? Can I replace them with something that doesn't expose vendor risk?

These are very real questions that large enterprises grapple with. In general (but not always), orgs that view technology as the product (or product under test) will view the costs of either testing or inhousing technology as acceptable, and cost centers will not.

> But in general I'm going to mock out things that I want to simulate failure states for, and since I'm paranoid, I generally want to simulate failure states for everything.

This can be achieved with an instrumented version of the service itself.

com2kid

3 days ago

> This is a good time to think to yourself: do I need these dependencies? Can I replace them with something that doesn't expose vendor risk?

Given that my current projects all revolve solely around using LLMs to do things, yes I need them.

3 days ago

99% of the work in testing is coming up with test scenarios and test cases. 95% of the code is just dealing with setting up input and output data, 4% is calling the code you want to test and the final assert is often just a single line of code.

I'm not sure what depth of experience has to do with any of this, since it is busy work that costs a lot of time. A form with 120 fields is a form with 120 fields. There is no way around coming up with the several dozens of test cases that you're going to test without filling out almost all of the fields, even the ones that are not relevant to the test itself, otherwise you're not really testing your application.

VectorLock

4 days ago

Parameterized tests are good, but I think he might be talking about exercising all the corner cases in the logic of your function, which to my knowledge almost no languages can auto-generate for but LLMs can sorta-ish figure it out.

mgh95

4 days ago

We are talking about basic computing for CRUD apps. When you start needing to rely upon "sorta-ish" to describe the efficacy or a tool for such a straightforward and deterministic use case, it may be an indicator you need to rethink your approach.

VectorLock

4 days ago

If you want to discount a tool that may save you an immense amount of time because you might have to help it along the fast few feet, thats up to you.

If you can share a tool that can analyze a function and create a test for all corner cases in a popular language, I'm sure some people would be interested in that.

mgh95

4 days ago

You should look up intellitest and reshaper test generator. Products exist for this.

imtringued

4 days ago

> you're breaking the task into smaller achievable tasks.

this is the part that I would describe as engineering in the first place. This is the part that separates a script kiddie or someone who "knows" one language and can be somewhat dangerous with it, from someone who commands a $200k/year salary, and it is the important part

Chris_Newton

4 days ago

It turns into a specification problem.

This, IMHO, is the critical point and why a lot of “deep” development work doesn’t benefit much from the current generation of AI tools.

Last week, I was dealing with some temporal data. I often find working in this area a little frustrating because you spend so much time dealing with the inherent traps and edge cases, so using an AI code generator is superficially attractive. However, the vast majority of my time wasn’t spent writing code, it was getting my head around what the various representations of certain time-based events in this system actually mean and what should happen when they interact. I probably wrote about 100 test cases next, each covering a distinct real world scenario, and working out how to parameterise them so the coverage was exhaustive for certain tricky interactions also required a bit of thought. Finally, I wrote the implementation of this algorithm that had a lot of essential complexity, which means code with lots of conditionals that needs to be crystal clear about why things are being done in a certain order and decisions made a certain way, so anyone reading it later has a fighting chance of understanding it. Which of those three stages would current AI tools really have helped with?

3 days ago

Awesome. That must be on the same planet where 300% of the worlds power goes to processing a single bitcoin transaction.

merlincorey

4 days ago

Claude is making $72k a year for a consistent $300/day spend.

PhantomHour

4 days ago

Bear in mind those are revenue figures, they're costing claude hundreds a day.

One imagines Leadership won't be so pleased after the inevitably price hike (which, given the margins software uses, is going to be in the 1-3 thousands a day) and the hype wears off enough for them to realize they're spending a full salary automating a partial FTE.

ojosilva

4 days ago

But, by the looks of things, models will be more efficient by then and a cheaper-to-run model will produce comparable output. At least that's how it's been with OSS models, or with the Openai api model. So maybe the inevitable price hike (or rate limiting) may lead to switching models / providers and the results being just as good.

racc1337

3 days ago

There is an interesting substack post about this. LLM costs are dropping 10x/year but the amount of tokens used have gone up like crazy https://open.substack.com/pub/ethanding/p/ai-subscriptions-g...

krainboltgreene

4 days ago

> But, by the looks of things, models will be more efficient by then and a cheaper-to-run model will produce comparable output

So far there's negative evidence of this. Things are getting more expensive for similar outputs.

etoxin

3 days ago

And this varies widely between models based on how they are told to reason.

codingdave

3 days ago

> Senior developers were also more likely to say they invest time fixing AI-generated code.

That quote is the key - even if 1/3 of senior devs are pushing mostly AI-driven code, they are checking it first. And while the survey did not cover it, I suspect they are using their experience to decide which areas of a codebase are commonplace enough that AI can handle it vs. which areas are unique and require coding without AI.

People are learning when to use AI as a helpful tool and when not to.

goosejuice

4 days ago

This is self reported unless I missed something. I bet that skews these results quite a bit. Many are very hesitant to say they use AI, and I suspect that's much more likely to be the case when you are new to the field.

Also, green coding? That's new to me. I guess we'll see optional carbon offset purchasing in our subs soon.

countWSS

4 days ago

Most of code that "needs to be written" is just a copy of something standard, "Do X in the simplest way possible" code that doesn't need optimizations, and writing it by hand is just waste of time. AI is good enough to write megabytes of that code, since its statistically common and part of tons of codebases. Its the other half of code that AI can't handle, that you need to manually verify it doesn't hallucinate fantastic stuff that manages to compile, but doesn't work.

bgwalter

4 days ago

Fastly is in the AI business like Cloudflare:

https://www.fastly.com/products/ai

https://www.fastly.com/products/fastly-ai-bot-management

https://www.fastly.com/documentation/guides/compute/about-th...

aspir

2 days ago

Disclaimer: not only am I a Fastly employee, I'm the Fastly employee quoted in the register article originally linked.

We'd argue our stance and focus are a bit different. Our stance is -- we help eng teams and the open web with whatever their preference for AI might be.

If you want to go all in and monetize AI traffic or sign an agreement with a foundation model for training, we have tools that can help you do that. If you want to go scorched-earth on AI bots that try to scrape your systems, we can help you do that too.

It's all about helping engineers tasked with tough problems get their jobs done, regardless of whether the flavor of the week is AI or something else.

binarymax

4 days ago

I guess I’m an older developer.

But I’ve come full circle and have gone back to hand coding after a couple years of fighting LLMs. I’m tired of coaxing their style and fixing their bugs - some of which are just really dumb and some are devious.

Artisanal hand craft for me!

Gigachad

4 days ago

I've also just turned off copilot now. I had several cases where bugs in the generated code slipped through and ended up deployed. Bugs I never would have written myself. Reviewing code properly is so much harder than writing it from scratch.

jennyholzer

3 days ago

"write it right the first time" is my new mantra/challenge to myself.

along with improving my skills in vim, this approach has made me significantly more productive and has made my code much simpler compared to when i was using LLM code generation tools.

there is no shortcut around hard work.

there is no shortcut to thoroughly interrogating the constraints of the software problem at hand.

developers who rely on LLM code generation are poor coworkers because they don't understand what they've written and why they've written it.

uludag

4 days ago

I'm in the same exact boat. I started with a lot of different tools but eventually went back to hand coding everything. When using tools like co-pilot I noticed I would ship a lot mode dumb mistakes. I even experimented with not even using a chat interface and it turns out that a lot of answers to problems are indeed found with a web search.

jennyholzer

3 days ago

I'm in this boat. On top of this though, I genuinely think less of developers who rely on LLMs. I very seriously believe that they are beneath me.

3 days ago

it's a lot faster and a lot easier and it gives you a much better understanding of the problem if you just think about the changes you want to make and write them yourself.

why are you looking for a shortcut? just do the work.

gardnr

3 days ago

I feel like it’s my job to understand new tools and keep abreast of the way things are changing in my industry.

jillesvangurp

3 days ago

LLMs are tools. If you master your tools, you become more productive. I've had the same mixed experiences with LLMs that others have. Some good, some not so great. But I've been merging quite a bit of codex created PRs in the last weeks. Some needed manual intervention. Some were tedious to create. And some clearly saved me a lot of work.

I always have more work than I can handle. Part of my job is deciding what not to do and what to drop because it hasn't got the right priority. Me spending an afternoon on a thing that is fun but not valuable is usually a bad use of my time. With LLMs, I'm taking on a few more of the things that I previously wouldn't have. That started with a few hobby projects that I'm now doing that I previously wasn't. And it's creeping into work as well. LLMs struggle on larger code bases. But less so with recent model releases.

latexr

3 days ago

> LLMs are tools. If you master your tools, you become more productive.

LLMs are unpredictable tools which change all the time, let’s not pretend otherwise. You can’t “master” them in the same way as previous tools. You can learn some tricks to trick them to be closer to what you want, and that’s about it.

Imagine if every time you did the exact same movement to hammer a nail, you had to check your work. Maybe this time it hammered it in perfectly, or maybe it smashed your finger, or maybe it only went half-way through. You could never develop muscle memory for such a tool. You could use it, sure, but never master it.

jillesvangurp

2 days ago

I didn't say they were perfect tools; quite the opposite actually.

scubadude

4 days ago

At a minimum, 30-50% of bogus surveys are bogus but I'm willing to bet it's a lot more.

uludag

4 days ago

Naive question but wouldn't it could as having AI write 50%+ of your code if you just use an unintelligent complete-the-line AI tool? In this case the AI is hardly doing anything intelligent, but is still getting credit for doing most of the work.

righthand

4 days ago

Yes there is even a small business that champions an small LLM that is trained on the language Lsp, your code base, and your recent coding history (not necessarily commit, but any time one presses ctrl + s). How it works is essentially autocomplete. This functionality is packaged as an IDE plugin: TabNine

However now they try to sell subscriptions to LLMs.

Tabnine has been in the scene since at least 2018.

jesterson

4 days ago

That doesn't worry me much. Whats way more concerning is junior/middle developers have likely way more than 1/2 of their code "AI" generated.

At least seniour developer allegedly know how to error-proof their code to some extend.

matula

4 days ago

I've been at this for many years. If I want to implement a new feature that ties together various systems and delivers an expected output, I know the general steps that I need to take. About 80% of those steps are creating and stubbing out new files with the general methods and objects I know will be needed, and all the test cases. So... I could either spend the next 4 hours doing that, or spend 3 minutes filling out a CLAUDE.md with the specs and 5 minutes having Claude do it (and fairly well).

I feel no shame in doing the later. I've also learned enough about LLMs that I know how to write that CLAUDE.md so it sticks to best practices. YMMV.

blast

4 days ago

> I've also learned enough about LLMs that I know how to write that CLAUDE.md so it sticks to best practices.

Could you share some examples / tips about this?

LarryMade2

4 days ago

I tried it - didn't like it. Had an LLM work on a backup script since I don't use Bash very often. Took a bunch of learning the quirks of bash to get the code working properly.

While I'll say it got me started, it wasn't a snap of the fingers and a quick debug to get something done. Took me quite a while to figure out why something worked but really it didn't (LLM using command line commands where Bash doesn't interpret the results the same).

If its something I know, probably wont use LLM (as it doesn't do my style). If it's something I don't know, might use it to get me started but I expect that's all I'll it for.

dboreham

4 days ago

Can I ask which agent/model you used? I'm similarly irritated with shell script coding, but find I have to make scripts fairly often. My experience using various models but latterly Claude Code has been quite different -- it churned out pretty much what I was looking for. Also old, fwiw. I'm older than all shells.

LarryMade2

4 days ago

It was Gemini, I picked the point of least resistance. I've heard Claude does better but haven't looked at it.

mr90210

4 days ago

> survey of 791 developers

We have got to stop. In a universe of well over 25 million programmers a sample of 791 is not significant enough to justify such headlines.

We’ve got to do better than this, whatever this is.

recursive

4 days ago

Validity of the sample size is not determined by its fraction of the whole population. I don't know the formulas and I'm not a statistician. Maybe someone can drop some citations.

spmurrayzzz

4 days ago

I generally agree with this just from a perspective of personal sentiment, it does feel wrong.

But statistically speaking, at a 95% confidence level you'd be within a +/- 3.5% margin of error given the 791 sample size, irrespective of whether the population is 30k or 30M.

oasisaimlessly

4 days ago

You should read more about statistical significance. Under some reasonable assumptions, you can confidently certain deduce things with small sample sizes.

From another perspective: we've deduced a lot of things about how atoms work without any given experiment inspecting more than an insignificant fraction of all atoms.

TL;DR: The population size (25e6 total devs, 1e80 atoms in observable universe) is almost entirely irrelevant to hypothesis testing.

FpUser

3 days ago

My programming background - started from machine codes. Did everything - business desktop, high performance game like apps, firmware for microcontrollers, web front ends, high performance enterprise grade backends, distributed etc. etc. Multiple languages. So lots of experience. Now I use AI quite extensively to generate snippets / classes of code with small and well defined functionality. Most of the time it works just fine and due to my extensive experience I could quickly spot wrong approach, inefficiency, potential issues etc. After telling AI to fix whatever if any problems I've found I usually satisfied with the solution. So overall AI has become a very valuable tool for me, saves gobbles of time. How much exactly - do not really know / care.

Michael_Keller

3 days ago

As a consultant, I’m seeing this trend firsthand. A few thoughts:

Productivity boost: AI-generated code accelerates prototyping and routine tasks, letting developers focus on higher-value architecture and problem-solving.

Quality depends on oversight: The best results come when experienced engineers review, refine, and adapt AI output—it’s not “set and forget.”

Business impact: Faster development cycles mean reduced time-to-market, which is especially critical for startups and innovation-driven companies.

Skill evolution: Developers are shifting from “just coding” to becoming solution architects who guide AI tools effectively.

In short, AI is becoming a powerful co-pilot, but human expertise remains the differentiator.

calibas

4 days ago

I think they're being really loose with the term "vibe coding", and what they really mean is AI-assisted coding.

Older devs are not letting the AI do everything for them. Assuming they're like me, the planning is mostly done by a human, while the coding is largely done by the AI, but in small sections with the human giving specific instructions.

Then there's debugging, which I don't really trust the AI to do very well. Too many times I've seen it miss the real problem, then try to rewrite large sections of the code unnecessarily. I do most of the debugging myself, with some assistance from the AI.

9rx

4 days ago

> Assuming they're like me, the planning is mostly done by a human, while the coding is largely done by the AI

I've largely settled on the opposite. AI has become very good at planning what to do and explaining it in plain English, but its command of programming languages still leaves a lot to be desired.

calibas

4 days ago

It's good at checking plans, and helping with plans, but I've seen it make really really bad choices. I don't think it can replace a human architect.

bluefirebrand

4 days ago

It can't replace a human anything, yet, but that doesn't seem to be stopping anyone from trying unfortunately:(

9rx

4 days ago

Yes, much like many of the humans I have worked with, sometimes bad choices are introduced. But those bad choices are caught during the writing of the code, so that's not really that big of a deal when it does happen. It is still a boon to have it do most of the work.

And remains markably better than when AI makes bad choices while writing code. That is much harder to catch and requires pouring over the code with a fine tooth comb to the point that you may as well have just written it yourself, negating all the potential benefits of using it to generate code in the first place.

kmoser

4 days ago

They're also being really loose with the term "older developers" by describing it as anybody with more than ten years of experience.

WalterSear

4 days ago

When debugging, I'll coax the AI to determine what went wrong first - to my satisfaction - and have it go from there. Otherwise it's a descent into madness.

egorfine

3 days ago

As a developer with over 30 years of experience, I face a dilemma when asked "how much AI code do you commit"?

a) Answer "a lot". This answer supports the notion that developers are a dying breed and soon everyone will be able to vibe code their own personal software. Which at this point is obviously false. This answer is detrimental to my job.

b) Answer "not much". Then this gets interpreted as "he's an old fart, he can't learn new things, we should be thinking of retiring him". Which is (hopefully) false. Which is - again - detrimental to my job.

devmor

4 days ago

I’m still yet to find a use-case for AI-generated code in my workflow.

Even when I am building tools that heavily utilize modern AI, I haven’t found it. Recently, I disabled the AI-powered code completion in my IDE because I found that the cognitive load required to evaluate the suggestions it provided was greater and more time consuming than just writing the code I was already going to write anyways.

I don’t know if this is an experience thing or not - I mainly work on a tech stack I have over a decade of experience in - but I just don’t see it.

Others have suggested generating tests with AI but I find that horrifying. Tests are the one thing you should be the most anal about accuracy on compared to anything else in your codebase.

pheelicks

3 days ago

This agrees with my experience on a project I’ve been working on this year, in particular related to porting the code. I’ve developed a strategy that I’m calling “Polyglot Mirroring” where the code is written in multiple languages at once, with LLMs handling the mirroring.

I actually made a Ask HN about it just today https://news.ycombinator.com/item?id=45091607 but for some reason the HN algorithm never even showed it on the Ask page :/

willhslade

3 days ago

I know this is late to the party but I think, given that this topic comes up.every day, we should start including statistics in our posts to get a better sense of how AI is working or not for us. I imagine a years of experience / primary language used / tool used / AI positive or negative as a percentile estimate.

Personally for me it's 20 years of experience / Python / Copilot / 85% positive experience.

demarq

3 days ago

gdubs

4 days ago

One thing I don't hear a lot of people talk about is building prototypes. That's where I see a gigantic time savings. It doesn't have to be beautiful code, it just has to help me answer a question so I can make a decision about where to go next. That and tools. There have been many times where I've wanted to build a task-specific tool but justifying the time would be hard. Now I can create little tools like that, and it's a huge productivity boost.

ath3nd

3 days ago

Related: "MIT Study Finds Artificial Intelligence Use Reprograms the Brain, Leading to Cognitive Decline"

https://publichealthpolicyjournal.com/mit-study-finds-artifi...

Checks out

smrtinsert

4 days ago

It's 80% at least for me. I've hit a groove for sure. It's not only 80% is very well tested, and almost exactly how I would have preferred. Big tips are to design your CLAUDE.md files how you would actually code from a high level perspective, and not try the usual "You are an expert Google Distributed Engineer" and all that embarassing AI hype bro crap.

JustExAWS

3 days ago

Been in the industry professionally since 1996 writing code and before that 10 years as a hobbyist between a little BASIC, C and a lot of assembly (65C02, 68k, PPC and x86).

In the last 6 months, when I have had an assignment that involved coding, AI has generated 100% of my code. I just described the abstractions I wanted and reusable modules/classes I needed and built on it.

dep_b

3 days ago

The nice thing about having AI generated code is that you get more time to focus on designing the system and it's edge cases. My code coverage (the significant metric, not the vain metric) and documentation coverage is really high because it's a very strong productivity boost when coupled with LLM generated code.

mcv

4 days ago

I would certainly hope junior developers don't rely too much on AI; they need the opportunity to learn to do this stuff themselves.

kachapopopow

4 days ago

I think at this point it's whoever can get the most useful work out of AI which is actually really hard due to their 'incomplete' state. Finding uses which require very little user input is going to be the next big thing in my opinion since it seems that LLMs are currently at a wall where they require technical advancements before they can overcome it.

csbrooks

4 days ago

Is "vibe coding" synonymous with using AI code-generation tools now?

I thought vibe coding meant very little direct interaction with the code, mostly telling the LLM what you want and iterating using the LLM. Which is fun and worth trying, but probably not a valid professional tool.

crazygringo

4 days ago

I think what happened is that a lot of people started dismissing all LLM code creation as "vibe coding" because those people were anti-LLM, and so the term itself became an easy umbrella pejorative.

And then, more people saw these critics using "vibe coding" to refer to all LLM code creation, and naturally understood it to mean exactly that. Which means the recent articles we've seen about how good vibe coding starts with a requirements file, then tests that fail, then tests that pass, etc.

Like so many terms that started out being used pejoratively, vibe coding got reclaimed. And it just sounds cool.

Also because we don't really have any other good memorable term for describing code built entirely with LLM's from the ground up, separate from mere autocomplete AI or using LLM's to work on established codebases.

actsasbuffoon

4 days ago

“Agentic coding” is probably more accurate, though many people (fairly) find the term “Agentic” to be buzz-wordy and obnoxious.

I’m willing to vibe code a spike project. That is to say, I want to see how well some new tool or library works, so I’ll tell the LLM to build a proof of concept, and then I’ll study that and see how I feel about it. Then I throw it away and build the real version with more care and attention.

drooby

4 days ago

I have "vibe coded" a few internal tools now that are very low risk in terms of negative business impact but nonetheless valuable for our team's efficiency.

E.g one tool packages a debug build of an iOS simulator app with various metadata and uploads it to a specified location.

Another tool spits out my team's github velocity metrics.

These were relatively small scripting apps, that yes, I code reviewed and checked for security issues.

I don't see why this wouldn't be a valid professional tool? It's working well, saves me time, is fun, and safe (assuming proper code review, and LLM tool usage).

With these little scripts it creates it's actually pretty quick to validate their safety and efficacy. They're like validating NP problems.

actsasbuffoon

4 days ago

The original definition of vibe coding meant that you just let the agent write everything, and if it works then you commit it. Your code review and security check turned this from vibe coding into something else.

This is complicated by the fact that some people use “vibe coding” to mean any kind of LLM-assisted coding.

biglyburrito

4 days ago

My personal definition of "vibe coding" is when a developer delegates -- abdicates, really -- responsibility for understanding & testing what AI-generated code is doing and/or how that result is achieved. I consider it something that's separate from & inferior to using AI as a development tool.

ladyprestor

4 days ago

Yeah, for some reason the term has been used interchangeably for a while, which is making it very hard to have a conversation about it since many people think vibe coding is just using AI to assist you.

From Karpathy's original post I understood it to be what you're describing. It is getting confusing.

bonoboTP

I'm not a senior developer - I pay them.

LLMs are handy in the same way I still have my slide rules and calculators (OK kids I use a calc app) but I do still have my slide rules.

ChatGPT does quite well with the basics for a simple OpenSCAD effort but invents functions within libraries. That is to be expected - its a next token decider function and not a real AI.

I find it handy for basics, very basic.

platevoltage

4 days ago

I just got back into OpenSCAD after recently getting my first new 3D Printer in 10 years, so I basically had to relearn it. ChatGPT got the syntax wrong for the most basic of operations.

gerdesj

3 days ago

Chat did work for some operations for me but not many.

My screwdrivers, drills, planes, and all the rest don't tell me how to use them and I think that you need to treat LLMs in the same way.

They can provide an insight or a prod or whatever that gets you over the line but I do not think that they are ... AI of any sort.

I can cut say 12mm of plywood with a panel saw within an accuracy of about 1mm if I am careless. If I take more care, I can do way better. Depends on the job. However, I have to keep my eye in.

Chat will get the basics right but will hallucinate badly when it comes to libraries like https://github.com/BelfrySCAD/BOSL2/wiki

Just like a slightly blunt chisel, you have to treat Chat and co in the same way - they are tools. No more and no less. You need to know how to sharpen them up (oil and a whetstone).

v3xro

3 days ago

Still negative interest in being in the users of AI-generated group. Make it work, make it not steal other people's work and we'll talk then. :)

deterministic

4 days ago

90% of my code is auto generated using custom code generation tools. No AI needed.

jedisct1

3 days ago

invl

4 days ago

as a developer my first priority is whether the software works, not whether it is fast or easy to develop

dang

4 days ago

I think we can assume that what daft_pink means by "development" includes that the software works.

("Please respond to the strongest plausible interpretation of what someone says, not a weaker one that's easier to criticize." - https://news.ycombinator.com/newsguidelines.html)

globnomulous

4 days ago

> Anything that makes development faster or easier is going to be welcomed by a good developer.

I strongly disagree. Struggling with a problem creates expertise. Struggle is slow, and it's hard. Good developers welcome it.

jasonjmcghee

4 days ago

Indeed. This is my biggest fear for engineers as a whole. LLMs can be a great productivity boost in the very short term, but can so easily be abused. If you build a product with it, suddenly everyone is an engineering manager and no one is an expert on it. And growth as an engineer is stunted. It reminds me of abusing energy drinks or grinding to the point of burnout... But worse.

I think we'll find a middle ground though. I just think it hasn't happened yet. I'm cautiously optimistic.

blackqueeriroh

3 days ago

> Struggling with a problem creates expertise. Struggle is slow, and it's hard. Good developers welcome it.

There is significant evidence that shows mixed results for struggle-based learning - it’s highly individualized and has to be calibrated carefully: https://consensus.app/search/challenge-based-learning-outcom...

globnomulous

3 days ago

I don't particularly care what this, or any, LLM spits out, and given the pervasive problems that have bedeviled social-science research, I also don't care what the results show, let alone what they show on average.

Anybody who has developed software should understand the value of struggling with a difficult problem. I'm obviously not talking about classroom exercises where the problem sets are expected to match a given skill level or cultivate a specific skill set, so the very idea of individualized, calibrated learning is irrelevant.

As a teacher I'm also 100% uninterested in highly individualized, calibrated challenges for what I teach -- or for what I do professionally. The people who need those highly individualized, wildly different, more gently graduated increases in difficulty, for general problem solving or for the study of any area of programming or computer science, simply should not become engineers.

blackqueeriroh

19 hours ago

While I appreciate that you don’t particularly care what I’ve shared, or what any social science research says, instead seeking to fair your own subjective experience and generalizing it to others, and going so far as to say that as an educator you believe people who need modified learning methods or methodologies, or even additional help and assistance to learn concepts that others may find simple don’t deserve to be engineers, some of us prefer to actually engage with data and reality.

You can operate on your own opinion all you want, but here we value data and science and facts. You have provided nothing but your own fulminations and prejudices. Enjoy them while you can, I will warn students away from taking your classes because you, sir, at a minimum, believe that anyone with a learning disability shouldn’t be an engineer, which is the fundamental definition of ableism, and failing to give accommodation, as an educator, is a violation of the ADA.

grim_io

2 days ago

Nah, don't agree that struggle adds much.

2 days ago

Not mine.

I mean maybe if you include tests, which for some projects are almost a repeating pattern.

Cloudef

3 days ago

Survey: a third of surveys say that over half of them are bullshit

andy_ppp

3 days ago

This is nonsense, if they are getting this AI slop past code review that is a disgrace. When I take solutions from AI I need to rewrite almost every line for style and project alignment reasons. There is something uncanny valley about AI code that just needs to be deleted or changed too, for example the comments it writes are usually terrible.

Maybe I can potentially believe at one point AI code was used to create a starting point for 50% of PRs…

DrewADesign

3 days ago

Many young developers’ careers are fucked for at least a decade. Their lifetime earnings will never recover from this.

percentcer

4 days ago

“I don't know half of you half as well as I should like; and I like less than half of you half as well as you deserve.”

dfxm12

4 days ago

around a third of senior developers with more than a decade of experience are using AI code-generation tools such as Copilot, Claude, and Gemini to produce over half of their finished software, compared to 13 percent for those devs who've only been on the job for up to two years.

A third? I would expect at least a majority based on the headline and tone of the article... Isn't this saying 66% are down on vibe coding?

user

4 days ago

[deleted]

babhinav

3 days ago

[dead]