oc1
9 hours ago
That in my experience is never the bottleneck, at least not for professionals. Letting Claude code is the easy part of the job. Gathering the requirements, Drafting a good story for claude, guiding claude through the endless mistakes it usually commits, reviewing the output of claude, steering the ship, this is the bottleneck. Unfortunately, no AI can't steer the ship currently, not o3 pro, not gemini, not opus 4, not any of their fancy cli agent tools, no matter how clever the md instruction files and other gimmicks. And boy, i'd be the first one to cheer if AI could do this. But currently, it's useless without fulltime attention of a senior experienced human.
stavros
8 hours ago
Exactly. I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!), and the biggest lesson I learnt is that I need to pay attention to what Claude Code does. I need to be specific in my instructions, and I need to review the output, because it will sometimes not know how to do things and it will never say "I don't know how to do that", it'll just endlessly write more and more code to try to appease you.
I think I'm faster with Claude Code overall, but that's because it's a tradeoff between "it makes me much faster at the stuff I'm not good at" and "it slows me down on the stuff I am good at". Maybe with better prompting skills, I'll be faster overall, but I'm definitely glad I don't have to write all the boilerplate yet again.
_pdp_
7 hours ago
> I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!), and the biggest lesson I learnt is that I need to pay attention to what Claude Code does.
It is part of the learning curve discovering that making a non-deterministic system act deterministically based on some vague instructions is actually pretty difficult.
stavros
7 hours ago
I don't need it to act deterministically, I just need it to be correct.
audessuscest
7 hours ago
> I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!)
you mean another wrapper ?
stavros
7 hours ago
Do you know of any good ones?
audessuscest
6 hours ago
there's plainly, just run a search on perplexity or else. Most YC startup are AI wrapper in some sort...
stavros
4 hours ago
I haven't been able to find any AI personal assistants, please point me to any you may know.
amelius
9 hours ago
> And boy, i'd be the first one to cheer if AI could do this.
Yeah, well it would be the next major step towards human irrelevance.
Or at least, for developers.
IgorPartola
9 hours ago
I don’t see it that way. Even if Claude could give me code without hallucinating (in my experience it is a 30-35% success rate on giving me code that actually works and doesn’t use APIs it makes up as it goes), it cannot come up with real world problems to solve. For example, it isn’t going to notice that I need help managing my calendars and want an AI assistant that can read my calendar and email me my agenda for the day and the week, find scheduling conflicts, and suggest dates and times that align with social norms and my habits for get togethers with friends. It cannot notice that my car’s Bluetooth prioritized the last phone it was connected to and not my phone. It cannot notice that my 3D printer has a frame skew that needs to be corrected. It cannot notice that a set of solar panels could be optimized with a bunch of liners actuators and a cloud tracking camera. Those are meatspace problems that Claude cannot see. It might get more capable but it can’t design a product or a service.
stavros
5 hours ago
Coincidentally, I want this AI assistant as well. I built a proof of concept and it worked really well, so I'm building a more multi-user version so my friends can use it as well.
The really nice thing about it is that I gave it memory, so a lot of these behaviours are just things you teach it. For example, I only programmed it to be able to read my calendar and add events, and then told it "before you add an event, check for conflicts" and it learned to do that. I really like that sort of "programming by saying stuff" that LLMs enable.
I'm looking forward to seeing where this experiment goes, email me if you want access/want to discuss features. I don't know if I'll open it up to everyone, as LLMs are costly, but maybe I could do a "bring your own key" thing.
amelius
5 hours ago
Yes, but product designers can talk to the AI and they wouldn't need developers to implement their ideas.
whazor
6 hours ago
It helps when the agent automatically uses diagnostics, logs, and tests. If it resolves two or three problems by itself, that saves you the effort of doing so.
karaterobot
7 hours ago
This project is about getting around the extremely aggressive throttling Claude applies.