oc1
7 months ago
That in my experience is never the bottleneck, at least not for professionals. Letting Claude code is the easy part of the job. Gathering the requirements, Drafting a good story for claude, guiding claude through the endless mistakes it usually commits, reviewing the output of claude, steering the ship, this is the bottleneck. Unfortunately, no AI can't steer the ship currently, not o3 pro, not gemini, not opus 4, not any of their fancy cli agent tools, no matter how clever the md instruction files and other gimmicks. And boy, i'd be the first one to cheer if AI could do this. But currently, it's useless without fulltime attention of a senior experienced human.
suchuanyi
7 months ago
If you’re on a Pro account, it’s common to hit the usage limit in the middle of a long-running task. Claude Code will tell you you’re out of quota, and the reset time might be something like 3 AM.
If you’re asleep by then, you miss the chance to resume right when it resets. The script is just a workaround to automatically pick up where you left off as soon as the quota is restored.
stavros
7 months ago
Exactly. I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!), and the biggest lesson I learnt is that I need to pay attention to what Claude Code does. I need to be specific in my instructions, and I need to review the output, because it will sometimes not know how to do things and it will never say "I don't know how to do that", it'll just endlessly write more and more code to try to appease you.
I think I'm faster with Claude Code overall, but that's because it's a tradeoff between "it makes me much faster at the stuff I'm not good at" and "it slows me down on the stuff I am good at". Maybe with better prompting skills, I'll be faster overall, but I'm definitely glad I don't have to write all the boilerplate yet again.
_pdp_
7 months ago
> I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!), and the biggest lesson I learnt is that I need to pay attention to what Claude Code does.
It is part of the learning curve discovering that making a non-deterministic system act deterministically based on some vague instructions is actually pretty difficult.
stavros
7 months ago
I don't need it to act deterministically, I just need it to be correct.
taormina
7 months ago
I mean, you need to deterministically produce correctness.
_alternator_
7 months ago
Most of us would be happy with the “probably approximately correct” standard for Claude code. ;)
https://en.m.wikipedia.org/wiki/Probably_approximately_corre...
audessuscest
7 months ago
> I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!)
you mean another wrapper ?
stavros
7 months ago
Do you know of any good ones?
audessuscest
7 months ago
there's plainly, just run a search on perplexity or else. Most YC startup are AI wrapper in some sort...
stavros
7 months ago
I haven't been able to find any AI personal assistants, please point me to any you may know.
com2kid
7 months ago
The entire market is focused on productivity ones that all have the same feature set, no real originality in the offerings IMHO.
There are a lot of low hanging fruit that can be tackled. From helping people stay focused to managing household schedules. I honestly don't even think the product has to be SaSS, a 16GB VRAM GPU, or a well equipped MacBook, can do everything with locally.
zaphirplane
7 months ago
They keep asking for a specific recommendation and gets there are tons out there
lelanthran
7 months ago
> Exactly. I've spent the weekend trying my hand at making an AI assistant SaaS (I can't believe this doesn't exist yet!)
I'm not sure what this means; what exactly is an AI assistant SaaS? There are plenty of wrappers around LLMs that you can use, but I'm guessing that a wrapper around (for example) the ChatGPT or Claude API isn't what you had in mind, right?
Can you explain?
stavros
7 months ago
I mean an AI PA. Someone to manage my calendar, todos, personal documentation, emails, etc. Just something to help with general life admin.
amelius
7 months ago
> And boy, i'd be the first one to cheer if AI could do this.
Yeah, well it would be the next major step towards human irrelevance.
Or at least, for developers.
IgorPartola
7 months ago
I don’t see it that way. Even if Claude could give me code without hallucinating (in my experience it is a 30-35% success rate on giving me code that actually works and doesn’t use APIs it makes up as it goes), it cannot come up with real world problems to solve. For example, it isn’t going to notice that I need help managing my calendars and want an AI assistant that can read my calendar and email me my agenda for the day and the week, find scheduling conflicts, and suggest dates and times that align with social norms and my habits for get togethers with friends. It cannot notice that my car’s Bluetooth prioritized the last phone it was connected to and not my phone. It cannot notice that my 3D printer has a frame skew that needs to be corrected. It cannot notice that a set of solar panels could be optimized with a bunch of liners actuators and a cloud tracking camera. Those are meatspace problems that Claude cannot see. It might get more capable but it can’t design a product or a service.
stavros
7 months ago
Coincidentally, I want this AI assistant as well. I built a proof of concept and it worked really well, so I'm building a more multi-user version so my friends can use it as well.
The really nice thing about it is that I gave it memory, so a lot of these behaviours are just things you teach it. For example, I only programmed it to be able to read my calendar and add events, and then told it "before you add an event, check for conflicts" and it learned to do that. I really like that sort of "programming by saying stuff" that LLMs enable.
I'm looking forward to seeing where this experiment goes, email me if you want access/want to discuss features. I don't know if I'll open it up to everyone, as LLMs are costly, but maybe I could do a "bring your own key" thing.
mstkllah
7 months ago
Do you have a demo of it one can check?
stavros
7 months ago
I only started working on it on Saturday, so nothing very useful yet. It's at https://www.askhuxley.com/ and I want to focus on making something really useful instead of on monetization, so you'll have to bring your own API key, but I'd love it if you wanted to bounce some ideas and use cases off each other. I know what things are useful to me, but I don't know what's useful to other people, and I'd love to get new ideas.
Feel free to email me (email in profile) if you'd like to try it out. Right now it only does weather and Google Calendar, but adding new integrations is easy and the interesting thing is the fact that it can learn from you, and will behave like a PA would, while also being proactive and messaging you without you having to message it first.
I did make a prototype a while ago, which I integrated with a hardware device, and that was extremely useful, being able to do things by me teaching it. For example, it only had access to my calendar and its memories, but I told it (in chat) to check for and notify me of conflicts before adding an event, and told it the usual times of some events, so then I'd say "I'm playing D&D on Thursday" and it would reply with "you can't, you have an appointment at 8PM". This sounds simple for a human, but the LLM had to already know what time D&D was, that it's something I can't do during appointments, and that I wanted to be informed of conflicts, which are all things that I didn't have to program, but just instructed the LLM to do.
amelius
7 months ago
Yes, but product designers can talk to the AI and they wouldn't need developers to implement their ideas.
IgorPartola
7 months ago
For some things, yes. For a lot of things, not really. Think of it like this: if product designers could just treat human software developers as idea -> code translators then we wouldn’t have developers making quite so many crucial decisions. Just translate the spec to code. But in reality engineers end up making the bulk of the decisions and often drive the product’s direction because of what is possible. Engineers also think up base tech that enables new products. An AI cannot stand in front of a radar dish with a sandwich in its pocket to discover microwave ovens.
danjl
7 months ago
While true, this comment has nothing to do with hitting usage limits, which happens fairly frequently, often daily.
whazor
7 months ago
It helps when the agent automatically uses diagnostics, logs, and tests. If it resolves two or three problems by itself, that saves you the effort of doing so.
karaterobot
7 months ago
This project is about getting around the extremely aggressive throttling Claude applies.