skhameneh
2 hours ago
For tools... I find Aider to be the most reliable and least "invasive" (from following instructions and nothing more standpoint), but it's also the most hands-on. Cline is a nice step up in hand holding, but does require a significantly larger context size. Roo Code is a fork of Cline that's more focused on agentic functionality, but it also has some very poor default prompts and is much more finicky.
For running inference... Ollama is the easiest, but generally the slowest. Llama.cpp is a tad more involved, but faster. vLLM is even more involved, but even faster. Sglang can be difficult and sometimes challenging, but tends to be the fastest.