Introducing LlamaStash: a zero-overhead, terminal-native llama.cpp launcher
Originally published at deepu.tech. In my recent post about my fully offline AI-assisted Linux development machine, I dropped a small detail near the bottom: I run my local model with an alias, llamaServer. I described it as "a small script. It lets me pick a GGUF model, context size, and reasoning mode. It remembers the last choice, so most of the time I just start it and get going." That script grew up. Today I'm releasing LlamaStash, the first public release of a fast, cross-platform, termi...