llama.cpp now supports model management (downloading etc) via API (opens in new tab)
Overview Add these APIs: GET /models/sse: Real-time events POST /models: Download new model DELETE /models: Delete a model from cache Refer to server's docs for details Requirements I ha...
Read the original article