Self-hosted AI
Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
💻Local LLMs Content type: News Content type: Blogzhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
💻Local LLMs Content type: CodeNo more posts from nmarshall's subscribed feeds.