🔓 Open-source Models - zongyuzhang · Scour

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

everylocalai.com··DEV

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

⚡Quantization Code

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

💹AI in Finance Academic

Government aims to make UK top spot for open source AI

💡AI Reasoning News

computerweekly.com

·

Qwen 3.6 27B AutoRound GGUF, need your feedback

⚡Quantization

huggingface.co··r/LocalLLaMA

You don't need Copilot for code completion, try this instead

🕵️AI Agents

mistral.ai··r/GithubCopilot

DiffusionGemma

simonwillison.net·

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

🦾Robotics News

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🧠LLMs Blog

blogs.nvidia.com·

Qwen3.7-Plus is Alibaba's bid to turn multimodal AI into a full-blown autonomous agent

🕵️AI Agents

the-decoder.com

·

"North Mini Code"; open weights, 30B param, Canadian coding model

🔧Tool Use Blog

cohere.com··Hacker News, Hacker News

Google’s DiffusionGemma is 4x faster than its other Gemma models

🖥️Inference Compute

thenewstack.io·

Malicious Hugging Face Models Could Trigger Remote Code Execution

⚡Quantization

techrepublic.com·

What's in the Box? A Field Guide to AI Models

🧠LLMs Blog

iankduncan.com·

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

🕵️AI Agents

xda-developers.com·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

🧠LLMs News Blog

blog.google··Hacker News

Previewing nAnalyst, the layer that finally explains your network

Tell HN: Anthropic's Fable model is too expensive

🧠LLMs Discussion

news.ycombinator.com··Hacker News

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

🧠LLMs News Tutorial

Improved performance and model support with GGUF

🧠LLMs Blog

Log in to enable infinite scrolling