Open-source Models

Feeds to Scour
SubscribedAll
Scoured 500 posts in 4.6 ms

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

 🧠LLMs
everylocalai.com··DEV

alexziskind1/model-shelf: Model Shelf is a local-first model resolver that helps AI agents and scripts find model weights on your own storage before downloading from Hugging Face. Point it at an internal SSD, NAS, external SSD, or Thunderbolt DAS, and it returns the best local path for GGUF, MLX, safetensors, Ollama, vLLM, and other local AI workflows.

 Quantization  Content type: Code
github.com·

How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions

 💹AI in Finance  Content type: Academic
arxiv.org·

Government aims to make UK top spot for open source AI

 💡AI Reasoning  Content type: News
computerweekly.com
·

Qwen 3.6 27B AutoRound GGUF, need your feedback

 Quantization

You don't need Copilot for code completion, try this instead

 🕵️AI Agents

DiffusionGemma

 🧠LLMs
simonwillison.net·

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

 🦾Robotics  Content type: News
hackster.io·

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 🧠LLMs  Content type: Blog
blogs.nvidia.com·

Qwen3.7-Plus is Alibaba's bid to turn multimodal AI into a full-blown autonomous agent

 🕵️AI Agents
the-decoder.com
·

"North Mini Code"; open weights, 30B param, Canadian coding model

 🔧Tool Use  Content type: Blog

Google’s DiffusionGemma is 4x faster than its other Gemma models

 🖥️Inference Compute
thenewstack.io·

Malicious Hugging Face Models Could Trigger Remote Code Execution

 Quantization
techrepublic.com·

What's in the Box? A Field Guide to AI Models

 🧠LLMs  Content type: Blog
iankduncan.com·

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

 🕵️AI Agents
xda-developers.com·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🧠LLMs  Content type: News  Content type: Blog
blog.google··Hacker News

Previewing nAnalyst, the layer that finally explains your network

 👁️VLMs
ntop.org·

Tell HN: Anthropic's Fable model is too expensive

 🧠LLMs  Content type: Discussion

I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why

 🧠LLMs  Content type: News  Content type: Tutorial
zdnet.com·

Improved performance and model support with GGUF

 🧠LLMs  Content type: Blog
ollama.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help