Open Source LLMs

Feeds to Scour
SubscribedAll
Scoured 694 posts in 8.7 ms

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🧠LLMs  Content type: Blog
adambien.blog·

Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM

 🛠️AI Tooling
everylocalai.com··DEV

BeeLlama.cpp DFlash on Strix Halo: 2.7x Gemma 31B, But MTP Is Still Faster

 🐋DeepSeek
sleepingrobots.com·

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 💡AI

Google Gemma 4 12B brings native multimodal AI to standard laptops

 🧠LLMs
4sysops.com·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 🧠LLMs
androidauthority.com·

DiffusionGemma: The Developer Guide

 💡AI  Content type: Blog
developers.googleblog.com·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 💡AI  Content type: Code
github.com··Hacker News

"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY

 🛠️AI Tooling  Content type: News  Content type: Blog

Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB

 🛠️AI Tooling  Content type: Blog
ziraph.com··Hacker News

Aspen: Own your intelligence

 🐋DeepSeek  Content type: Discussion  Content type: Tutorial
runonaspen.com··Hacker News

Gemma Collins’ mum rushed to hospital as I’m A Celeb star says she’s ‘so worried she can’t sleep’

 🛡️AI Safety  Content type: News
thesun.co.uk·

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🧠LLMs

DiffusionGemma: 4x Faster Text Generation

 💡AI  Content type: News  Content type: Blog

Fixing a stuck Ollama runner and building a GPU watchdog

 🛠️AI Tooling

Google DeepMind releases Gemma 4 QAT, but Unsloth developer Daniel Han warns naive llama.cpp conversions suffer accuracy loss

 🧠LLMs  Content type: News
digg.com·

Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings

 🛠️AI Tooling
posts.inthecyber.com·

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

 🧠LLMs  Content type: Academic
arxiv.org·

Google Gemma4 12B released

 💡AI  Content type: Blog
medium.com·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 🧠LLMs  Content type: News  Content type: Blog
developer.nvidia.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help