Quantization

FADA: Accessible fetal ultrasound interpretation and annotation with a selectively distilled unified vision-language model

 👁️VLMs  Content type: Academic
arxiv.org·

The latest Gemma 4 models use a training trick to slash their on-device memory footprint

 🔓Open-source Models
androidauthority.com·

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

 🔓Open-source Models  Content type: Code
github.com··Hacker News

google/gemma-4-12B-it-qat-q4_0-gguf

 🔓Open-source Models
huggingface.co·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🕵️AI Agents  Content type: Blog
adambien.blog·

stable-diffusion.cpp/docs/quantization_and_gguf.md at master · leejet/stable-diffusion.cpp

 🔓Open-source Models  Content type: Code

NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain

 🔓Open-source Models  Content type: Blog
huggingface.co·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🔓Open-source Models
smolhub.com··r/LocalLLaMA

Dew Drop - June 8, 2026 (#4685)

 🕵️AI Agents
alvinashcraft.com·

Benchmarking dots.tts on Strix Halo

 🖥️Inference Compute
sleepingrobots.com·

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

 🧠LLMs  Content type: Code
github.com·

Nvidia's RTX Spark is a developer's dream, but AMD's Ryzen AI Max+ is what most people actually need for local AI

 🔓Open-source Models
xda-developers.com·

iChristGit/comfyui-llamacpp-ideogram: ComfyUI Prompt enhancer for ideogram4 powered by llama cpp

 💡AI Reasoning  Content type: Code

Job Searcher

 🔓Open-source Models  Content type: Blog
huggingface.co·

JinXSuper/gwenland: GwenLand — AI toolkit. Local-first, <50MB, zero Python.

 🧠LLMs  Content type: Code
github.com··DEV

lajjadred/comfyui-lrw-nodes: ComfyUI custom nodes for Riemannian geometry and Bayesian latent space manipulation

 🔓Open-source Models  Content type: Code

Week 1 of building Quantamind: Ditching Electron for Rust & Tauri 🦀

 🔓Open-source Models  Content type: Code
github.com··DEV

[AINews] not much happened today

 🕵️AI Agents  Content type: News
latent.space·

Show HN: TuringLLM – a LLM-powered Universal Turing machine

 🧠LLMs  Content type: Code
github.com··Hacker News

sk8erboi17/DStudio: Run DeepSeek V4 fully local: chat, a coding agent, and a design studio in one private desktop app.

 🔓Open-source Models  Content type: Code
github.com··Hacker News
