Model Quantization, ONNX Runtime, Embedded Inference, TinyML

GoMLX: Accelerating Machine Learning with Go, GPUs, and TPUs
dev.to·16h·
Discuss: DEV
🔥Burn
Federation Over Embeddings: Let AI Agents Query Data Where It Lives
gnanaguru.com·4h·
Discuss: Hacker News
🏗️AI Infrastructure
No, Small Models Are Not the "Budget Option" (English)
mostlylucid.net·4h
💻Local LLMs
Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com·11h·
Discuss: Substack
🧠Neuromorphic Hardware
Is this legit? Supposedly LangVAE straps a VAE + compression algorithm onto any LLM image, reduces resource requirements by up to...
arxiv.org·3d·
Discuss: r/LocalLLaMA
💻Local LLMs
The 2025 Guide to Machine Learning
ibm.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to·9h·
Discuss: DEV
🤖Transformers
Show HN: Chat-DeepAI – DeepSeek pricing and getting-started guides (fan project)
chat-deepai.com·15h·
Discuss: Hacker News
🏗️AI Infrastructure
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·
Discuss: Hacker News
🔥PyTorch
Show HN: Why is ML inference still so ad-hoc in practice?
news.ycombinator.com·1d·
Discuss: Hacker News
🚀MLOps
How IntelliNode Automates Complex Workflows with Vibe Agents
towardsdatascience.com·16h
🤖AI agents
Introducing the XLab AI Security Guide
lesswrong.com·12h
🛡️Computer Security
How to Build AI-Based Recommendation Systems in Mobile Apps (2026 Guide)
vibe.forem.com·1d·
Discuss: DEV
🧠AI
A tiny AI supercomputer for your desk
youtube.com·1d·
Discuss: r/hardware
Hardware Acceleration
Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision
arxiv.org·5d
🏗️AI Infrastructure
What Deep Learning Theory Teaches Us About AI Memory
dev.to·1d·
Discuss: DEV
🧠Memory Models
Book Review: Why Machines Learn
philippdubach.com·1d·
Discuss: Hacker News
🎯Vector Databases
Your Team Uses AI. Why Aren't You 10x Faster?
bits.logic.inc·10h·
Discuss: Hacker News
🤖AI Coding Tools
Thread by @theresanaiforit on Thread Reader App
threadreaderapp.com·6h
🤖Anthropic Claude
🦉 From Broken Models to Living Systems: My Journey Building AI Without a GPU
dev.to·2d·
Discuss: DEV
🏗️AI Infrastructure