Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Hypernetworks: Neural Networks for Hierarchical Data
sturdystatistics.com·4d
🤝Federated Learning
Ghosts in the Code: A Memorial Grove for Deleted AI
connectingminds.uk·1d
🧠AI
vLLM Predicted Outputs
cascadetech.ai·18h
💻Local LLMs
Everyday AI Agents
oreilly.com·1d
🤖AI agents
Show HN: I built a local AI agent desk toy
blog.simone.computer·2d
🎙️Whisper
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.org·1d
🤖AI agents
AI-Driven Ethical Risk Assessment & Mitigation in Supply Chain Compliance
dev.to·11h
🧠AI
Harmonizing AI Voices: Bridging the Gap in Intelligent Communication
dev.to·3d
🎤Voice Interfaces
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
arxiv.org·1d
🏗️AI Infrastructure
Tech With Tim: Cancel Your AI subscriptions | This All-in-one AI is All You Need (ChatLLM Review)
dev.to·49m
🧠AI
When AI Learns to Think
dev.to·2d
🏗️AI Infrastructure
Tech With Tim: How to Build AI Agents in Python
dev.to·6h
🤖AI agents
Bridging Reasoning to Learning: Unmasking Illusions using Complexity Out of Distribution Generalization
arxiv.org·2d
🏗️AI Infrastructure
Optimal Stopping in Latent Diffusion Models
arxiv.org·1d
💻Local LLMs
Beyond Transformers: Can MLPs Unlock the Potential of In-Context Learning?
dev.to·3d
🏗️AI Infrastructure
Krish Naik: Agentic AI 3.0 - Live Ultimate RAG Bootcamp Course Announcement
dev.to·6h
🤖AI agents
94% of Developers Waste Tokens on Reasoning LLMs. Here's Why.
dev.to·1d
🤖AI agents
Small Language Models for Agentic Systems: A Survey of Architectures, Capabilities, and Deployment Trade offs
arxiv.org·4d
🤖AI agents
TRIM: Token-wise Attention-Derived Saliency for Data-Efficient Instruction Tuning
arxiv.org·2d
💻Local LLMs
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.org·1d
💻Local LLMs