Model Quantization, ONNX Runtime, Embedded Inference, TinyML

VLLM Predicted Outputs
cascadetech.ai·13h·
Discuss: Hacker News
💻Local LLMs
LLM Optimization Notes: Memory, Compute and Inference Techniques
gaurigupta19.github.io·4d·
Discuss: Hacker News
💻Local LLMs
Is Architectural Complexity Always the Answer? A Case Study on SwinIR vs. an Efficient CNN
arxiv.org·1d
🏗️AI Infrastructure
Enhanced SoC Design via Adaptive Topology Optimization with Reinforcement Learning
dev.to·1d·
Discuss: DEV
🧩RISC-V
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
arxiv.org·1d
🏗️AI Infrastructure
What to Look For in Image Annotation Services Today
dev.to·1d·
Discuss: DEV
🏗️AI Infrastructure
Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness
arxiv.org·2d
💻Local LLMs
ECLipsE-Gen-Local: Efficient Compositional Local Lipschitz Estimates for Deep Neural Networks
arxiv.org·3d
💻Local LLMs
Context-Aware Inference via Performance Forecasting in Decentralized Learning Networks
arxiv.org·2d
🤝Federated Learning
Decentralized Intelligence: Empowering Autonomous Systems with Localized Learning by Arvind Sundararajan
dev.to·1d·
Discuss: DEV
🤝Federated Learning
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling
arxiv.org·3d
🏗️AI Infrastructure
Show HN: I built a local AI agent desk toy
blog.simone.computer·2d·
Discuss: Hacker News
🎙️Whisper
HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation
arxiv.org·1d
🏗️AI Infrastructure
OpenAI Is Catching Up To Anthropic in AI Coding - The Information
news.google.com·1d
🧠AI
Automated Anomaly Detection in Account Takeover via Multi-Modal Graph Neural Network Fusion
dev.to·6h·
Discuss: DEV
💻Local LLMs
Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response
arxiv.org·3d
🏗️AI Infrastructure
Thousands of AI Authors on the Future of AI
arxiv.org·1d
🏗️AI Infrastructure
Effective and Stealthy One-Shot Jailbreaks on Deployed Mobile Vision-Language Agents
arxiv.org·1d
🤖AI agents
AI-Driven Ethical Risk Assessment & Mitigation in Supply Chain Compliance
dev.to·6h·
Discuss: DEV
🧠AI