Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Custom AI models in hours not months with auto Data Synth and LLM-as-a-Judge
blog.oumi.ai·23h·
Discuss: Hacker News
🏗️AI Infrastructure
Neural Networks from Scratch in Python: Simpler Than You Think
hamza.se·3h·
Discuss: Hacker News
🧠Neuromorphic Hardware
Scalable Semantic Map Generation via Hierarchical Graph Optimization
dev.to·3h·
Discuss: DEV
🏗️AI Infrastructure
InferenceMAX: Open-Source Inference Benchmarking
newsletter.semianalysis.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
Evaluating Gemini 2.5 Deep Think's math capabilities
epoch.ai·10h·
Discuss: Hacker News
🏗️AI Infrastructure
Neuro-Symbolic AI
en.wikipedia.org·9h·
Discuss: Hacker News
🧠Neuromorphic Chips
Self-Improving LLM Agents at Test-Time
arxiv.org·19h
🤖AI agents
GPT-5 for AI-assisted discovery
johndcook.com·8h·
Discuss: Hacker News
🏗️AI Infrastructure
SPAD: Specialized Prefill and Decode Hardware for Disaggregated LLM Inference
arxiv.org·19h·
Discuss: r/LLM
💻Local LLMs
In-Depth Analysis: "Attention Is All You Need"
dev.to·8h·
Discuss: DEV
🏗️AI Infrastructure
Unlocking Image Understanding: A New Path to Visual AI for Everyone
dev.to·1d·
Discuss: DEV
🏗️AI Infrastructure
Reflection raises $2B to be America’s open frontier AI lab, challenging DeepSeek
techcrunch.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
LLM-Based AI Agent That Automates The Transistor Sizing Process (Univ. of Edinburgh)
semiengineering.com·2h
🧠Neuromorphic Chips
A Manifesto for the Programming Desperado
github.com·7h·
Discuss: Hacker News
🧩Low-code
n8n raises $180M to get AI closer to value with orchestration
blog.n8n.io·1d·
Discuss: Hacker News
🧠AI
Three Solutions to Nondeterminism in AI
blog.hellas.ai·2d·
Discuss: Hacker News
💻Local LLMs
SliceMoE: Routing Embedding Slices Instead of Tokens for Fine-Grained and Balanced Transformer Scaling
arxiv.org·3d
🏠Self-hosted AI
Evaluating Small Vision-Language Models on Distance-Dependent Traffic Perception
arxiv.org·19h
👥Digital Twins
11+ Best All-in-One AI Platforms in 2025
dev.to·1d·
Discuss: DEV
🏗️AI Infrastructure