Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Continuous Autoregressive Language Models
shaochenze.github.io·12h·
Discuss: Hacker News
💬Prompt Engineering
Flag this post
Unlocking the Brain of AI: How Neural Networks Are Changing Everything
dev.to·20m·
Discuss: DEV
👁️Computer Vision
Flag this post
Gen AI Grows Up: Building Production-Ready Agents on the JVM • Rod Johnson • GOTO 2025
youtube.com·2h
⚙️JIT Compilation
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·2d·
Discuss: Substack
💬Prompt Engineering
Flag this post
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example )
nbtab.com·1d·
Discuss: DEV
💬Prompt Engineering
Flag this post
Automated Defect Prediction via Cross-Entropy Regularized Graph Neural Networks for Microservice Architectures
dev.to·1d·
Discuss: DEV
🎯Microservices
Flag this post
An introduction to program synthesis (Part II) - Automatically generating features for machine learning
mchav.github.io·4h·
Discuss: r/programming
🎭Program Synthesis
Flag this post
Ubuntu Blog: Edge Networking gets smarter: AI and 5G in action
ubuntu.com·8h
🌍Edge Computing
Flag this post
From logs to insights: The AI breakthrough redefining observability
venturebeat.com·10h
🔭Tracing
Flag this post
'No Free Lunch: Deconstruct Efficient Attention with MiniMax M2'
lmsys.org·1d
📊Profile-Guided Optimization
Flag this post
Beyond Scarcity: How LLM-Driven Synthetic Data Generation is Reshaping AI
pub.towardsai.net·10h
💬Prompt Engineering
Flag this post
Radar Trends to Watch: November 2025
oreilly.com·1d
🎭Program Synthesis
Flag this post
Unlocking AI Vision with the Wisdom of Cats: Building Generalizable Models
dev.to·6h·
Discuss: DEV
👁️Computer Vision
Flag this post
Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph
huggingface.co·11h·
Discuss: Hacker News
🎴TAO
Flag this post
Feature Stores 2.0: The Next Frontier of Scalable Data Engineering for AI
hackernoon.com·10h
🎨Design Systems
Flag this post
Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·15h·
Discuss: Hacker News
Hardware Acceleration
Flag this post
The Infrastructure of Modern Ranking Systems, Part 3: The MLOps Backbone - From Training to Deployment
shaped.ai·2d
🚀MLOps
Flag this post
Choosing the best AI coding agent for Bitrise
bitrise.io·17h·
Discuss: Hacker News
🎭Program Synthesis
Flag this post