Model Deployment, Cross-framework, Inference Engine, Optimization

Automated Anomaly Detection & Root Cause Analysis in Complex System Simulations via Adaptive Bayesian Networks
dev.to·13h·
Discuss: DEV
🔄ONNX
Flag this post
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·2d
🔄ONNX
Flag this post
Fortytwo's decentralized AI has the answer to life, the universe, and everything
theregister.com·1h
🔄ONNX
Flag this post
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for LargeVision-and-Language Models
paperium.net·8h·
Discuss: DEV
🏎️TensorRT
Flag this post
Feature Infrastructure Engineering: A Comprehensive Guide
mlfrontiers.substack.com·17h·
Discuss: Substack
🔄ONNX
Flag this post
[Open Source] We deployed numerous agents in production and ended up building our own GenAI framework
reddit.com·1d·
Discuss: r/LocalLLaMA
🚀MLOps
Flag this post
Ubuntu Blog: Why we brought hardware-optimized GenAI inference to Ubuntu
ubuntu.com·3d
🔄ONNX
Flag this post
Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
arxiv.org·2d
🔄ONNX
Flag this post
From Parrot to Partner - How Reinforcement Learning Taught LLMs to Talk Like Humans
dev.to·1h·
Discuss: DEV
🎓Model Distillation
Flag this post
Built with Go for pure performance, LucidArc - The AI-Powered Command Bar For Windows
lucidquery.com·10h·
Discuss: r/golang
🤖AI Coding Tools
Flag this post
Context Engineering: The Foundation for Reliable AI Agents
thenewstack.io·1d
🤖AI Coding Tools
Flag this post
Hybrid Neuro-Symbolic Reasoning for Adaptive Robotics Control in Dynamic Environments
dev.to·2h·
Discuss: DEV
📊Gradient Accumulation
Flag this post
Physics informed machine learning based predictive control for intelligent operation of edge datacenters
sciencedirect.com·13h
🎓Model Distillation
Flag this post
How to Build Your First MCP Server using FastMCP
hackernoon.com·2d
🚀MLOps
Flag this post
A Beginner’s Guide to Getting Started with add_messages Reducer in LangGraph
langcasts.com·2d·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Speedrunning an RL Environment
sidb.in·23h·
Discuss: Hacker News
📜TorchScript
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·11h·
Discuss: Substack
🧩Attention Kernels
Flag this post
From hours to seconds: AI tools to detect animal calls
seangoedecke.com·3h·
Discuss: Hacker News
📉Model Quantization
Flag this post
A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
paperium.net·12h·
Discuss: DEV
🔄ONNX
Flag this post