Model Deployment, Cross-framework, Inference Engine, Optimization

Building “AI Disaster Response Platform” with Google Cloud Run and Gemini
ai-risk-dashboard-192565971483.asia-south1.run.app·1d·
Discuss: DEV
🤖AI Coding Tools
The Threats of Agentic AI Data Trails
blogger.com·11h
🤖AI Coding Tools
Unlocking LLMs: The Self-Steering Revolution
dev.to·14h·
Discuss: DEV
💡LSP
Deep Integration and the Convergence of Model Architecture and Hardware in AI
dev.to·9h·
Discuss: DEV
🎯Tensor Cores
Live Conversational Threads: Not an AI Notetaker
lesswrong.com·1h
🔄ONNX
pDANSE: Particle-based Data-driven Nonlinear State Estimation from Nonlinear Measurements
arxiv.org·52m
🔄ONNX
AI Inference: The Silent Budget Killer (and How to Stop It)
dev.to·1d·
Discuss: DEV
🎓Model Distillation
From Lossy to Lossless Reasoning
manidoraisamy.com·2d·
Discuss: Hacker News
🤖AI Coding Tools
Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
arxiv.org·3d
🔄ONNX
Everything You Need to Know About AI — In One Repository
dev.to·17h·
Discuss: DEV
🤖AI Coding Tools
InertialAR: Autoregressive 3D Molecule Generation with Inertial Frames
arxiv.org·52m
🏎️TensorRT
Semantic search with embeddings in JavaScript: a hands-on example using LangChain and Ollama
dev.to·12h·
Discuss: DEV
🔄ONNX
NeuraSnip: A Local Semantic Image Search Engine
github.com·14h·
Discuss: r/opensource
🔍Nsight
MCP standard
dev.to·13h·
Discuss: DEV
🚀MLOps
I made NotebookLM my personal assistant by pairing it with an agentic AI browser
xda-developers.com·14h
🤖AI Coding Tools
REMI: PostgreSQL as Agentic Core in Tiger Cloud (Agentic Postgres Challenge by Auth0)
dev.to·11h·
Discuss: DEV
🔄ONNX
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
arxiv.org·52m
🧮cuDNN
Show HN: AI agents running on 2011 Raspberry Pi with pure PHP – no GPU
github.com·13h·
Discuss: Hacker News
🔄ONNX
Revisiting Model Interpolation for Efficient Reasoning
dev.to·2h·
Discuss: DEV
🎓Model Distillation