Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Get Ready for .NET Conf 2025!
devblogs.microsoft.com·3h
💡LSP
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·2d·
Discuss: Substack
📉Model Quantization
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·1d·
Discuss: Substack
ONNX Runtime
Flag this post
How neuroscientists are using AI
thetransmitter.org·16h
ONNX Runtime
Flag this post
From Pilot to Production with Custom Judges
databricks.com·1h
🤖AI Coding Tools
Flag this post
Beyond Standard LLMs
magazine.sebastianraschka.com·8h·
Discuss: Hacker News, r/LLM
👁️Attention Optimization
Flag this post
NocoBase 2.0: Meet Your AI Employees
dev.to·1d·
Discuss: DEV
ONNX Runtime
Flag this post
Supercharging the ML and AI Development Experience at Netflix
netflixtechblog.medium.com·1h·
Discuss: Hacker News
🚀MLOps
Flag this post
Help us benchmark Hephaestus on SWEBench-Verified! Watch AI agents solve real bugs + get credited in our report
reddit.com·12h·
Discuss: r/LocalLLaMA
🤖AI Coding Tools
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.to·14h·
Discuss: DEV
⏱️CUDA Events
Flag this post
Why Agentic AI Needs a Context-Based Approach
thenewstack.io·2h
🤖AI Coding Tools
Flag this post
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.org·16h
ONNX Runtime
Flag this post
3 MCP servers you should be using (safely)
developers.redhat.com·6h
🚀MLOps
Flag this post
Hybrid Quantum-Classical Optimization of the Resource Scheduling Problem
arxiv.org·16h
📈Occupancy Optimization
Flag this post
Disciplined Biconvex Programming
arxiv.org·16h
📉Model Quantization
Flag this post
Building a Digital Twin Office with Unity, WebGL, and AI — The Workflow Simulator Project
dev.to·9h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·16h
🏎️TensorRT
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
Flash Attention
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·16h
✂️CUTLASS
Flag this post