Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Get Ready for .NET Conf 2025!
devblogs.microsoft.comยท1h
๐Ÿ’กLSP
Flag this post
Inside Pinecone: Slab Architecture
pinecone.ioยท2hยท
Discuss: Hacker News
๐Ÿ”ฒLoop Tiling
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.comยท2dยท
Discuss: Substack
๐Ÿ“‰Model Quantization
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.comยท1dยท
Discuss: Substack
โšกONNX Runtime
Flag this post
Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
dev.toยท2hยท
Discuss: DEV
๐Ÿ“ŠGradient Accumulation
Flag this post
How neuroscientists are using AI
thetransmitter.orgยท14h
โšกONNX Runtime
Flag this post
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.orgยท14h
๐Ÿ“ŠGradient Accumulation
Flag this post
Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
arxiv.orgยท14h
๐ŸŽ๏ธTensorRT
Flag this post
A Decade of AI Platform at Pinterest
medium.comยท1h
๐Ÿš€MLOps
Flag this post
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
arxiv.orgยท14h
โšกONNX Runtime
Flag this post
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
arxiv.orgยท14h
๐ŸŽ๏ธTensorRT
Flag this post
Beyond Standard LLMs
magazine.sebastianraschka.comยท5hยท
Discuss: Hacker News, r/LLM
๐Ÿ‘๏ธAttention Optimization
Flag this post
NocoBase 2.0: Meet Your AI Employees
dev.toยท1dยท
Discuss: DEV
โšกONNX Runtime
Flag this post
Augmenting learning in neuro-embodied systems through neurobiological first principles
arxiv.orgยท14h
๐Ÿ“ŠGradient Accumulation
Flag this post
Help us benchmark Hephaestus on SWEBench-Verified! Watch AI agents solve real bugs + get credited in our report
reddit.comยท9hยท
Discuss: r/LocalLLaMA
๐Ÿค–AI Coding Tools
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.toยท11hยท
Discuss: DEV
โฑ๏ธCUDA Events
Flag this post
3 MCP servers you should be using (safely)
developers.redhat.comยท3h
๐Ÿš€MLOps
Flag this post
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.orgยท14h
โšกONNX Runtime
Flag this post
Hybrid Quantum-Classical Optimization of the Resource Scheduling Problem
arxiv.orgยท14h
๐Ÿ“ˆOccupancy Optimization
Flag this post