Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

What I learned building Python notebooks to run any AI model (LLM, Vision, Audio) — across CPU, GPU, and NPU
reddit.com·15h·
Discuss: r/programming
ONNX Runtime
Flag this post
Cline: The Fastest Growing AI Open Source Project on GitHub in 2025, Thanks to You
cline.ghost.io·7h
🤖AI Coding Tools
Flag this post
Practical Design Patterns for Agentic Systems
pub.towardsai.net·1d
🤖AI Coding Tools
Flag this post
Probabilistic Robustness for Free? Revisiting Training via a Benchmark
arxiv.org·4h
📊Gradient Accumulation
Flag this post
Self-Harmony: Learning to Harmonize Self-Supervision and Self-Play in Test-Time Reinforcement Learning
arxiv.org·4h
🏎️TensorRT
Flag this post
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
arxiv.org·4h
ONNX Runtime
Flag this post
From Uniform to Adaptive: General Skip-Block Mechanisms for Efficient PDE Neural Operators
arxiv.org·4h
🏎️TensorRT
Flag this post
Augmenting learning in neuro-embodied systems through neurobiological first principles
arxiv.org·4h
📊Gradient Accumulation
Flag this post
NocoBase 2.0: Meet Your AI Employees
dev.to·19h·
Discuss: DEV
ONNX Runtime
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.to·2h·
Discuss: DEV
⏱️CUDA Events
Flag this post
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.org·4h
ONNX Runtime
Flag this post
Hybrid Quantum-Classical Optimization of the Resource Scheduling Problem
arxiv.org·4h
📈Occupancy Optimization
Flag this post
Deploy an LLM inference service on OpenShift AI
developers.redhat.com·1d
ONNX Runtime
Flag this post
Disciplined Biconvex Programming
arxiv.org·4h
📉Model Quantization
Flag this post
Observability Made Easy: How AI & OpenTelemetry Tame Tool Sprawl
dev.to·5h·
Discuss: DEV
🐕Ruff
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·4h
🏎️TensorRT
Flag this post
🛡️ Fortify - AI-Powered Security Analysis Platform
dev.to·15h·
Discuss: DEV
💡LSP
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
Flash Attention
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·4h
✂️CUTLASS
Flag this post