Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·1d
🧠CPU Architecture
Flag this post
I made a tensor runtime & inference framework in C (good for learning how inference works)
github.com·11h·
📜TorchScript
Flag this post
Small Vs. Large Language Models
semiengineering.com·4h
Flash Attention
Flag this post
Rethinking Networking for the AI/ML Era
lukew.com·2d
🌐Distributed Computing
Flag this post
Opportunistically Parallel Lambda Calculus
dl.acm.org·3d·
Discuss: Hacker News
💡LSP
Flag this post
Dynamic Resource Allocation in CXL-Enabled Heterogeneous Compute Clusters
dev.to·1d·
Discuss: DEV
🔍Nsight
Flag this post
Torchforge – a PyTorch native library for scalable RL post-training
pytorch.org·4d·
Discuss: Hacker News
📜TorchScript
Flag this post
EP187: Why is DeepSeek-OCR such a BIG DEAL?
blog.bytebytego.com·1d
🤖AI Coding Tools
Flag this post
Live Conversational Threads: Not an AI Notetaker
lesswrong.com·8h
🔄ONNX
Flag this post
Can-t stop till you get enough
cant.bearblog.dev·18h·
Discuss: Hacker News
📜TorchScript
Flag this post
Building Yantra: A Visual Workflow Automation Engine
patali.dev·9h·
Discuss: Hacker News
🤖Automation
Flag this post
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
arxiv.org·7h
🔄ONNX
Flag this post
I'm currently solving a problem I have with Ollama and LM Studio.
reddit.com·2d·
Discuss: r/LocalLLaMA
⏱️CUDA Events
Flag this post
A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·2h·
Discuss: Substack
ONNX Runtime
Flag this post
ParallelMind Engine: First AI System with Parallel Logical Reasoning (202+ problems/sec)
github.com·1d·
Discuss: r/programming
🤖AI Coding Tools
Flag this post
Physics informed machine learning based predictive control for intelligent operation of edge datacenters
sciencedirect.com·1d
🎓Model Distillation
Flag this post
Where to Buy or Rent GPUs for LLM Inference: The 2026 GPU Procurement Guide
bentoml.com·2d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·1d·
Discuss: Substack
Flash Attention
Flag this post
CEO Interview with Wilfred Gomes of Mueon Corporation
semiwiki.com·20h
Flash Attention
Flag this post
Cocoon from Telegram: A Decentralized AI Network That Pays GPU Owners in Crypto
decrypt.co·19h·
Discuss: Hacker News
🎯GPU Kernels
Flag this post