Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Grafana Mimir 3.0 release: performance improvements, a new query engine, and more
grafana.com·1d
🏗️Build Optimization
A self-rewriting AI from KAUST revives Jürgen Schmidhuber’s vision of a Gödel Machine
the-decoder.com·11h
ONNX Runtime
Benchmarking Large Language Models and Privacy Protection
priv.gc.ca·22h
ONNX Runtime
EP187: Why is DeepSeek-OCR such a BIG DEAL?
blog.bytebytego.com·2d
🤖AI Coding Tools
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model
paperium.net·2d·
Discuss: DEV
🏎️TensorRT
Building Yantra: A Visual Workflow Automation Engine
patali.dev·23h·
Discuss: Hacker News
🤖Automation
A behind-the-scenes look at Broadcom’s design labs
techbrew.com·6h
⏱️CUDA Events
Where to Buy or Rent GPUs for LLM Inference: The 2026 GPU Procurement Guide
bentoml.com·3d·
Discuss: Hacker News
🎯GPU Kernels
Storage news ticker – 3 November 2025
blocksandfiles.com·9h
🤖AI Coding Tools
🛡️ Fortify - AI-Powered Security Analysis Platform
dev.to·8h·
Discuss: DEV
💡LSP
OpenAI and NVIDIA Team Up for Massive AI Infrastructure Deployment
dev.to·1d·
Discuss: DEV
🔍Nsight
Torchforge – a PyTorch native library for scalable RL post-training
pytorch.org·4d·
Discuss: Hacker News
📜TorchScript
Doo: A Simple, Fast Programming Language Built on Rust and LLVM
news.ycombinator.com·18h·
Discuss: Hacker News
🐕Ruff
Small Vs. Large Language Models
semiengineering.com·19h·
Discuss: Hacker News, r/LLM
Flash Attention
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·12h·
Discuss: r/cpp
🏎️TensorRT
AI-Driven Biomarker Discovery for Accelerated Orphan Drug Development
dev.to·1d·
Discuss: DEV
🔄ONNX
The best AI inference for your project. Blazing fast responses.
dev.to·3h·
Discuss: DEV
Flash Attention
I repurposed my old GPU for self-hosted AI and it changed my life
xda-developers.com·11h
🤖AI Coding Tools
Generation at the Speed of Thought: Speculative Decoding
bittere.substack.com·1d·
Discuss: Substack
Flash Attention
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·1d
🧠CPU Architecture