Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Ranking LLMs based on 180k French votes (French government's AI arena)
comparia.beta.gouv.fr·1h·
Discuss: Hacker News
🛠Ml-eng
Flag this post
Cline: The Fastest Growing AI Open Source Project on GitHub in 2025, Thanks to You
cline.ghost.io·12h
🤖AI Coding Tools
Flag this post
NocoBase 2.0: Meet Your AI Employees
dev.to·23h·
Discuss: DEV
ONNX Runtime
Flag this post
Augmenting learning in neuro-embodied systems through neurobiological first principles
arxiv.org·9h
📊Gradient Accumulation
Flag this post
Beyond Standard LLMs
magazine.sebastianraschka.com·1h
👁️Attention Optimization
Flag this post
eBPF Tutorial by Example: Monitoring GPU Driver Activity with Kernel Tracepoints
dev.to·7h·
Discuss: DEV
⏱️CUDA Events
Flag this post
Why stop at 1M tokens when you can have 10M?
news.ycombinator.com·2h·
Discuss: Hacker News
Flash Attention
Flag this post
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
arxiv.org·9h
ONNX Runtime
Flag this post
Hybrid Quantum-Classical Optimization of the Resource Scheduling Problem
arxiv.org·9h
📈Occupancy Optimization
Flag this post
Deploy an LLM inference service on OpenShift AI
developers.redhat.com·1d
ONNX Runtime
Flag this post
🛡️ Fortify - AI-Powered Security Analysis Platform
dev.to·19h·
Discuss: DEV
💡LSP
Flag this post
Disciplined Biconvex Programming
arxiv.org·9h
📉Model Quantization
Flag this post
Observability Made Easy: How AI & OpenTelemetry Tame Tool Sprawl
dev.to·9h·
Discuss: DEV
🐕Ruff
Flag this post
OpenAI and NVIDIA Team Up for Massive AI Infrastructure Deployment
dev.to·1d·
Discuss: DEV
🔍Nsight
Flag this post
Building a Digital Twin Office with Unity, WebGL, and AI — The Workflow Simulator Project
dev.to·2h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
AndesVL Technical Report: An Efficient Mobile-side Multimodal Large LanguageModel
paperium.net·2d·
Discuss: DEV
🏎️TensorRT
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·9h
🏎️TensorRT
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
Flash Attention
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·9h
✂️CUTLASS
Flag this post