Multi-GPU Communication, Collective Operations, Distributed Training, AllReduce

Ranking LLMs based on 180k French votes (French government's AI arena)
comparia.beta.gouv.fr·21h·
Discuss: Hacker News
🛠Ml-eng
Flag this post
Meet Denario — An AI Assistant for Every Step of the Scientific Process
simonsfoundation.org·17h
🤖AI Coding Tools
Flag this post
Building a better testing experience for Workflows, our durable execution engine for multi-step applications
blog.cloudflare.com·21h
🤖AI Coding Tools
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.com·2d·
Discuss: Substack
📉Model Quantization
Flag this post
How to evaluate and benchmark Large Language Models (LLMs)
together.ai·1d
⏱️Benchmarking
Flag this post
Efficient Curvature-aware Graph Network
arxiv.org·1d
🔄ONNX
Flag this post
Dynamic Foveation Allocation via Reinforcement Learning for Perceptual Quality Maximization in VR Rendering
dev.to·2h·
Discuss: DEV
🏎️TensorRT
Flag this post
Weekly AI Startup Funding: October 26 - November 1, 2025
hackernoon.com·13h
🤖AI Coding Tools
Flag this post
Neural Green's Functions
arxiv.org·6h
📊Gradient Accumulation
Flag this post
PDE-SHARP: PDE Solver Hybrids Through Analysis & Refinement Passes
arxiv.org·1d
✂️CUTLASS
Flag this post
Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning
arxiv.org·6h
ONNX Runtime
Flag this post
AI Infrastructure as Code - Automating AI Model Deployment and Scaling in Cloud Environments
dev.to·1d·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Demo: Statistically Significant Results On Biases and Errors of LLMs Do Not Guarantee Generalizable Results
arxiv.org·6h
🚀MLOps
Flag this post
Radar Trends to Watch: November 2025
oreilly.com·23h
🤖AI Coding Tools
Flag this post
DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning
arxiv.org·6h
🔄ONNX
Flag this post
I'm working on a project I've been dreaming about for months and it feels good
github.com·12h·
Discuss: r/webdev
🤖AI Coding Tools
Flag this post
Spiking Neural Networks: The Next Leap in AI Power Efficiency by Arvind Sundararajan
dev.to·10h·
Discuss: DEV
ONNX Runtime
Flag this post
FLoC: Facility Location-Based Efficient Visual Token Compression for Long Video Understanding
arxiv.org·1d
🧩Attention Kernels
Flag this post
GrowthHacker: Automated Off-Policy Evaluation Optimization Using Code-Modifying LLM Agents
arxiv.org·1d
🤖AI Coding Tools
Flag this post
Dual 5090 work station for SDXL
reddit.com·23h·
Discuss: r/LocalLLaMA
🔧PTX
Flag this post