🔗 NCCL - miterion · Scour

RL over Commodity Networks: Overcoming the Bandwidth Barrier with Lossless Sparse Deltas

arxiv.org·1d

🏎️TensorRT

Sponsored: How AI is redefining high-density computing in 2026

datacenterdynamics.com·7h

📊CUDA Graphs

Custom Kernels for All from Codex and Claude

huggingface.co·1d·

Discuss: Hacker News

🎯GPU Kernels

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·1d·

Discuss: Hacker News

🎯Tensor Cores

harishsg993010/tiny-NPU: opensource NPU for LLM inference (this run gpt2)

github.com·1d·

Discuss: r/LocalLLaMA

⚡ONNX Runtime

BalatroBench Benchmarks Large Language Models Playing Balatro

balatrobench.com·1d·

Discuss: Hacker News

⚡ONNX Runtime

Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization

machinelearning.apple.com·4d

⏱️CUDA Events

Arming the rebels with GPUs: Gradium, Kyutai, and Audio AI

amplifypartners.com·1d·

Discuss: Hacker News

🏎️TensorRT

Building a Production ML Inference Stack with KServe, vLLM, and Karmada

dev.to·1d·

Discuss: DEV

Show HN: Darius – An AI router that selects the best model for each prompt

withdarius.com·17h·

Discuss: Hacker News

🤖AI Coding Tools

CL API: Real-Time Closed-Loop Interactions with Biological Neural Networks

arxiv.org·1d·

Discuss: Hacker News

⚡ONNX Runtime

How low-bit inference enables efficient AI

dropbox.tech·4h·

Discuss: Hacker News

🎯Tensor Cores

Replace MCP With CLI , The Best AI Agent Interface Already Exists | by Cobus Greyling | Feb, 2026

cobusgreyling.medium.com·5h

Show HN: Kintsugi – A desktop app for reviewing Claude Code sessions

events.sonarsource.com·20h·

Discuss: Hacker News

🤖AI Coding Tools

🎲 Fine-Tuning an AI

zwischenzugs.com·1h

📜TorchScript

Powering AI Centers with AI Spines

blogs.arista.com·1d

🧩Attention Kernels

CUDA Shared Memory Bank Conflict-Free Vectorized Access

leimao.github.io·1d

🎛️CUDA Optimization

AI usage in popular open source projects

tirkarthi.github.io·7h·

Discuss: Hacker News, r/programming

🤖AI Coding Tools

Power of Agent assisted coding and learning to achieve goals faster and cheaper

osm2pgsql.org·1h·

Discuss: DEV

🤖AI Coding Tools

Show HN: PolyMCP – Orchestrate AI agents across Python tools and MCP servers

news.ycombinator.com·23h·

Discuss: Hacker News

Loading more...