🐿️ Scour
🚀 CUDA Kernels
GPU Programming, Memory Optimization, Parallel Computing, Performance Tuning
Cursor: 1.5x Faster MoE Training on Blackwell with MXFP8 Kernels
cursor.com · 21h · Discuss: Hacker News · 🔱 Triton
How to Give Your RTX 4090 Nearly Infinite Memory for LLM Inference
medium.com · 1d · Discuss: Hacker News · 🔧 Hardware
H100 vs GB200 NVL72 Training Benchmarks – Power, TCO, and Reliability Analysis, Software Improvement Over Time
semianalysis.com · 14h · Discuss: Hacker News · 🔱 Triton
Show HN: Randomly switching between LMs at every step boosts SWE-bench score
swebench.com · 4h · Discuss: Hacker News · 🔱 Triton
Vulkan: Continuing to Forge Ahead
khronos.org · 1d · Discuss: Hacker News · 🔱 Triton
guide : running gpt-oss with llama.cpp
github.com · 2d · Discuss: r/LocalLLaMA · 🔧 Hardware
ROG Matrix GeForce RTX 5090
rog.asus.com · 1d · Discuss: Hacker News · 🔧 Hardware
Nvidia B200 vs. H100 performance compared with GPT-OSS
clarifai.com · 2d · Discuss: Hacker News · ⚡ GPU
Compute Where It Counts: a trainable LLM sparsity enabling 4x CPU speed
crystalai.org · 38m · Discuss: Hacker News · 👁️ Computer vision
flow-run: LLM Orchestration, Prompt Testing & Cost Monitoring
vitaliihonchar.com · 1d · Discuss: r/golang, r/programming · ⏱️ Real-time Systems
Anno 1800 Frame Analysis
blog.thomaspoulet.fr · 20h · Discuss: Hacker News · 🎨 Neural Rendering
Fuzzing Hardware Like Software (2021)
arxiv.org · 23h · Discuss: Hacker News · 🔧 Hardware
Building a Distributed Filesystem for Scalable Research
hudsonrivertrading.com · 5h · Discuss: Hacker News · 🔱 Triton
Pinned Device Memory Patches For Intel's Multi-GPU "Project Battlematrix" Linux Efforts
phoronix.com · 1d · Discuss: Hacker News · 🔧 Hardware
MoE optimization idea (VRAM/RAM)
preview.redd.it · 3d · Discuss: r/LocalLLaMA · 🔧 Hardware
Nvidia CUDA Quantum
github.com · 2d · Discuss: Hacker News · 🔧 Hardware
Show HN: I built a toy TPU that can do inference and training on the XOR problem
tinytpu.com · 1d · Discuss: Hacker News · 🔌 FPGAs
Building a Carbon and Price-Aware Kubernetes Scheduler [audio]
kube.fm · 3h · Discuss: Hacker News · 🔱 Triton
The Drivers of HRM's Performance on ARC-AGI
arcprize.org · 5h · Discuss: Hacker News · 🏋️ Isaac Gym
Escaping the Steamcar Era of AI
speakez.tech · 2d · Discuss: Hacker News · 📱 Edge AI