🐿️ Scour
Browse
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🚀 CUDA Kernels
GPU Programming, Memory Optimization, Parallel Computing, Performance Tuning
Hot
Past Hour
Today
This Week
This Month
Subscribed Feeds
All Feeds
Show HN: We Built a Serverless GPU Platform with Fast Cold Starts
dat1.co
·
4h
·
Discuss:
Hacker News
🔱
Triton
Custom CUDA kernels for small-batch ML on GTX 1650: Memory hierarchy optimization and vectorization techniques
reddit.com
·
5d
·
Discuss:
r/programming
🔱
Triton
ik_llama.cpp and Qwen 3 30B-A3B architecture.
reddit.com
·
9h
·
Discuss:
r/LocalLLaMA
🔱
Triton
Skimpy HBM Memory Opens Up The Way AI Inference Memory Godbox
nextplatform.com
·
1d
·
Discuss:
Hacker News
🔧
Hardware
A lightweight library for portable low-level GPU computation using WebGPU
github.com
·
5d
·
Discuss:
Hacker News
🔱
Triton
AMD Ryzen Threadripper 9980X and 9970X Review: Zen 5 Powers Gains
storagereview.com
·
19h
·
Discuss:
Hacker News
🔧
Hardware
Maybe the Fastest Disk Usage Program on macOS
healeycodes.com
·
17h
·
Discuss:
Hacker News
🦀
Rust
Kaizen (YC X25) Is Hiring Engineers to Build Browser Agents That Work
ycombinator.com
·
39m
·
Discuss:
Hacker News
📱
Edge AI
How Judoscale's Utilization-Based Autoscaling Works
judoscale.com
·
2h
·
Discuss:
Hacker News
⏱️
Real-time Systems
Sub-millisecond GPU Task Queue: Optimized CUDA Kernels for Small-Batch ML Inference on GTX 1650
reddit.com
·
5d
·
Discuss:
r/programming
🔱
Triton
Why build a domain-specific agent for front end tasks?
kombai.com
·
3h
·
Discuss:
Hacker News
🤖
llm
AMD's Ryzen AI MAX+ Processors Now Offer a Whopping 96 GB Memory for Consumer Graphics, Allowing Gigantic 128B-Parameter LLMs to Run Locally on PCs
wccftech.com
·
1d
·
Discuss:
r/LocalLLaMA
🔧
Hardware
Nvidia-backed startup invents Ethernet memory pool to help power AI — claims it can add up to 18TB of DDR5 capacity for large-scale inference workloads and redu...
tomshardware.com
·
1d
·
Discuss:
Hacker News
🔧
Hardware
MethodHandles And Bad Benchmarks
github.com
·
1d
·
Discuss:
r/programming
🐞
Debugging
LLGuidance: Making Structured Outputs Go Brrr
guidance-ai.github.io
·
5h
·
Discuss:
Hacker News
🤖
llm
How Long Before Superintelligence?
nickbostrom.com
·
23m
·
Discuss:
Hacker News
🔧
Hardware
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
arxiviq.substack.com
·
6h
·
Discuss:
Substack
🤖
llm
Poor’s Man Shaders
nullonerror.org
·
2d
·
Discuss:
Hacker News
,
Hacker News
🎨
Neural Rendering
Not everything needs GPT. Sometimes a simple equation will do
danielball.com
·
4h
·
Discuss:
Hacker News
📱
Edge AI
tcmalloc's Temeraire: A Hugepage-Aware Allocator
paulcavallaro.com
·
3d
·
Discuss:
Hacker News
,
r/compsci
,
r/programming
🦀
Rust
Loading...
Loading more...
Page 2 »