Feeds to Scour
SubscribedAll
Scoured 72710 posts in 554.2 ms
CUDA Programming: From Zero to GPU Kernels
pythongiant.github.io·21h·
Discuss: Hacker News
🎮WebGPU
Preview
Report Post
FlipFlop: A Static Analysis-based Energy Optimization Framework for GPU Kernels
arxiv.org·1d
🏛️Region-Based Memory
Preview
Report Post
meta-pytorch/segment-anything-fast: A batched offline inference oriented version of segment-anything
github.com·5m
🔥PyTorch
Preview
Report Post
Pushing the Packed SIMD Extension Over the Line: An Update on the Progress of Key RISC-V Extension
semiwiki.com·1d
📏Picolibc
Preview
Report Post
FlashAttention 4: Faster, Memory-Efficient Attention for LLMs
digitalocean.com·21h
🔄Hardware Transactional Memory
Preview
Report Post
**Abstract:** This research proposes a novel approach to dynamic resource allocation within CUDA Streaming Multiprocessors (SMs) to enhance performance and e...
freederia.com·2d
🧩mimalloc
Preview
Report Post
Why WebGPU Feels Like the Future of the Web (Live Demo 🚀)
dev.to·20h·
Discuss: DEV
🎮WebGPU
Preview
Report Post
Generalized Statistics on Lattices
link.aps.org·1d
📊Data Science
Preview
Report Post
ANN v3: 200ms p99 query latency over 100 billion vectors
turbopuffer.com·1d·
Discuss: Hacker News
🌊Memory Bandwidth
Preview
Report Post
Hardware-Aware Reformulation of Convolutions for Efficient Execution on Specialized AI Hardware: A Case Study on NVIDIA Tensor Cores
arxiv.org·1d
🔬Deep Learning
Preview
Report Post
The Story on ISPC (Intel SPMD Program Compiler)
pharr.org·1d·
Discuss: Hacker News
🚀Intel ISPC
Preview
Report Post
Streamlining CUB with a Single-Call API
developer.nvidia.com·11h
🧩mimalloc
Preview
Report Post
Raylib OpenDE physics and ragdolls
bedroomcoders.co.uk·11h·
💡Photon
Preview
Report Post
AI Systems Performance Engineering
github.com·8h·
Discuss: Hacker News
🧩mimalloc
Preview
Report Post
I Made Zig Compute 33 Million Satellite Positions in 3 Seconds. No GPU Required.
atempleton.dev·1d·
Discuss: Hacker News
🛣️Highway
Preview
Report Post
Optimizing GPU Programs from Java using Babylon and HAT
openjdk.org·2d·
🎮WebGPU
Preview
Report Post
NVIDIA's new N1X and N1 gaming laptop chips rumored for debut soon, will fight x86 processors
tweaktown.com·7h
Hardware Acceleration
Preview
Report Post
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·15h
Hardware Acceleration
Preview
Report Post
Production-Ready ML Projects: Why Structure Matters More Than Your Model
pub.towardsai.net
·4h
🚀MLOps
Preview
Report Post
FLUX.2 Klein Guide: Run 9B Models on 12GB VRAM
dev.to·2h·
Discuss: DEV
🦀Rayon
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help