Feeds to Scour
SubscribedAll
Scoured 82840 posts in 1.65 s
WritePolicyBench: Benchmarking Memory Write Policies under Byte Budgets
arxiv.org·8h
📈Occupancy Optimization
Preview
Report Post
Stratum: Architecting a Configurable Cache Simulator with C++ and Racket
thecloudlet.github.io·2d·
Discuss: Hacker News
🧠CPU Architecture
Preview
Report Post
Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
arxiv.org·8h
🌊CUDA Streams
Preview
Report Post
The Heartbeat of Tetris 🟥🟥🟥🟥: What a 1x1 Pixel Taught Me About Concurrency
qianarthurwang.substack.com·20h·
Discuss: r/programming
CUDA Programming Patterns
Preview
Report Post
How Virtual Textures Really Work
shlom.dev·1h·
Discuss: Hacker News
📈GPU Occupancy
Preview
Report Post
Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy
github.com·21h·
Discuss: Hacker News
⏱️CUDA Events
Preview
Report Post
Millets: A practical memory-safety and thread-safety experiment
eagledot.xyz·1d·
⚙️Systems Programming
Preview
Report Post
Go Deep Dive: Mutex vs RWMutex
dev.to·1h·
Discuss: DEV
CUDA Programming Patterns
Preview
Report Post
ML for Energy-Performance-Aware Scheduling On Heterogeneous Multicore Architectures (Cambridge)
semiengineering.com·1d
📈Occupancy Optimization
Preview
Report Post
Semantic LLM Cache: Vector-Based Caching for Java (Spring Boot)
dev.to·5h·
Discuss: DEV
🏗️Build Optimization
Preview
Report Post
Diffusion LLM Sampling Achieves 70% Latency Reduction With Novel NPU Design
quantumzeitgeist.com·2d
🎯Tensor Cores
Preview
Report Post
slow abstraction
steel-water.bearblog.dev·6h
🐕Ruff
Preview
Report Post
Demystifying ARM SME to Optimize General Matrix Multiplications
news.ycombinator.com·2d·
Discuss: Hacker News
🔄SIMD Programming
Preview
Report Post
WebGPU Cameras
webgpufundamentals.org·5h
🎮NVIDIA
Preview
Report Post
Using Nsight Compute with large codebases - Part 2 : Profiling large code bases
blog.ncompass.tech·21h·
Discuss: Hacker News
🔍Nsight
Preview
Report Post
**Abstract:** This paper introduces a novel approach to stabilizing simulated spacetime geometries in high-performance computing environments by leveraging h...
freederia.com·1h
✂️CUTLASS
Preview
Report Post
Linear-time classical approximate optimization of cubic-lattice classical spin glasses
link.aps.org·1d
🔀Operator Fusion
Preview
Report Post
Intel attacks the workstation segment with Xeon 600 featuring up to 86 cores and a new platform
igorslab.de·8h
🧠CPU Architecture
Preview
Report Post
Claude Code's renderer is more complex than a game engine
spader.zone·1d·
Discuss: Hacker News
📈GPU Occupancy
Preview
Report Post
Anthropic's Performance Take-Home: A 65x Optimization (For Dummies)
ikot.blog·23h·
Discuss: Hacker News
🎛️CUDA Optimization
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help