Memory Architecture, Performance, CPU Topology, Cache Locality

Feeds to Scour
SubscribedAll
Scoured 72640 posts in 1.05 s
End-to-End Transformer Acceleration Through Processing-in-Memory Architectures
arxiv.org·4h
🧠PIM
Preview
Report Post
SHADOW: Simultaneous Multi-Threading Architecture with Asymmetric Threads
danglingpointers.substack.com·1d·
Discuss: Substack
🧵Lightweight Threads
Preview
Report Post
ANN v3: 200ms p99 query latency over 100 billion vectors
turbopuffer.com·1d·
Discuss: Hacker News
🌊Memory Bandwidth
Preview
Report Post
A Quest to Find the Fastest Search Stack
dev.to·19h·
Discuss: DEV
🗄️Database Internals
Preview
Report Post
One ISA, Infinite Use Cases: RISC-V and the Road to Workload-Specific Silicon
riscv.org·13h
RISC-V
Preview
Report Post
Scalable Adaptive Memory Compiler Optimization via Multi-Objective Evolutionary Algorithms
dev.to·1d·
Discuss: DEV
🧩mimalloc
Preview
Report Post
Streamlining CUB with a Single-Call API
developer.nvidia.com·12h
🧩mimalloc
Preview
Report Post
A Novel Side-channel Attack That Utilizes Memory Re-orderings (U. of Washington, Duke, UCSC et al.)
semiengineering.com·15h
🔄Hardware Transactional Memory
Preview
Report Post
Memory Addressing and Memory Mapped I/O | by Tom Herbert | Jan, 2026
medium.com·2d
🗂️mmap
Preview
Report Post
On the Limits of Learned Importance Scoring for KV Cache Compression
arxiv.org·4h
Deoptimization
Preview
Report Post
I replaced my ChatGPT subscription with a 12GB GPU and never looked back
xda-developers.com·13h
🧩mimalloc
Preview
Report Post
Arctic Wolf’s Liquid Clustering Architecture Tuned for Petabyte Scale
databricks.com·15h
🎚️Tiered Storage
Preview
Report Post
FlashAttention 4: Faster, Memory-Efficient Attention for LLMs
digitalocean.com·21h
🔄Hardware Transactional Memory
Preview
Report Post
Polimi chip speeds up computing and drastically reduces energy consumption
polimi.it·2h·
Discuss: Hacker News
🧠PIM
Preview
Report Post
Phase-space engineering and collective dynamics in memcomputing
link.aps.org·2h
🎴SIMD Shuffles
Preview
Report Post
**Abstract:** This research proposes a novel approach to dynamic resource allocation within CUDA Streaming Multiprocessors (SMs) to enhance performance and e...
freederia.com·2d
🧩mimalloc
Preview
Report Post
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.com·2d
Hardware Acceleration
Preview
Report Post
Weird RAM issue
68kmla.org·1d
🔄Memory Disambiguation
Preview
Report Post
CUDA Programming: From Zero to GPU Kernels
pythongiant.github.io·22h·
Discuss: Hacker News
🎮SIMT Execution
Preview
Report Post
Every Mini PC & SFF Hardware Announced at CES 2026
williamlam.com·18h
Intel TSX
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help