🧩 Memory Interleaving
Bank Interleaving, Memory Controllers, DRAM Access, Parallelism
Scoured 184918 posts in 25.2 ms
TLX: Hardware-Native, Evolvable MIMW GPU Compiler for Large-scale Production Environments
⚡ Hardware Acceleration · arxiv.org · 15h
The M:N Concurrent Model — A Complete Guide. From First Principles to Production Schedulers
🧵 Lightweight Threads · 0xkiire.com · 3d · Hacker News, r/golang, r/rust
Why gRPC Is Fast: The Real Reason Is HTTP/2, Not Just Protobuf
🔌 gRPC · javarevisited.substack.com · 2d · r/programming
Knowledge gaps for neuromorphic ionic computing
🧮 Intel MKL-DNN · science.org · 5d
Regulating Branch Parallelism in LLM Serving
🧵 OpenMP · arxiv.org · 1d
FractalSortCPU: Bandwidth-Efficient Compressed Radix Sort on CPU
📋 Columnar Storage · arxiv.org · 15h
Data Path Fusion in GPU for Analytical Query Processing
📊 Vectorized Query Execution · arxiv.org · 15h
HexiSeq: Accommodating Long Context Training of LLMs over Heterogeneous Hardware
🔄 Hardware Transactional Memory · arxiv.org · 1d
Surviving Partial Rank Failures in Wide Expert-Parallel MoE Inference
🧩 mimalloc · arxiv.org · 15h
Stencil Computations on Cerebras Wafer-Scale Engine
🌀 Naiad · arxiv.org · 1d
Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems
🚀 Intel ISPC · arxiv.org · 4d · Hacker News
A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture
⚛️ Quantum Computing · arxiv.org · 15h
On Similarity of Computational Kernels in our Codes and Proxies
🧮 Vector Databases · arxiv.org · 1d
Unleashing Scalable Context Parallelism for Foundation Models Pre-Training via FCP
🤖 TVM · arxiv.org · 15h
Piper: Efficient Large-Scale MoE Training via Resource Modeling and Pipelined Hybrid Parallelism
🧩 mimalloc · arxiv.org · 5d
An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
🔬 Deep Learning · arxiv.org · 1d
EnergyLens: Interpretable Closed-Form Energy Models for Multimodal LLM Inference Serving
🤖 TVM · arxiv.org · 15h
TAD: Temporal-Aware Trajectory Self-Distillation for Fast and Accurate Diffusion LLM
🤖 TVM · arxiv.org · 15h
CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure
🚀 Performance · arxiv.org · 4d · Hacker News
Enhancing Performance Insight at Scale: A Heterogeneous Framework for Exascale Diagnostics
📊 Extrae · arxiv.org · 6d