Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🔲 CPU Architecture
Specific
microarchitecture, instruction set, pipeline, RISC, x86
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
234
posts in
16.6
ms
GhostServe
: A Lightweight
Checkpointing
System in the Shadow for Fault-Tolerant LLM Serving
🔁
Cache Coherence
arxiv.org
·
1d
A Treasure
Trove
of Performance: Analyzing the
IO500
Submission Data
⚡
Performance
arxiv.org
·
1d
Exploring the Efficiency of
3D-Stacked
AI Chip Architecture for LLM Inference with
Voxel
⚡
Hardware Acceleration
arxiv.org
·
6d
PipeMax
: Enhancing Offline LLM Inference on
Commodity
GPU Servers
⚡
Vectorized Execution
arxiv.org
·
1d
Caliper-in-the-Loop
: Black-Box Optimization for
Hyperledger
Fabric Performance Tuning
⚡
Low-Latency Systems
arxiv.org
·
1d
Predictive
Multi-Tier Memory Management for
KV
Cache in Large-Scale GPU Inference
📦
CPU Caches
arxiv.org
·
5d
Stochastic
Sparse
Attention for Memory-Bound Inference
⚡
Vectorized Execution
arxiv.org
·
1d
FACT:
Compositional
Kernel
Synthesis
with a Three-Stage Agentic Workflow
🧮
Compute Optimization
arxiv.org
·
6d
Replication
in Graph
Partitioning
and Scheduling Problems
🌐
Distributed Systems
arxiv.org
·
2d
Lightweight
Tamper-Evident
Log Integrity Verification for IoT Edge Environments: A Merkle Tree Pipeline with Adaptive Chunking
🌸
Bloom Filters
arxiv.org
·
2d
Exploring Sparse Matrix
Multiplication
Kernels on the
Cerebras
CS-3
🧮
Compute Optimization
arxiv.org
·
5d
On the
Distortion
of
Partitioning
Performance by Random Quantum Circuits
🧠
NUMA
arxiv.org
·
1d
Efficient Training on Multiple Consumer
GPUs
with
RoundPipe
🖥️
GPU Computing
arxiv.org
·
5d
Efficient,
VRAM-Constrained
xLM
Inference on Clients
⚡
SIMD
arxiv.org
·
6d
Near-Optimal
Privacy-Preserving
Learning for Max-Min Fair Multi-Agent
Bandits
🤝
Consensus Algorithms
arxiv.org
·
1d
RaMP: Runtime-Aware
Megakernel
Polymorphism
for Mixture-of-Experts
♟️
Chess Engines
arxiv.org
·
6d
A Semantic Quantum
Circuit
Cache for Scalable and Distributed
Quantum-Classical
Workflows
🔁
Cache Coherence
arxiv.org
·
6d
DAK
: Direct-Access-Enabled GPU Memory
Offloading
with Optimal Efficiency for LLM Inference
⚡
DMA
arxiv.org
·
6d
MARS: Efficient, Adaptive
Co-Scheduling
for
Heterogeneous
Agentic Systems
🔄
Coroutines
arxiv.org
·
5d
A Study on the Performance of Distributed Training of Data-driven
CFD
Simulations
📐
Data-Oriented Design
arxiv.org
·
5d
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help