💾 Cache Algorithms
LRU, Cache Coherence, Memory Hierarchy, Performance
Scoured 121205 posts in 1.50 s
Distributed Hybrid Parallelism for Large Language Models: Comparative Study and System Design Guide
arxiv.org · 1d · ⚡ Parallel Parsing
AI Inference Needs A Mix-And-Match Memory Strategy
semiengineering.com · 2h · 🗺️ Region Inference
Predicting Future Utility: Global Combinatorial Optimization for Task-Agnostic KV Cache Eviction
arxiv.org · 2d · 🔮 CPU Branch Prediction
RCBldEng - a computationally efficient RC network modeling method and engine for multizone building simulation
sciencedirect.com · 20h · 🌪️ V8 TurboFan
UbiquitousLearning/mllm: Fast Multimodal LLM on Mobile Devices
github.com · 1h · 🏗️ MLIR
Performance Tip of the Week #83: Reducing memory indirections
abseil.io · 4d · 📦 Compact Data
Two Ways to Move Tensors Without Stopping: Inside vLLM's Async GPU Transfer Patterns
dev.to · 13h · Discuss: DEV · 💾 Zero-Copy
Researchers propose a self-distillation fix for ‘catastrophic forgetting’ in LLMs
infoworld.com · 7m · 🪜 Recursive Descent
[News] SK hynix Unveils AI Chip Architecture with HBF, Reportedly Boosts Performance per Watt by Up to 2.69×
trendforce.com · 8h · Discuss: r/hardware · 📦 Compact Data
A Local Code Copilot for Edits: Why sweep-next-edit-1.5B Is Built for Speed
hackernoon.com · 1d · ⚡ Interpreter Optimization
AI agent sandboxing in 2026: how to choose between primitives, runtimes, and platforms
manveerc.substack.com · 15h · Discuss: Substack · 🛡️ Capability VMs
Scheduling in a changing world: Maximizing throughput with time-varying capacity
research.google · 23h · ⏲️ Embedded GC
Show HN: Latent-k – Persistent dependency map to reduce AI coding token usage
latentk.org · 21h · Discuss: Hacker News · 🗺️ Region Inference
How Memory Technology Is Powering the Next Era of Compute
semiwiki.com · 16h · 🧠 Memory Models
Uncached buffered IO [LWN.net]
lwn.net · 1d · 🔗 Weak References
OSTEP Chapter 8
muratbuffalo.blogspot.com · 1d · Discuss: Blogger · 📡 Erlang BEAM
[Repost] Integration contemplation
danq.me · 12h · 💾 Persistent Heaps
Quick Stack Tiedown
artlu.bearblog.dev · 15h · 🚂 Cranelift IR
AFMTJ Model For In-Memory Computing (University of Arizona)
semiengineering.com · 1d · 🧠 Memory Models
LLM Performance in Astro, React, Tailwind and Cloudflare
10xbench.ai · 1d · Discuss: Hacker News · ⚡ Performance