Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Low-latency
SIMD, vectorization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
121958
posts in
1.99
s
Scaling llama.cpp On
Neoverse
N2: Solving
Cross-NUMA
Performance Issues
semiengineering.com
·
14h
🚀
Performance
PRISM
: Parallel
Residual
Iterative Sequence Model
arxiv.org
·
17h
⚡
SIMD Optimization
datavorous/spheni
: An in-memory vector search library in C++ with Python bindings
github.com
·
1d
·
Discuss:
Hacker News
⚡
SIMD Optimization
Training-Free Real-Time Control for
Autoregressive
Video Generation
daydream.live
·
7h
·
Discuss:
Hacker News
⚡
SIMD Optimization
Block encoding of sparse
matrices
with a periodic
diagonal
structure
arxiv.org
·
17h
⚡
SIMD Optimization
A
RISC-V
vector
extension primer
blog.adafruit.com
·
6h
⚡
SIMD Optimization
Discussion - Investigation of Single Thread CPU "
Thoughput/cycle
"
forums.anandtech.com
·
23h
🖥️
CPU Microarchitecture
Supercharging
Inference for AI Factories: KV Cache
Offload
as a Memory-Hierarchy Problem
blog.min.io
·
7h
🏗️
System Design
Two Ways to Move
Tensors
Without Stopping: Inside
vLLM
's Async GPU Transfer Patterns
dev.to
·
1d
·
Discuss:
DEV
🚀
Performance
Show HN: We Made Nasdaq
Parsing
Even Faster (and More
Reliable
)
lunyn.com
·
2h
·
Discuss:
Hacker News
🚀
Performance
borodark/exmc
: Probabilistic programming in BEAM
github.com
·
1d
⚡
SIMD Optimization
TileIR
ianbarber.blog
·
18h
·
Discuss:
Hacker News
🚀
Performance
Zvec
: SQLite-like
simplicity
in an embedded vector database (By Alibaba)
zvec.org
·
9h
·
Discuss:
Hacker News
📊
Vector Database
Nvidia’s new
technique
cuts LLM reasoning costs by 8x without losing
accuracy
venturebeat.com
·
32m
🚀
Performance
EyesOff
: Why Some Models
Quantize
Better Than Others
ym2132.github.io
·
23h
·
Discuss:
Hacker News
⚡
SIMD Optimization
Minimum
Energy Per
Query
semiengineering.com
·
14h
⚙️
Systems Programming
Floating
bus
technical
guide
k1.spdns.de
·
7h
🐝
eBPF
Show HN: Latent-k –
Persistent
dependency
map to reduce AI coding token usage
latentk.org
·
1d
·
Discuss:
Hacker News
⚙️
Systems Programming
Porting an INT8 VHDL CNN from Intel
Agilex
3 to Lattice
Certus-NX
news.ycombinator.com
·
9h
·
Discuss:
Hacker News
🖥️
CPU Microarchitecture
Memgraph
3.8 is Out: Atomic
GraphRAG
+ Vector Single Store With Major Performance Upgrades
memgraph.com
·
4h
·
Discuss:
Hacker News
🚀
Performance
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help