⚡ SIMD Optimization
AVX-512, Vectorization, Loop Unrolling, Auto-vectorization
Scoured 27876 posts in 1.73 s
Benchmarking Claude C Compiler
dineshgdk.substack.com · 20h · Discuss: Substack, r/programming
🧮 Compute Optimization
DeltaKV: Residual-Based KV Cache Compression via Long-Range Similarity
arxiv.org · 22h
🔬 RaBitQ
Opus 4.6 Reasoning Distill 3k prompts
huggingface.co · 19h · Discuss: r/LocalLLaMA
🧮 SMT Solvers
Show HN: FastLog: 1.4 GB/s text file analyzer with AVX2 SIMD
github.com · 4d · Discuss: Hacker News
⚡ Vectorized Execution
Parallel Track Transformers: Enabling Fast GPU Inference with Reduced Synchronization
machinelearning.apple.com · 1d
📦 Batch Embeddings
Vector search using only Parquet and DataFusion
blog.xiangpeng.systems · 6h
🐘 pgvector
Series-Parallel-Loop Decompositions of Control-flow Graphs
arxiv.org · 22h
🧮 Compute Optimization
Beyond the Hype: Why Machine Learning is the Strategic Backbone of Modern AI
pub.towardsai.net · 7h
🏆 LLM Benchmarking
Building a Regex Engine with a team of parallel Claudes
lesswrong.com · 3h
🔍 RegEx Engines
Rewrote my Node.js data generator in Rust. 20x faster, but the 15MB binary (vs 500MB node_modules) is the real win.
algomimic.com · 5h · Discuss: r/rust
🏹 Apache Arrow
Interesting things about the Lua interpreter
thesephist.com · 6h
🪄 Prompt Engineering
Testing a 6200 and comparison with 6100
68kmla.org · 22h
🔮 Prefetching
Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)
linuxjournal.com · 2d · Discuss: Hacker News
🖥️ Hardware Architecture
Backtracking Algorithms
algos.khourani.com · 12h
🌸 Bloom Filters
Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy
developer.nvidia.com · 1d
🏗️ LLM Infrastructure
Your VCL App: 4x to 11x Faster Math Performance with Elements
blogs.remobjects.com · 1d · Discuss: Hacker News
🏹 Apache Arrow
PC Services Optimizer 5.0.1682
majorgeeks.com · 1d
⚡ Systems Performance
How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling
zymtrace.com · 2d
⚡ Hardware Acceleration
Grouped Blockscaled Gemm
veitner.bearblog.dev · 3d
⚡ Glommio
tmilovan/composite-machine: Composite Machine: Automatic Calculus via Dimensional Arithmetic
github.com · 1d · Discuss: Hacker News
🧮 SMT Solvers