Scour
🔄 SIMD Programming
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 20519 posts in 449.4 ms
Show HN: C discrete event SIM w stackful coroutines runs 45x faster than SimPy
github.com · 1d · Discuss: Hacker News
⚡ Glommio
Mitigating Staleness in Asynchronous Pipeline Parallelism via Basis Rotation
arxiv.org · 20h
⚡ Vectorized Execution
WebGPU Cameras
webgpufundamentals.org · 17h
🚀 Astral
Building a 24-bit Arcade CRT Display Adapter, From Scratch
scd31.com · 1d · Discuss: Lobsters, Hacker News
🚀 Modal
Qwen3 Coder Next 80B A3B: what it takes to run it locally
hardware-corner.net · 13h
🏗️ LLM Infrastructure
Sequential Attention: Making AI models leaner and faster without sacrificing accuracy
research.google · 10h
🧠 LLM Inference
Taking on Anthropic's Public Performance Engineering Interview Challenge
matthewtejo.substack.com · 23h · Discuss: r/programming
🪄 Prompt Engineering
Optimized LLM Inference Engines
rishirajacharya.com · 10h
🏗️ LLM Infrastructure
Fast Autoscheduling for Sparse ML Frameworks
ajroot.pl · 3h · Discuss: Hacker News
🕯️ Candle
“Parallelizing MCMC Across the Sequence Length”: This one is really cool.
statmodeling.stat.columbia.edu · 1d
⚡ Vectorized Execution
Anthropic's Performance Take-Home: A 65x Optimization (For Dummies)
ikot.blog · 1d · Discuss: Hacker News
🖥️ Hardware Architecture
Demystifying ARM SME to Optimize General Matrix Multiplications
news.ycombinator.com · 3d · Discuss: Hacker News
⚡ SIMD Optimization
The Launch of RISC-V Now! A New Chapter in Open Computing
semiwiki.com · 9h
🏛️ Computer Architecture History
A Tale of AI and Apples
arbinquiry.com · 1h · Discuss: Hacker News
📟 Terminals
Exploration of Unary Arithmetic-Based Matrix Multiply Units for Low Precision DL Accelerators
arxiv.org · 1d
⚡ Hardware Acceleration
open-simh/simh: The Open SIMH simulators package
github.com · 7h
🔄 Cache Coherence
Cracking the rules of gene regulation with experimental elegance and AI
phys.org · 9h
🔍 AI Interpretability
Converting data to hexadecimal outputs quickly
lemire.me · 2d · Discuss: Hacker News
🗜️ Vector Compression
ASCII Arts and Terminal User Interfaces • apatki.dev
apatki.dev · 7h · Discuss: r/reactjs
💻 CLI UX
Why Move To 2nm?
semiengineering.com · 17h
🔬 Chip Fabrication