⚡ SIMD
Vectorization, SSE, AVX, Intrinsics, Parallel Processing
Scoured 81758 posts in 310.9 ms
Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)
linuxjournal.com · 1d · Discuss: Hacker News
⚡ performance optimization
Running my kernel on real hardware
kamkow1lair.pl · 1h · Discuss: Hacker News
⚡ performance optimization
Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)
arxiv.org · 14h
⚡ performance optimization
The Prospero Challenge
mattkeeter.com · 3h
⚡ performance optimization
Faster AI Training Unlocked With New System For Massive Language Models
quantumzeitgeist.com · 5h
⚡ performance optimization
Mapping Gemma3 onto an Edge Dataflow Architecture
arxiv.org · 14h
⚡ performance optimization
Your VCL App: 4x to 11x Faster Math Performance with Elements
blogs.remobjects.com · 4h · Discuss: Hacker News
⚡ performance optimization
Hitting 1,000 tokens per second on a single RTX 5090
blog.alpindale.net · 20h · Discuss: Hacker News, Hacker News
⚡ performance optimization
STM32F4 - ARM Cortex-M4 High-Performance MCUs
st.com · 44m
⚡ performance optimization
Main Content || Math ∩ Programming
jeremykun.com · 20h
⚡ performance optimization
CPUs are Back: The Datacenter CPU Landscape in 2026
newsletter.semianalysis.com · 56m · Discuss: Hacker News
⚡ performance optimization
Concurrent vs. Parallel Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net · 13h
⚡ performance optimization
Quantized Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with Reynolds-Independent Bond Dimension
zenodo.org · 6h · Discuss: Hacker News
⚡ performance optimization
How Anam Achieved 250% Faster Inference Using Zymtrace Continuous GPU Profiling
zymtrace.com · 17h
⚡ performance optimization
NUASM — Neuro‑Universal‑ASM: The World's First Native Multi‑Language Assembler
dev.to · 1d · Discuss: DEV
⚡ performance optimization
The extraordinary GPU, from entertainment to supercomputer
jonpeddie.com · 3h
⚡ performance optimization
What should I program?
jamesmcm.github.io · 1d
⚙ systems programming
Gate-All-Around (GAA) Technology for Sustainable AI
semiwiki.com · 3h
⚙ systems programming
amirouche/seed: Adding `vau` with an immutable dynamic environment to Chez Scheme
github.com · 3h · Discuss: Hacker News
⚙ systems programming
Galaxy Wins “FPGA Professional Spot Service Provider” Award
eetimes.com · 11h
💹 trading systems