Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⏩ SIMD
Vectorization, Parallel Processing, Performance, CPU Instructions
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
74292
posts in
553.0
ms
Heterogeneous
Processing: A Strategy for
Augmenting
Moore's Law (2006)
linuxjournal.com
·
1d
·
Discuss:
Hacker News
⚡
Hardware Acceleration
Protean
Compiler: An
Agile
Framework to Drive Fine-grain Phase Ordering
arxiv.org
·
10h
📊
Profile-Guided Optimization
How
Anam
Achieved 250% Faster Inference Using
Zymtrace
Continuous GPU Profiling
zymtrace.com
·
13h
🎮
SIMT Execution
Main
Content ||
Math
∩ Programming
jeremykun.com
·
16h
📊
Algorithms
Building
Scalable
AI Applications: Architecture
Patterns
That Actually Work
dev.to
·
1d
·
Discuss:
DEV
🎭
Program Synthesis
Faster
AI Training
Unlocked
With New System For Massive Language Models
quantumzeitgeist.com
·
1h
🤖
TVM
Vector
Databases
Explained: Architecture and System Design for AI Apps
dev.to
·
9h
·
Discuss:
DEV
🧮
Vector Databases
Your
VCL
App: 4x to 11x Faster Math Performance with
Elements
blogs.remobjects.com
·
53m
·
Discuss:
Hacker News
🛣️
Highway
Concurrent
vs.
Parallel
Execution in LLM API Calls: From an AI Engineer’s Perspective
pub.towardsai.net
·
9h
⏰
Timely Dataflow
Implementing
vector
accu.org
·
1d
·
Discuss:
r/cpp
📌
Pin Projection
Dirk
Eddelbuettel
:
chronometre
: A new package (pair) demo for R and Python
dirk.eddelbuettel.com
·
21h
🐻❄️
Polars
CUDA
Guide:
Workflow
for Performance Tuning
digitalocean.com
·
4d
🎮
SIMT Execution
How AI coding makes
developers
56% faster and 19%
slower
thenewstack.io
·
3h
🎭
Program Synthesis
What should I program?
jamesmcm.github.io
·
1d
🦀
Rust
My Claude Code
workflow
invertedpassion.com
·
2h
🔨
Incremental Compilation
Writing a
ONNX
Neural Network Inference Engine from Scratch in C to run image classification with
MobileNetV2
flexw.github.io
·
20h
·
Discuss:
r/C_Programming
📱
Edge AI
Hitting
1,000
tokens
per second on a single RTX 5090
blog.alpindale.net
·
16h
·
Discuss:
Hacker News
📍
CPU Pinning
Quantized
Tensor Train Compression For Turbulent Flow Simulation: O(log N) Scaling with
Reynolds-Independent
Bond Dimension
zenodo.org
·
2h
·
Discuss:
Hacker News
⏲️
TimescaleDB Compression
Accelerate your discovery by
parallelizing
experiments
magellink.com
·
22h
·
Discuss:
Hacker News
🌀
Naiad
More
cores
didn't improve my
workflow
xda-developers.com
·
1d
🚀
Performance
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help