🔄 SIMD Programming
Specific
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 146370 posts in 23.1 ms
themankindproject/simd-bp128-rs: High-performance SIMD-BP128 integer compression library for Rust with scalar, SSE4.1, AVX2, and AVX-512 backends.
✂️ CUTLASS · github.com · 5d
Optimising a Pipelined RISC-V Core: From Naive Pipeline to Near-Superscalar Performance
🧠 CPU Architecture · mummanajagadeesh.github.io · 2d · Lobsters, Hacker News
Anytime Analysis on BinVal: Adaptive Parameters Help
⚡ ONNX Runtime · arxiv.org · 16h
Compression technique makes AI models leaner and faster while they're still learning
📉 Model Quantization · techxplore.com · 1h
SOTA Normalization Performance with Torch.compile
⚡ torch.compile · pytorch.org · 1d · Hacker News
High-Performance RK3568 SOM Module for Industrial and AI App
🧠 CPU Architecture · hackster.io · 13h
Low-Rank Key Value Attention: Reducing KV Cache Memory and Maintaining Head Diversity
👁️ Attention Optimization · fin.ai · 3h · Hacker News
Portability and the Road Ahead
🔧 PTX · modular.com · 6d
the value of a performance oracle
📊 Profiling Tools · wingolog.org · 2d · Lobsters, Hacker News
Architecting Intelligence: The Rise of RISC-V CPUs in Agentic AI Infrastructure
🔧 PTX · semiwiki.com · 7h
Advancing AI performance with HBM4, SPHBM4 DRAM solutions
⚡ Flash Attention · edn.com · 5h
Super Micro Computer's Rack-Scale AI Push: A New Growth Catalyst?
🧠 CPU Architecture · finance.yahoo.com · 21h
Intel introduces its own Neural Compression technology with a fallback mode that works on GPUs without dedicated AI cores — early performance is on the level of Nvidia NTC
⚡ Flash Attention · tomshardware.com · 2d
Mojo Programming Language: Architecture, Performance, and AI Reality
💡 LSP · krun.pro · 3d · DEV
AMD finally puts a price tag on the Ryzen 9 9950X3D2 and it’s hefty
🔍 Nsight · digitaltrends.com · 9h
System Design 101: The Architecture Behind Large Scale Applications
⚙️ Systems Programming · medium.com · 3h
Abaco Systems Launches VP892 FPGA Processing Engine
🔧 PTX · embedded.com · 1d
Untangling Tokio and Rayon in production: From 2s latency spikes to 94ms flat
⏱️ CUDA Events · posthog.com · 1d · Lobsters
Upgrading the IIoT Performance Envelope: How Hardware Affects IIoT Workloads
⚙️ Systems Programming · tigerdata.com · 3d
The Hidden Value of CPU-Intensive Compression on Modern Hardware
📈 Occupancy Optimization · klarasystems.com · 1d