Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
↔️ SIMD Horizontal Operations
Reduction Operations, Horizontal Add, Cross-Lane Reduction, SSSE3
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
146379
posts in
26.1
ms
Optimizing
Recommendation Systems with
JDK
’s Vector API
netflixtechblog.com
·
3h
·
Discuss:
Hacker News
⚡
SIMD Optimization
TurboSparse
Efficiency: Achieving 97% Parameter Sparsity in
Mixtral-47B
hackernoon.com
·
1h
🤖
TVM
Intel’s new
Xeon
600
processors
confirmed to clock up to 4.9GHz
kitguru.net
·
14h
⚙️
CPU Microarchitecture
🧱 Beginner-Friendly Guide 'Minimum
Swaps
to
Arrange
a Binary Grid' - Problem 1536 (C++, Python, JavaScript)
dev.to
·
11h
·
Discuss:
DEV
⚡
Quicksort
Hyprland 0.54 brings per-workspace
layouts
, major performance gains &
Hyprnix
integration
alternativeto.net
·
17h
⚡
BOLT
Building an Open-Source
Verilog
Simulator with AI:
580K
Lines in 43 Days
normalcomputing.com
·
2h
·
Discuss:
Hacker News
🔍
DTrace
GenDRAM
:Hardware-Software Co-Design of General Platform in
DRAM
arxiv.org
·
23h
🌊
Memory Bandwidth
Advancing
vRAN
Economics with AMD
EPYC
8005 Server CPUs
storagereview.com
·
12h
🛡️
AMD SEV
A
Number
with a
Shadow
campedersen.com
·
23h
🌀
Naiad
WarpSpeed
automatically rewrites Nvidia core library, achieves 3.6-100x
speedup
doubleai.com
·
11h
·
Discuss:
Hacker News
⚡
Hardware Acceleration
Beyond
Pandas
:
Architecting
High-Performance Python Pipelines
hackernoon.com
·
8h
🐙
Benthos
What
Happens
When You Put “n” Billion
Weights
in Your RAM
pub.towardsai.net
·
1d
🧩
mimalloc
Optimized
Compilation
for Distributed Quantum Computing
arxiv.org
·
23h
⚛️
Quantum Computing
AMD
Zen
7 CPU leak shows huge core
counts
club386.com
·
12h
🏗️
CPU Cache Topology
MoRI
— AMD's MoE dispatch and
KV
Cache library
github.com
·
1d
🔄
Glommio vs Tokio
Building a Virtual Computer for the Intel 80286
hackster.io
·
9h
⚡
RISC-V
New Zlib-rs Delivers More Performance With AVX-512
VNNI
Adler32
Implementation
phoronix.com
·
15h
🔢
AVX-512
Implementing
Burger-Dybvig
: finding the shortest decimal that round-trips to the original IEEE 754 bits, with
ECMA-262
tie-breaking
lattice-substrate.github.io
·
14h
·
Discuss:
r/programming
📏
Run-Length Encoding
A GPU
Microarchitecture
Optimized for Fully
Homomorphic
Encryption
semiengineering.com
·
1d
🔢
Homomorphic Encryption
Simulating
Queueing
buttondown.com
·
9h
📮
Multi-producer Queues
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help