Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⏩ SIMD
Specific
Vectorization, Parallel Processing, Performance, CPU Instructions
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
185314
posts in
22.8
ms
Closer in the Gap: Towards Portable Performance on
RISC-V
Vector
Processors
🔀
SIMD Programming
arxiv.org
·
16h
Speed-Optimized Python
3.14t
on Debian
Forky
: A Clang-19 Build Guide (Assisted by Google AI)
🐻❄️
Polars
dbaxps.blogspot.com
·
5d
BLAS,
Lapack
and
OpenMP
🧮
MKL
pypackaging-native.github.io
·
2d
·
Hacker News
SIMD-accelerate
JSON string scanning in
JSONLexer
(#1984)
🔀
SIMD Programming
github.com
·
3d
EULER-ADAS: Energy-Efficient & SIMD-Unified
Logarithmic-Posit
Engine for Precision-Reconfigurable Approximate ADAS Acceleration
⚡
Hardware Acceleration
arxiv.org
·
1d
FractalSortCPU
: Bandwidth-Efficient Compressed
Radix
Sort on CPU
📋
Columnar Storage
arxiv.org
·
16h
NikoMalik/probemap
: simd swiss table based map
🛣️
Highway
github.com
·
5d
·
r/rust
TLX: Hardware-Native,
Evolvable
MIMW
GPU Compiler for Large-scale Production Environments
⚡
Hardware Acceleration
arxiv.org
·
16h
On Similarity of Computational
Kernels
in our Codes and
Proxies
🧮
Vector Databases
arxiv.org
·
1d
REPTILES: Repeated Tiles of
Sargantana
, a RISC-V multicore based on
OpenPiton
🧵
OpenMP
arxiv.org
·
16h
TransDot
: An Area-efficient
Reconfigurable
Floating-Point Unit for Trans-Precision Dot-Product Accumulation for FPGA AI Engines
⚡
Hardware Acceleration
arxiv.org
·
1d
TREA
: Low-precision
Time-Multiplexed
, Resource-Efficient Edge Accelerator for Object Detection and Classification
🎯
Intel IPP
arxiv.org
·
1d
DICE: Enabling Efficient General-Purpose
SIMT
Execution with
Statically
Scheduled Coarse-Grained Reconfigurable Arrays
🎮
SIMT Execution
arxiv.org
·
4d
FalconGEMM
:
Surpassing
Hardware Peaks with Lower-Complexity Matrix Multiplication
⚡
Hardware Acceleration
arxiv.org
·
4d
Beyond
Static
Policies: Exploring Dynamic Policy
Selection
for Single-Thread Performance Optimization
📍
CPU Pinning
arxiv.org
·
4d
Enhancing Performance Insight at Scale: A Heterogeneous Framework for
Exascale
Diagnostics
📊
Extrae
arxiv.org
·
6d
Litespark
Inference on Consumer CPUs: Custom SIMD Kernels for
Ternary
Neural Networks
📱
Edge AI
arxiv.org
·
4d
Lifting to
tensors
when
compiling
scientific computing workloads for AI Engines
🌀
Naiad
arxiv.org
·
6d
Quantizing
With Randomized
Hadamard
Transforms: Efficient Heuristic Now Proven
📏
Run-Length Encoding
arxiv.org
·
4d
DITRON
: Distributed Multi-level
Tiling
Compiler for Parallel Tensor Programs
🦀
Rayon
arxiv.org
·
6d
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help