🔄 SIMD Programming
Specific
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 29483 posts in 50.4 ms
Microarchitectural Co-Optimization for Sustained Throughput of RISC-V Multi-Lane Chaining Vector Processors
🖥️ Hardware Architecture · arxiv.org · 6d
Xero.DataComparer – High-performance generic list comparator for .NET
🏹 Apache Arrow · github.com · 6h · Hacker News
Can a Chip That Loves Zeros Make Huge AI Models More Efficient?
⚡ Hardware Acceleration · spectrum.ieee.org · 5d · Hacker News, r/hardware
AMD and Intel Unveil ACE: New matrix instructions deliver a massive 16x AI performance leap over AVX
⚡ Hardware Acceleration · tweaktown.com · 3d · r/LocalLLaMA
My GPU Was Starving: How I Broke the I/O Wall for 3.7x Faster Training
⚙️ Mechanical Sympathy · pub.towardsai.net · 5d
atomic_queue benchmarks SMT vs no-SMT performance
⚡ Glommio · max0x7ba.github.io · 4d · r/cpp, r/linux
Exploring Sparse Matrix Multiplication Kernels on the Cerebras CS-3
🧮 Compute Optimization · arxiv.org · 2d
New CPU Memory Module
🖥️ Hardware Architecture · semiengineering.com · 5d
Restartable sequences, TCMalloc, and Hyrum's Law
🔓 Lock-Free Structures · lwn.net · 3d
Enabling Next-Generation AI Through Advanced Packaging and 3D Fabric Integration
🔬 Chip Fabrication · semiwiki.com · 4d
Gigabyte X870E Aorus Xtreme X3D AI Top motherboard review: The latest and greatest Xtreme
🖥 GPUs · tomshardware.com · 5d
Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression
🔬 RaBitQ · arxiv.org · 2d
Introducing Silico: the platform for building AI models with the precision of written software.
🆕 New AI · threadreaderapp.com · 3d
RuC: HDL-Agnostic Rule Completion Benchmark Generation
🕯️ Candle · arxiv.org · 2d
[2404.02581] Multi-Granularity Guided Fusion-in-Decoder
🔤 Tokenization · arxiv.org · 1d
Parameter-Efficient Architectural Modifications for Translation-Invariant CNNs
📦 Batch Embeddings · arxiv.org · 2d
How I Built ML-Powered LLM Routing with <5ms Latency
🏗️ LLM Infrastructure · pub.towardsai.net · 5d
Man, Machine, and Mathematics
📊 Vector Databases · arxiv.org · 2d
Scalable Network-on-Chip Enables a Modular Chiplet Platform
🔬 Chip Fabrication · semiwiki.com · 6d
Tailwind: A Practical Framework for Query Accelerators
⚡ Vectorized Execution · arxiv.org · 2d