SIMD Programming
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 112370 posts in 665.4 ms

AVX2 SIMD Optimization for 12-bit JPEG Decoding in libjpeg-turbo - Pair Programming with Copilot CLI
github.com · 2d · Discuss: DEV
CUTLASS

SimuScene: Training and Benchmarking Code Generation to Simulate Physical Scenarios
arxiv.org · 1d
TorchScript

Allocators from C to Zig
antonz.org · 17h · Discuss: Lobsters, Hacker News
CUDA Memory Management

From Chunks to Connections: The Intuitive Guide to Graph RAG
pub.towardsai.net · 35m
CUTLASS

Show HN: A header-only C++ benchmark for predictive models on raw binary streams
github.com · 20h · Discuss: Hacker News
TensorRT

Technical "whitepaper" for afl-fuzz
lcamtuf.coredump.cx · 14h · Discuss: Lobsters
Ruff

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com · 15h
TorchScript

Can you disable multithreaded calculations for avoidance logic?
forrestthewoods.com · 18h · Discuss: r/godot
CUDA Programming Patterns

Implementing 3D Graphics Basics
hackaday.com · 1d
CUTLASS

What I'm Learning in Data Structures: The Algorithm Behind Compression (bzip, etc.)
dev.to · 1d · Discuss: DEV
Model Quantization

A C implementation of the inference pipeline for the Mistral AI's Voxtral Realtime 4B model
blog.adafruit.com · 12h
TensorRT

A single stable kernel for Thursday
lwn.net · 14h
Kernel Fusion

Training-Free Real-Time Control for Autoregressive Video Generation
daydream.live · 14h · Discuss: Hacker News
TensorRT

A stack-buffer-overflow exercise with AddressSanitizer and PostgreSQL
enterprisedb.com · 3h · Discuss: Lobsters, Hacker News
Profiling Tools

AI in Multiple GPUs: Understanding the Host and Device Paradigm
towardsdatascience.com · 16h
CUDA Events

EyesOff: Why Some Models Quantize Better Than Others
ym2132.github.io · 1d · Discuss: Hacker News
Model Quantization

Discussion - Investigation of Single Thread CPU "Thoughput/cycle"
forums.anandtech.com · 1d
Profiling Tools

Show HN: Solving Sudoku reasoning via Energy Geometric models
davisgeometric.com · 19h · Discuss: Hacker News
CUTLASS

Scaling llama.cpp On Neoverse N2: Solving Cross-NUMA Performance Issues
semiengineering.com · 21h
Occupancy Optimization

Product Forecasting through Time Series Analysis (Modelling)
pub.towardsai.net · 5h
BF16