Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🧮 MKL
Specific
Math Kernel Library, BLAS, LAPACK, Intel oneAPI
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
25
posts in
30.0
ms
avencera/speakrs: Speaker diarization in Rust. 312–912x realtime on Apple Silicon, 50–121x on CUDA.
Matches
pyannote accuracy.
🍱
Nom
github.com
·
5d
·
Hacker News
,
r/rust
Less-relevant results
I Built a Neural Network from Scratch in Rust — Then Compiled It to WebAssembly
🍱
Nom
dev.to
·
2d
·
DEV
5 More Must-Know Python Concepts
⚡
FastAPI
kdnuggets.com
·
6d
From Roofline to Ruggedness: Decomposing and Smoothing the
GEMM
Performance Landscape
🚀
Performance
arxiv.org
·
2d
Marvel Vs. Capcom Fans Talk About What It’s Like Waiting So Long For A Comeback And Why Marvel Tōkon Could Be The Answer
🕹️
Retro Gaming
kotaku.com
·
5d
A Case for Tracing Based DSL
Kernel
Languages
🔍
KLEE
metaworld.me
·
4d
·
Hacker News
Velocity in Every Voxel
🧭
Inertial Navigation
atomsfrontier.substack.com
·
6d
·
Substack
CUDA 13.3: NVIDIA continues to move GPU programming from the thread to the tile
🎮
SIMT Execution
igorslab.de
·
3d
llama.cpp B9387 Significant AMD/ROCm PP Update
🧩
mimalloc
github.com
·
2d
·
r/LocalLLaMA
Elusive order of async GPU
kernels
: scheduling, abstractions, DSL implications
🎮
SIMT Execution
ianbarber.blog
·
5d
·
Hacker News
Real-time LLM Inference on Standard GPUs (3,000 tokens/s per request)
🌊
Memory Bandwidth
blog.kog.ai
·
3d
·
Hacker News
,
Hacker News
Introducing 1-bit and Ternary Bonsai Image 4B: Image Generation for Local Devices
💡
Photon
prismml.com
·
5d
·
Hacker News
,
Hacker News
Nonanti/mathcore
: Symbolic math
library
and computer
algebra
system for Rust
🦀
Rust Macros
github.com
·
3d
Verilog-Evolve: Feedback-Driven and Skill-Evolving Verilog Generation
🔌
FPGA Programming
arxiv.org
·
4d
NVIDIA CUTLASS: High-Performance CUDA Templates for AI
Linear
Algebra
🎮
SIMT Execution
dev.to
·
3d
·
DEV
On the Fast Fourier Transform on SU(2)
📐
Linear Algebra
arxiv.org
·
5d
Writing High-Performance
Kernels
in TileLang, from
GEMM
to MLA
🔄
Glommio vs Tokio
dev.to
·
5d
·
DEV
Oabraham1/wave: WAVE (Wide Architecture Virtual Encoding) - The universal GPU ISA. Write GPU
kernels
once, run on Apple, NVIDIA, AMD, and
Intel
GPUs. Includes compiler, four backends, emulator, and SDKs for Python, Rust, C++, and TypeScript.
⚡
Hardware Acceleration
github.com
·
5d
·
Hacker News
Apple Silicon's AI Ceiling Is Higher Than You Think
⚡
Hardware Acceleration
dev.to
·
5d
·
DEV
RT-Lynx: Putting the
GEMM
Sparsity
In a Right Way for Diffusion Models
🕸️
GraphBLAS
arxiv.org
·
4d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help