Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80357
posts in
847.0
ms
DualMap
: Enabling Both Cache
Affinity
and Load Balancing for Distributed LLM Serving
arxiv.org
·
1d
📈
Occupancy Optimization
CoRefine
: Confidence-Guided
Self-Refinement
for Adaptive Test-Time Compute
arxiv.org
·
10h
⏱️
Benchmarking
Tip of the Week #227: Be Careful with Empty Containers and
Unsigned
Arithmetic
abseil.io
·
2d
🔍
Type Checkers
Enhancement
on
Caching
and Service Workers
dev.to
·
2d
·
Discuss:
DEV
🏗️
Build Optimization
Leveraging io_
uring
for
performant
asynchronous linux applications.
dev.to
·
1d
·
Discuss:
DEV
⏱️
CUDA Events
Revisiting
Regular
Types
abseil.io
·
2d
🔍
Type Checkers
4x NVMe SSD home server (
CNC
aluminum and
walnut
)
umbrel.com
·
4d
·
Discuss:
Hacker News
🏗️
Build Systems
google/flatbuffers
:
FlatBuffers
: Memory Efficient
Serialization
Library
github.com
·
3d
📜
TorchScript
Real space geometry of
aperiodic
tilings
as control knob for quantum physics
mappingignorance.org
·
5d
⚡
CUDA Programming Patterns
Replicating
the
Shadowglass
3D pixel-art technique
tesseractc.at
·
4d
·
Discuss:
Hacker News
⚡
CUDA Programming Patterns
The Top 10 Best
Practices
for AI/BI
Dashboards
Performance Optimization (Part 2)
databricks.com
·
5d
📈
Occupancy Optimization
High‑Performance Cryptographic Hash Function Based on
Mersenne
Prime Field Arithmetic and Efficient Modular Reduction **Abstract**
Mersenne
primes
\(p=2^q-1\...
freederia.com
·
5d
🎯
Tensor Cores
Sparse
Sum
‑of‑
Squares
Certification for High‑Dimensional Stochastic Control Systems — ### Abstract High‑dimensional stochastic control systems—such as ...
freederia.com
·
3d
🔗
Kernel Fusion
deepmriprep
: voxel-based
morphometry
preprocessing via deep neural networks
nature.com
·
4d
🏎️
TensorRT
Postgres
performance at any scale
pganalyze.com
·
5d
⏱️
Benchmarking
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
·
4d
·
Discuss:
Hacker News
📜
TorchScript
CIPS
Stack – 5 memory systems that give your AI agents
persistent
memory
cipscorps.io
·
5d
·
Discuss:
Hacker News
💡
LSP
denysvitali/claude-code-patches
: Make Claude Code fast again.
github.com
·
5d
·
Discuss:
Hacker News
🏗️
Build Optimization
WebGPU
Compute
Shaders
webgpufundamentals.org
·
6d
🎮
NVIDIA
Using
Nsight
Compute with large
codebases
- Part 2 : Profiling large code bases
blog.ncompass.tech
·
6d
·
Discuss:
Hacker News
🔍
Nsight
Loading...
Loading more...
« Page 8
•
Page 10 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help