Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📏 ANN Benchmarks
Recall@K, Query Latency, Index Build Time, Memory Usage
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
32930
posts in
11.3
ms
Vectorizing
the
Trie
: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators
arxiv.org
·
2d
🔍
Information Retrieval
Optimal Heterogeneous Memory Configs for AI Tasks Under
Specified
Performance Metrics (Stanford,
UCSC
)
semiengineering.com
·
17m
🧠
Memory Hierarchy Design
From
19k
to 4.2M events/SEC: story of a SQLite query
optimisation
mnt.io
·
2h
·
Discuss:
Hacker News
🗄️
libSQL
[Benchmark]
Qwen3.5-122B-A10B
FP8 weights / bf16 KV on 8x RTX PRO 6000 (SM120): 1,985 tok/s burst, MTP 2.75x, fp8 KV silent corruption finding · Issue #19603
github.com
·
6h
·
Discuss:
r/LocalLLaMA
🖥
GPUs
Optimizing SSD-Resident Graph
Indexing
for
High-Throughput
Vector Search
arxiv.org
·
2d
🗂️
Vector Indexes
AI
Benchmark
Research
amplifying.ai
·
2d
🏆
LLM Benchmarking
Fast
Autoscheduling
for Sparse ML
Frameworks
fredrikbk.com
·
9h
·
Discuss:
Hacker News
🕯️
Candle
ann
_research
romanbikbulatov.bearblog.dev
·
4d
📊
Vector Databases
Physical echo state network based on the nonlinearity and dynamic response of
ambipolar
heterostructure
transistors
nature.com
·
13h
⚡
Hardware Acceleration
☕ AI battle
kill-the-newsletter.com
·
23h
🤖
AI
Improving Index
Selection
For Join
Queries
dolthub.com
·
2d
🔍
Query Optimization
Testing
Datadog
Explain Plan
Visualizer
With Oracle Execution Plans
tanelpoder.com
·
8h
🔍
EXPLAIN ANALYZE
SpacetimeDB
: A Short
Technical
Review
strn.cat
·
1d
·
Discuss:
Hacker News
⚙️
Database Internals
Context Window Optimization: Why
Ranking
, Not
Stuffing
, Is the Scaling Law for Agents
shaped.ai
·
2d
🧠
Agent Memory
Accuracy
vs. Speed in Local LLMs: Finding Your
Sweet
Spot
grigio.org
·
1d
·
Discuss:
Hacker News
🏗️
LLM Infrastructure
Build Your Own Key-Value Storage Engine—Week 7
read.thecoder.cafe
·
2d
🌳
Data Structures
An FPGA-based Accelerator Addressing Bottlenecks in GNN
Preprocessing
(
KAIST
et al.)
semiengineering.com
·
2d
⚡
Hardware Acceleration
Why I Built a
Masked
Autoencoder
(MAE) from Scratch (And How You Can Too)
pub.towardsai.net
·
1d
✨
Gemini
Beyond Porting: How vLLM
Orchestrates
High-Performance Inference on AMD
ROCm
blog.vllm.ai
·
2d
🏗️
LLM Infrastructure
fast-servers: an
interesting
pattern
geocar.sdf1.org
·
18h
·
Discuss:
Lobsters
🧵
Async
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help