⚡ Inference Optimization
Quantization, Model Compression, KV Cache, Speculative Decoding
Scoured 39 posts in 9.3 ms
How I squeezed a BERT sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io · 5d · Discuss: Hacker News
💾 KV Cache
Zvec: SQLite-like simplicity in an embedded vector database (By Alibaba)
zvec.org · 6h · Discuss: Hacker News
💾 KV Cache
Main Content || Math ∩ Programming
jeremykun.com · 3d
↩️ Backpropagation
Recursive self-improvement from AI models
marginalrevolution.com · 2d · Discuss: Hacker News
🎛️ Fine-Tuning
GLM-5: From Vibe Coding to Agentic Engineering
simonwillison.net · 1d · Discuss: Hacker News
🤖 agentic system
Queer Festival Troubles
madeinchinajournal.com · 9h
🤖 LLM Agents
Performance Tip of the Week #79: Make at most one tradeoff at a time
abseil.io · 4d
💾 KV Cache
Snippets? Apps? Visuals? Why classical music should stop trying to be pop
theguardian.com · 1d
⚡ FlashAttention
ui.sh
ui.sh · 23h · Discuss: Hacker News
🔧 MLIR
Versatile Wide Leg Pants Styling Tips for Every Occasion
artecloth.com · 3d
🎭 Mixture of Experts
Bearblog Resource List
grizzlygazette.bearblog.dev · 1d
🤖 LLM Agents
Homemade Chinese Fermented Beancurd Tofu 腐乳
thehongkongcookery.com · 3d
⚙ post training infra
Travel to Cheap Destinations
nomagicpill.substack.com · 4d · Discuss: Substack
⚡ FlashAttention
A very good deconstruction of the #NATO expansion fable, with lots of details and actual document...
agora.echelon.pl · 4d
📐 Linear Algebra
Journalism isn’t dying
onemanandhisblog.com · 3d
⚡ FlashAttention
High CPU reading epubs and PDFs in Linux
octet-stream.net · 2d
💾 KV Cache
A Place to Call Home
thebaffler.com · 6d · Discuss: r/boston
🤖 agentic system
The Web Has 1,437 TLDs and a Single Imagination
blog.palmer.lol · 5d
🤖 LLM Agents
Zlob.h: 100% POSIX and glibc compatible globbing lib that is faster and better
github.com · 5d · Discuss: Hacker News, r/Zig, r/rust
🔧 MLIR
C Isn't A Programming Language Anymore
faultlore.com · 6d · Discuss: Hacker News
🔧 MLIR