Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Quantization
Model Compression, INT8, Weight Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
22753
posts in
15.4
ms
Quantization Explained: Q4_K_M vs
AWQ
vs
FP16
for Local LLMs
sitepoint.com
·
1d
🤖
LLM Inference
Understanding Representation Learning in Neural Networks (With
PyTorch
Example
)
dev.to
·
7h
·
Discuss:
DEV
🔥
PyTorch
SparseNUTS
: Preconditioning hierarchical models in
HMC
with a sparse “Laplace approximation” at the marginal mode
statmodeling.stat.columbia.edu
·
5h
🔀
Model Routing
Building a Real Image
Matching
Project with Gemini
Embedding
2
analyticsvidhya.com
·
11h
🔗
RAG
Understanding
Word2Vec
– Part 7: How Negative Sampling
Speeds
Up
Word2Vec
dev.to
·
5h
·
Discuss:
DEV
🔤
Tokenization
High-throughput
, low-cost
inference
ionrouter.io
·
6h
·
Discuss:
Hacker News
🤖
LLM Inference
Can models gradient hack
SFT
elicitation
?
lesswrong.com
·
1d
🐛
Fuzzing
Essential
Techniques
for Production
Vector
Search Systems, Part 4:
Multi-Vector
Search
dzone.com
·
12h
🔢
NumPy
Claude
Opus
4.6 Introduces Adaptive Reasoning and Context
Compaction
for Long-Running Agents
infoq.com
·
14h
🎭
Anthropic Claude
Recursion
in Python – A Practical Introduction for
Beginners
freecodecamp.org
·
8h
🔢
NumPy
roli-lpci/zer0dex
: Dual-layer memory for AI agents. Compressed index + vector store. 91% recall, 70ms, fully local.
github.com
·
9h
·
Discuss:
r/Python
💾
Agent Memory
Scaling
pgvector
: Memory,
Quantization
, and Index Build Strategies
mydba.dev
·
5d
·
Discuss:
DEV
🔢
NumPy
Cycle-Consistent
Activation
Oracles
lesswrong.com
·
21h
🤖
Large Language Models
Google's Gemini
Embedding
2
arrives
with native multimodal support to cut costs and speed up your enterprise data stack
venturebeat.com
·
1d
💬
NLP
Less-relevant results
Enabling
R8
optimization at scale with AI-assisted debugging
engineering.grab.com
·
1d
🚀
Performance
Machine Learning & AI Interview Study
Booklet
peymanr.github.io
·
2d
💬
NLP
Running Multiple Local Models: Memory Management
Strategies
sitepoint.com
·
1d
🤖
LLM Inference
AlloyDB
AI: Auto Embedding Generation & AI Functions now
Generally
Available
medium.com
·
1d
🤖
AI Tools
Faster
asin
() Was Hiding In
Plain
Sight
16bpp.net
·
1d
·
Discuss:
Lobsters
,
Hacker News
,
r/programming
💻
Terminal Emulators
Writing an LLM from scratch, part
32e
–
Interventions
: the learning rate
gilesthomas.com
·
2d
·
Discuss:
Hacker News
🎮
Reinforcement Learning
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help