Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81511
posts in
437.9
ms
Emergent Low-Rank Training Dynamics in
MLPs
with Smooth
Activations
arxiv.org
·
11h
📊
Gradient Accumulation
HyPER: Bridging Exploration and
Exploitation
for Scalable LLM Reasoning with
Hypothesis
Path Expansion and Reduction
arxiv.org
·
11h
🎓
Model Distillation
Local-First AI: How
SLMs
are Fixing the
Latency
Gap 💻✨
dev.to
·
13h
·
Discuss:
DEV
⚡
Flash Attention
How We Achieved 30% Conversion Lift by Moving from GPT-4 to
LoRA
Adapters
dev.to
·
1h
·
Discuss:
DEV
🏎️
TensorRT
Flexible and
Economical
UTF-8
Decoder
bjoern.hoehrmann.de
·
2d
🔍
Type Checkers
Converting
Color
Depth
eastfarthing.com
·
2d
🎮
NVIDIA
Pratt
Parsers
: Expression Parsing Made Easy
journal.stuffwithstuff.com
·
1d
🔍
Type Checkers
Chasing a Zig
AVR
Segfault
Down to LLVM
sourcery.zone
·
1d
·
Discuss:
r/Zig
💡
LSP
The
Abseil
str
_format Library
abseil.io
·
1d
🔍
Type Checkers
Free online
toolbox
with 50+ useful tools – no ads, no
signup
required
strongtools.site
·
2d
·
Discuss:
r/SideProject
🐕
Ruff
Neural Bitcoin Small Class
Number
Attack
leetarxiv.substack.com
·
4d
·
Discuss:
Substack
⚡
ONNX Runtime
Empirical
Optimization of Quantum Magnetic Resonance Imaging Calibration Using Non‑
Commutative
Geometric Neural Networks
freederia.com
·
3d
🏎️
TensorRT
Continual
learning and the post
monolith
AI era
baseten.co
·
2d
·
Discuss:
Hacker News
📊
Gradient Accumulation
Theory-independent monitoring of the
decoherence
of a superconducting qubit with generalized
contextuality
nature.com
·
2d
🔄
ONNX
How to Stay
Valuable
When AI
Writes
All The Code
pathtostaff.com
·
1d
·
Discuss:
r/programming
🤖
AI Coding Tools
A Neuro Symbolic Architecture For Induced
Epistemic
Agency and System 2 Reasoning in
Quantized
Large Language Models
papers.ssrn.com
·
3d
·
Discuss:
Hacker News
⚡
ONNX Runtime
Neural population
geometry
and optimal coding of tasks with shared
latent
structure
nature.com
·
3d
📊
Gradient Accumulation
Prompt injection in Google
Translate
reveals base model
behaviors
behind task-specific fine-tuning
lesswrong.com
·
2d
·
Discuss:
Hacker News
🏎️
TensorRT
What should I program?
jamesmcm.github.io
·
1d
✂️
CUTLASS
Humane
, adaptive AI
bootstrapping
natemeyvis.com
·
3d
🤖
AI Coding Tools
Loading...
Loading more...
« Page 7
•
Page 9 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help