Model Compression, Neural Networks, Precision Reduction, Efficient Inference

Feeds to Scour
SubscribedAll
Scoured 19416 posts in 262.9 ms
Quantization-Aware Regularizers for Deep Neural Networks Compression
arxiv.org·2h
🧠Learned Compression
Preview
Report Post
D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs
arxiv.org·2h
💻Local LLMs
Preview
Report Post
7 Advanced Feature Engineering Tricks Using LLM Embeddings
machinelearningmastery.com·15h
🧮Vector Embeddings
Preview
Report Post
What is Overfitting? - Overfitting in Machine Learning Explained
aws.amazon.com·3h·
Discuss: Hacker News
🧠Machine Learning
Preview
Report Post
AI streamlines deluge of data from particle collisions
phys.org·1d
🌊Stream Processing
Preview
Report Post
MichiAI: A 530M Full-Duplex Speech LLM with ~75ms Latency Using Flow Matching
ketsuilabs.io·15h·
Discuss: Hacker News
🎙️Whisper
Preview
Report Post
Training Design for Text-to-Image Models: Lessons from Ablations
huggingface.co·19h
📊Learned Metrics
Preview
Report Post
A New AI Architecture Without Prior Distributions: Stream-Based AI and Compositional Inference
dev.to·1d·
Discuss: DEV
🌊Streaming Algorithms
Preview
Report Post
Ask HN: Where does modern geometry survive contact with SGD?
news.ycombinator.com·55m·
Discuss: Hacker News
🌀Differential Geometry
Preview
Report Post
Convert & Compress
frontendmasters.com·15h
📦Deflate
Preview
Report Post
YOLOv2 & YOLO9000 Paper Walkthrough: Better, Faster, Stronger
towardsdatascience.com·16h
📊Learned Metrics
Preview
Report Post
Future leakage in block-quantized attention
matx.com·1d·
Discuss: Hacker News
📊Vector Quantization
Preview
Report Post
Beyond Two Towers: Re-architecting the Serving Stack for Next-Gen Ads Lightweight Ranking Models…
medium.com·1d
📊Feed Optimization
Preview
Report Post
Qwen3-Coder-Next: How to Run Locally
unsloth.ai·12h·
Discuss: Hacker News
💻Local LLMs
Preview
Report Post
Stop Torturing Your Data: How to Automate Rigor With AI
hackernoon.com·9h
Proof Automation
Preview
Report Post
CodeSOD: A Percise Parser
thedailywtf.com·1d
🌳Incremental Parsing
Preview
Report Post
The Gumbel-Max Trick
blog.quipu-strands.com·13h·
Discuss: Hacker News
🧮Kolmogorov Bounds
Preview
Report Post
YOLO11 x ArmPi Ultra: The Future of AI Waste Sorting
hackster.io·4h
🌊Streaming Systems
Preview
Report Post
Calling Lean Functions As Python Functions
philipzucker.com·2d
⚔️Lean Tactics
Preview
Report Post
The path to noncommutative function theory: a research story
noncommutativeanalysis.wordpress.com·17h
💻Programming languages
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help