📉 Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Scoured 80299 posts in 754.3 ms
On the Infinite Width and Depth Limits of Predictive Coding Networks
arxiv.org · 9h
📊 Gradient Accumulation
BitLogic: Training Framework for Gradient-Based FPGA-Native Neural Networks
arxiv.org · 9h
🎯 Tensor Cores
CNN-based Segmentation of Medical Imaging Data
dev.to · 2d · Discuss: DEV
🧮 cuDNN
Fastfood: Approximate Kernel Expansions in Loglinear Time
dev.to · 2d · Discuss: DEV
🔗 Kernel Fusion
Practical NLP for Risk Modeling, Part II - Fine-tuning DistilBERT End-to-End on Tornado Narratives
jtrive.com · 3d
🔄 ONNX
You don't need RAG in 2026
ryanlineng.substack.com · 2d · Discuss: Substack
⚡ ONNX Runtime
Finding the needle in the logstack: Reducing LLM context with TF-IDF
eliseomartelli.it · 4d
🔄 ONNX
Needed 10K prompts for my ML dataset, so I made this tool instead of copy-pasting for hours
promptanvil.com · 1d · Discuss: r/SideProject
🤖 AI Coding Tools
How to Access and Use Qwen3-Coder-Next?
analyticsvidhya.com · 5d
🤖 AI Coding Tools
AI Workflows
chatprd.ai · 1d
🤖 AI Coding Tools
Own your AI: Learn how to fine-tune Gemma 3 270M and run it on-device
developers.googleblog.com · 5d
🏎️ TensorRT
Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)
neutree.ai · 4d · Discuss: Hacker News
🔄 ONNX
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
paperium.net · 4d · Discuss: DEV
🏎️ TensorRT
When Optimization Works: The Role of Convexity in Business Decisions
pub.towardsai.net · 1d
🔗 Kernel Fusion
Teon Demonstrates Improved Pre-Training With Language Models Up To 1B Parameters
quantumzeitgeist.com · 5d
🏎️ TensorRT
StatLLM: A Dataset for Evaluating the Performance of Large Language Models in Statistical Analysis
nature.com · 4d
🏎️ TensorRT
Jokes on You AI: Turning the Tables
dev-log.me · 2d · Discuss: Hacker News
🤖 AI Coding Tools
Three AI engines walk into a bar in single file...
theregister.com · 1d
🤖 AI Coding Tools
Shows Learnable Permutation Improves Transformer Model Sparsity Performance
quantumzeitgeist.com · 5d
📊 Gradient Accumulation
Abstract This study presents a fully validated, commercially viable framework for extracting high‑level semantic content from intracranial electr...
freederia.com · 3d
🏎️ TensorRT