Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80563
posts in
836.9
ms
Regularized Calibration with
Successive
Rounding
for Post-Training Quantization
arxiv.org
ยท
4d
๐๏ธ
TensorRT
LQA
: A Lightweight
Quantized-Adaptive
Framework for Vision-Language Models on the Edge
arxiv.org
ยท
12h
๐๏ธ
TensorRT
Cleaning Up Complexity:
Preprocessing
Attribution
Maps for Better Evaluation
dev.to
ยท
5h
ยท
Discuss:
DEV
๐๏ธ
Attention Optimization
MiRAGE
: Open-source framework for multimodal
RAG
evaluation
news.ycombinator.com
ยท
1h
ยท
Discuss:
Hacker News
๐งฎ
cuDNN
Manufacturing
QMS
Software
samrian.com
ยท
1d
ยท
Discuss:
Hacker News
โฑ๏ธ
Benchmarking
the
mathematics
of
compression
in database systems
bitsxpages.com
ยท
21h
๐
Occupancy Optimization
From
Pixels
to
Precision
dev.to
ยท
5h
ยท
Discuss:
DEV
โก
Flash Attention
Gated
Attention &
DeltaNets
: The Missing Link for Long-Context AI
pub.towardsai.net
ยท
11h
๐๏ธ
Attention Optimization
A Note on
Flat
Abstract
Syntax
Trees
gist.github.com
ยท
22h
ยท
Discuss:
Hacker News
๐ฌ
Static Analysis
Geometrically
Allocated
Ads in AI Conversations
june.kim
ยท
14h
ยท
Discuss:
Hacker News
๐งฉ
Attention Kernels
Automating Inference Optimizations with NVIDIA
TensorRT
LLM
AutoDeploy
developer.nvidia.com
ยท
22h
๐๏ธ
TensorRT
Scale LLM fine-tuning with
Hugging
Face and Amazon
SageMaker
AI
aws.amazon.com
ยท
1d
๐
Model Distillation
Colab
marketplace.visualstudio.com
ยท
3h
โ๏ธ
CUTLASS
Sense8
WorldToolKit
Demo v1.01 :
Sense8
: Free Download, Borrow, and Streaming
archive.org
ยท
18h
๐๏ธ
TensorRT
A
Time-Synchronized
Multi-Sensor drone dataset acquired from multiple
radars
and RF receiver
nature.com
ยท
4h
๐
Kernel Fusion
Drifting
models
breno.bearblog.dev
ยท
1d
๐
Model Distillation
Show HN: Model Training Memory
Simulator
czheo.github.io
ยท
2d
ยท
Discuss:
Hacker News
๐
Gradient Accumulation
Handwriting
vs AI: Real Performance of AI on Handwritten
Documents
hackernoon.com
ยท
21m
๐
Gradient Accumulation
Your
VCL
App: 4x to 11x Faster Math Performance with
Elements
blogs.remobjects.com
ยท
1d
ยท
Discuss:
Hacker News
โ๏ธ
CUTLASS
AI-augmented
data quality engineering
infoworld.com
ยท
1d
๐ค
AI Coding Tools
Loading...
Loading more...
« Page 1
โข
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help