Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
π Model Quantization
INT8, Post-Training, QAT, Pruning, Model Compression
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81835
posts in
461.6
ms
HQP
: Sensitivity-Aware Hybrid Quantization and
Pruning
for Ultra-Low-Latency Edge AI Inference
arxiv.org
Β·
16h
ποΈ
TensorRT
FMBench
: Adaptive Large Language Model Output
Formatting
arxiv.org
Β·
16h
π
ONNX
Show HN: Model Training Memory
Simulator
czheo.github.io
Β·
1d
Β·
Discuss:
Hacker News
π
Gradient Accumulation
AI-augmented
data quality engineering
infoworld.com
Β·
11h
π€
AI Coding Tools
Claude Code Power Tips
kdnuggets.com
Β·
4h
π€
AI Coding Tools
Your
VCL
App: 4x to 11x Faster Math Performance with
Elements
blogs.remobjects.com
Β·
7h
Β·
Discuss:
Hacker News
βοΈ
CUTLASS
Fastfood
: Approximate Kernel Expansions in
Loglinear
Time
paperium.net
Β·
1d
Β·
Discuss:
DEV
π
Kernel Fusion
Import AI 444: LLM
societies
; Huawei makes kernels with AI;
ChipBench
importai.substack.com
Β·
7h
Β·
Discuss:
Substack
β‘
ONNX Runtime
Lean
4 and the CurryβHoward
correspondence
wildonblog.wordpress.com
Β·
2h
π
ONNX
Large Language Models Live in Time
lesswrong.com
Β·
6h
π
Gradient Accumulation
llama.cpp
guide - Running LLMs
locally
, on any hardware, from scratch
blog.steelph0enix.dev
Β·
17h
π‘
LSP
Simulated
depression risk classification from Parkinsonβs voice features using a self-attention-enhanced
MLP
architecture
nature.com
Β·
21h
π
Gradient Accumulation
How to Stop
Burning
Budget on
Over-Hyped
Image Models (The Selection Framework)
dev.to
Β·
14h
Β·
Discuss:
DEV
π
ONNX
Testing 80 LLMs on
spatial
reasoning on
grids
mihai.page
Β·
21h
Β·
Discuss:
Hacker News
β‘
ONNX Runtime
=============================================================================================================================== **Abstract**
freederia.com
Β·
3d
ποΈ
TensorRT
Cross Entropy
Derivatives
, Part 6: Using gradient
descent
to reach the final result
dev.to
Β·
1d
Β·
Discuss:
DEV
π
Gradient Accumulation
World Models and the Data Problem in
Robotics
joeljang.github.io
Β·
4h
Β·
Discuss:
Hacker News
π
Gradient Accumulation
Increasing
the Speed of Offline Raspberry Pi AI Chatbot #
raspberrypi
blog.adafruit.com
Β·
1h
π
Gradient Accumulation
π₯Top AI
Papers
of the Week
nlp.elvissaravia.com
Β·
1d
β‘
ONNX Runtime
An
attempt
at a
First-Proof
AI challenge
abhvio.us
Β·
1d
Β·
Discuss:
Hacker News
π
Kernel Fusion
Loading...
Loading more...
« Page 1
β’
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help