🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Scoured 82789 posts in 1.01 s
Quantization-Aware Distillation
ternarysearch.blogspot.com · 8h · Discuss: Hacker News
📉 Model Quantization
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
paperium.net · 2d · Discuss: DEV
📊 Gradient Accumulation
How I squeezed a BERT sentiment analyzer into 1GB RAM on a $5 VPS
mohammedeabdelaziz.github.io · 19h · Discuss: Hacker News
⚡ ONNX Runtime
Show HN: Model Training Memory Simulator
czheo.github.io · 1h · Discuss: Hacker News
📊 Gradient Accumulation
NVIDIA VibeTensor: AI Just Built Its Own Deep Learning Engine… And It Actually Works (AI Revolution
youtube.com · 1h
🤖 AI Coding Tools
Physics-Informed Neural Networks for Inverse PDE Problems
pub.towardsai.net · 19h
📊 Gradient Accumulation
Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
dev.to · 12h · Discuss: DEV
📊 Gradient Accumulation
Abstract: We introduce a rigorously engineered hybrid pipeline that transforms deep generative neural architectures into quadratic unconstrained b...
freederia.com · 2d
📉 Model Quantization
qrafty-ai/teleop_xr: Transforms your VR/AR headset into a powerful, precise robot controller
github.com · 6h · Discuss: Hacker News
🤖 AI Coding Tools
StatLLM: A Dataset for Evaluating the Performance of Large Language Models in Statistical Analysis
nature.com · 1d
🔄 ONNX
Logarithmic-time Schedules for Scaling Language Models with Momentum
arxiv.org · 2d
📊 Gradient Accumulation
Run Voxtral Mini 4B Realtime on vLLM with Red Hat AI on Day 1: A step-by-step guide
developers.redhat.com · 1d
⚡ ONNX Runtime
Crafting the Eyes for Thinking Machines: Rewiring the Retina - The Anatomy of ViTStruct
pub.towardsai.net · 1d
👁️ Attention Optimization
Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com · 2d · Discuss: Hacker News
📊 Gradient Accumulation
Training language models on TPUs shouldn't be scary
dogac.dev · 2d · Discuss: Hacker News
📜 TorchScript
ggml: backend-agnostic tensor parallelism by JohannesGaessler · Pull Request #19378
github.com · 2d · Discuss: r/LocalLLaMA
🎯 Tensor Cores
Prompt injection in Google Translate reveals base model behaviors behind task-specific fine-tuning
lesswrong.com · 21h · Discuss: Hacker News
🤖 AI Coding Tools
A Chinese Traditional Opera Video Super-Resolution Dataset Based on the “Real-world+” Degradation Fusion
nature.com · 23h
🧮 cuDNN
TTT-Discover optimizes GPU kernels 2x faster than human experts — by training during inference
venturebeat.com · 2d
⚡ ONNX Runtime
Matching the right LLM for your GPU feels like an art, but I finally cracked it
xda-developers.com · 10h
📈 GPU Occupancy