TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
paperium.net · 1d · Discuss: DEV
Gradient Accumulation

Abstract: We introduce a rigorously engineered hybrid pipeline that transforms deep generative neural architectures into quadratic unconstrained b...
freederia.com · 1d
Model Quantization

StatLLM: A Dataset for Evaluating the Performance of Large Language Models in Statistical Analysis
nature.com · 20h
ONNX

Logarithmic-time Schedules for Scaling Language Models with Momentum
arxiv.org · 1d
Gradient Accumulation

**Title**
dev.to · 2h · Discuss: DEV
Kernel Fusion

Crafting the Eyes for Thinking Machines: Rewiring the Retina - The Anatomy of ViTStruct
pub.towardsai.net · 5h
Attention Optimization

ggml: backend-agnostic tensor parallelism by JohannesGaessler · Pull Request #19378
github.com · 1d · Discuss: r/LocalLLaMA
Tensor Cores

Run Voxtral Mini 4B Realtime on vLLM with Red Hat AI on Day 1: A step-by-step guide
developers.redhat.com · 13h
⚡ ONNX Runtime

Hypernetworks: Neural Networks for Hierarchical Data
blog.sturdystatistics.com · 1d · Discuss: Hacker News
Gradient Accumulation

TTT-Discover optimizes GPU kernels 2x faster than human experts - by training during inference
venturebeat.com · 1d
⚡ ONNX Runtime

Training language models on TPUs shouldn't be scary
dogac.dev · 1d · Discuss: Hacker News
TorchScript

Teon Demonstrates Improved Pre-Training With Language Models Up To 1B Parameters
quantumzeitgeist.com · 1d
Model Quantization

Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)
neutree.ai · 18h · Discuss: Hacker News
ONNX

Pathwise Test-Time Correction for Autoregressive Long Video Generation
arxiv.org · 1d
Gradient Accumulation

Building Highly Efficient Inference System for Recommenders Using PyTorch
pytorch.org · 1d · Discuss: Hacker News
TorchScript

A generalizable foundation model for analysis of human brain MRI
nature.com · 22h
Gradient Accumulation

NVIDIA Releases VibeTensor: A Deep Learning Runtime from Coding Agents
dev.to · 2d · Discuss: DEV
Tensor Cores

Fast Autoscheduling for Sparse ML Frameworks
ajroot.pl · 2d · Discuss: Hacker News
Tensor Cores

Is Your Machine Learning Pipeline as Efficient as it Could Be?
kdnuggets.com · 19h
Gradient Accumulation

ML-LIB: Machine Learning Library Proposed For The Linux Kernel
phoronix.com · 13h · Discuss: Hacker News
MLOps