Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐๏ธ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
81190
posts in
2.14
s
AnyThermal
: Towards Learning Universal
Representations
for Thermal Perception
arxiv.org
ยท
5h
๐งฉ
Attention Kernels
Adaptive and Balanced
Re-initialization
for
Long-timescale
Continual Test-time Domain Adaptation
arxiv.org
ยท
5h
๐
Gradient Accumulation
Crafting the Eyes for Thinking Machines: Rewiring the
Retina
- The Anatomy of
ViTStruct
pub.towardsai.net
ยท
2d
๐๏ธ
Attention Optimization
CNN-based
Segmentation
of Medical
Imaging
Data
dev.to
ยท
20h
ยท
Discuss:
DEV
๐งฎ
cuDNN
Training language models on
TPUs
shouldn't be
scary
dogac.dev
ยท
3d
ยท
Discuss:
Hacker News
๐
TorchScript
Turn Claude From a
Chatbot
Into a
Thinking
Partner ๐ง
linas.substack.com
ยท
43m
ยท
Discuss:
Substack
๐ค
AI Coding Tools
ggml
: backend-agnostic tensor parallelism by
JohannesGaessler
ยท Pull Request #19378
github.com
ยท
3d
ยท
Discuss:
r/LocalLLaMA
๐ฏ
Tensor Cores
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
ยท
3d
ยท
Discuss:
Hacker News
๐
Gradient Accumulation
Recursive
Deductive
Verification: A framework for reducing AI
hallucinations
news.ycombinator.com
ยท
21h
ยท
Discuss:
Hacker News
๐
Gradient Accumulation
Main
Content ||
Math
โฉ Programming
jeremykun.com
ยท
12h
๐
Model Quantization
A Chinese Traditional
Opera
Video Super-Resolution Dataset Based on the โReal-world+โ
Degradation
Fusion
nature.com
ยท
1d
๐งฎ
cuDNN
Beyond RAG: Building an AI
Companion
with "Deep Memory" using Knowledge
Graphs
dev.to
ยท
10h
ยท
Discuss:
DEV
โก
ONNX Runtime
Prompt injection in Google
Translate
reveals base model
behaviors
behind task-specific fine-tuning
lesswrong.com
ยท
1d
ยท
Discuss:
Hacker News
๐ค
AI Coding Tools
Stanford
AI Breakthrough: Unlock ChatGPT
Creativity
medium.com
ยท
4h
๐ค
AI Coding Tools
Matching
the right LLM for your GPU feels like an art, but I finally
cracked
it
xda-developers.com
ยท
1d
๐
GPU Occupancy
AI-augmented
data quality engineering
infoworld.com
ยท
54m
๐ค
AI Coding Tools
Teon
Demonstrates
Improved Pre-Training With Language Models Up To 1B Parameters
quantumzeitgeist.com
ยท
3d
๐
Model Quantization
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
ยท
4d
ยท
Discuss:
Hacker News
,
r/Compilers
๐ฏ
Tensor Cores
Building Highly Efficient Inference System for
Recommenders
Using
PyTorch
pytorch.org
ยท
3d
ยท
Discuss:
Hacker News
๐
TorchScript
TTT-Discover
optimizes
GPU kernels 2x faster than human experts โ by training during inference
venturebeat.com
ยท
3d
โก
ONNX Runtime
Loading...
Loading more...
« Page 1
โข
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help