TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
MMEarth-Bench: Global Model Adaptation via Multimodal Test-Time Training
arxiv.org · 13h
Gradient Accumulation

Learning a Generative Meta-Model of LLM Activations
arxiv.org · 13h
Gradient Accumulation

Zero-Latency Local AI: Tuning Your Linux Kernel for LLM Inference
dev.to · 2d · Discuss: DEV
ONNX Runtime

AI Automation with GPT + n8n: A Practical Guide for CTOs and Developers
dev.to · 5h · Discuss: DEV
AI Coding Tools

Writing an LLM from scratch, part 32d -- Interventions: adding attention bias
gilesthomas.com · 2d · Discuss: Hacker News
Gradient Accumulation

Building a Dynamic Multilanguage System Without Rebuilds
kuldeepmodi.vercel.app · 1d · Discuss: DEV
LSP

Finding the needle in the logstack: Reducing LLM context with TF-IDF
eliseomartelli.it · 3d
ONNX

LiteRT for Web with LiteRT.js | Google AI Edge | Google AI for Developers
ai.google.dev · 4d
TorchScript

Abstract
freederia.com · 3d
Model Quantization

25W06. Learning a language with the machine
z1nz0l1n.com · 1d
Ml-eng

Abstract: Algorithmic bias in generative adversarial networks (GANs) poses a significant challenge to equitable AI deployment. This paper proposes a nove...
freederia.com · 4d
Gradient Accumulation

Own your AI: Learn how to fine-tune Gemma 3 270M and run it on-device
developers.googleblog.com · 4d
Model Quantization

AI Sees And Understands Images Far More Efficiently With New Embedding Technique
quantumzeitgeist.com · 3d
Attention Optimization

Proposal: A Framework for Discovering Alien Physics via Optimal Compression
lesswrong.com · 2d
Model Quantization

30,000 NVIDIA Engineers Use Generative AI for 3x Higher Code Output
techpowerup.com · 1d
NVIDIA

Examining Turbopuffer ANN v3
terencezl.github.io · 4d · Discuss: Hacker News
Profiling Tools

Improving atlas-scale single-cell annotation models with hierarchical cross-entropy loss
nature.com · 3d
Gradient Accumulation

deepmriprep: voxel-based morphometry preprocessing via deep neural networks
nature.com · 3d
Model Quantization

The vibe coding spectrum: from weekend hacks to the dark factory
betterthangood.xyz · 12h
AI Coding Tools

AI Workflows with human-in-the-loop
weavemind.ai · 1d · Discuss: Hacker News
AI Coding Tools