Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏎️ TensorRT
Inference Optimization, Model Deployment, NVIDIA, Quantization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
83166
posts in
1.59
s
Reducing the Computational Cost Scaling of Tensor Network Algorithms via
Field-Programmable
Gate Array
Parallelism
arxiv.org
·
22h
🎯
Tensor Cores
Multi-Scale
Global-Instance
Prompt Tuning for
Continual
Test-time Adaptation in Medical Image Segmentation
arxiv.org
·
22h
📊
Gradient Accumulation
LLM Inference
Benchmarking
-
Measure
What Matters
digitalocean.com
·
12h
⏱️
Benchmarking
Deterministic AI:
Reclaiming
Predictable Latency with Rust and Zero-Cost
Abstractions
dev.to
·
13h
·
Discuss:
DEV
🎯
Tensor Cores
— ### Abstract Accurate prediction of
physicochemical
descriptors (e.g., aqueous solubility log S,
octanol
‑water partition coefficient log P, and acid ...
freederia.com
·
1d
⚡
ONNX Runtime
Tensor‑Network Path‑Integral Algorithm for Efficient Simulation of Discrete 3‑D Quantum Gravity and its Application to
Cosmological
Data **Abstract** We
intr
...
freederia.com
·
17h
🎯
Tensor Cores
New AI system
pushes
the time
limits
of generative video
techxplore.com
·
12h
📊
Gradient Accumulation
Your Agent Is
Slow
Because of
Inference
futureagi.com
·
12h
·
Discuss:
DEV
⚡
ONNX Runtime
Stochastic
Adversarial
Video Prediction
dev.to
·
3h
·
Discuss:
DEV
🧮
cuDNN
Proposal: A Framework for
Discovering
Alien Physics via Optimal
Compression
lesswrong.com
·
9h
📉
Model Quantization
Own your AI: Learn how to fine-tune
Gemma
3
270M
and run it on-device
developers.googleblog.com
·
1d
📉
Model Quantization
Beyond Transformers.
Physics-Centric
Machine Learning for
Analog
semiwiki.com
·
1d
📉
Model Quantization
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
3h
·
Discuss:
Hacker News
📊
Gradient Accumulation
LiteRT
for Web with
LiteRT.js
| Google AI Edge | Google AI for Developers
ai.google.dev
·
1d
📜
TorchScript
Finding the needle in the
logstack
: Reducing LLM context with
TF-IDF
eliseomartelli.it
·
1d
🔄
ONNX
Logistic
Regression, Average Marginal Effects, and the Linear Probability Model - Part II:
Coefficients
and AMEs of nested models
elff.eu
·
12h
🔄
ONNX
The Data
Pipeline
for
Superintelligence
Starts with Your Screen
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
🧩
Attention Kernels
A Neuro Symbolic Architecture For Induced
Epistemic
Agency and System 2 Reasoning in
Quantized
Large Language Models
papers.ssrn.com
·
1d
·
Discuss:
Hacker News
⚡
ONNX Runtime
Released:
DeepBrainz-R1
— reasoning-first small models for agentic workflows (
4B
/ 2B
huggingface.co
·
1d
·
Discuss:
Hacker News
,
r/LocalLLaMA
⚡
ONNX Runtime
Released
genai
v0.1.0: a
sane
Go AI SDK
maruel.ca
·
1d
·
Discuss:
r/golang
⚡
ONNX Runtime
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help