Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ ONNX Runtime
Model Deployment, Cross-framework, Inference Engine, Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112496
posts in
272.1
ms
GPU-Serving
Two-Tower
Models for Lightweight Ads Engagement Prediction
medium.com
·
23h
⚡
Flash Attention
Building
Physical
Agentic
AI
dansitu.substack.com
·
1d
·
Discuss:
Substack
🚀
MLOps
A Neural Network
Playground
playground.tensorflow.org
·
3h
🧮
cuDNN
Running an
experiment
with Claude Code
overnight
blog.nolank.ca
·
5h
🤖
AI Coding Tools
STAR :
Bridging
Statistical
and Agentic Reasoning for Large Model Performance Prediction
arxiv.org
·
1d
🔄
ONNX
harishsg993010/tiny-NPU
: opensource NPU for LLM inference (this run
gpt2
)
github.com
·
2d
·
Discuss:
r/LocalLLaMA
🔄
ONNX
Building an Embedding API with Rust, Arm, and
EmbeddingGemma
on AWS
Lambda
sobolev.substack.com
·
1d
·
Discuss:
Substack
🔄
ONNX
Presentation: Building
Embedding
Models for Large-Scale Real-World
Applications
infoq.com
·
1d
🎓
Model Distillation
How low-bit
inference
enables
efficient AI
dropbox.tech
·
11h
·
Discuss:
Hacker News
🎯
Tensor Cores
You are
probably
overpaying
for intelligence
residuals.bearblog.dev
·
1d
⏱️
Benchmarking
Show HN:
Darius
– An AI router that
selects
the best model for each prompt
withdarius.com
·
1d
·
Discuss:
Hacker News
🤖
AI Coding Tools
Best MCP
Gateways
to Connect Tools and MCP
Servers
to Your AI Agent
getmaxim.ai
·
5h
·
Discuss:
DEV
🤖
AI Coding Tools
Completed
Hyperparameter
Transfer across Modules, Width, Depth, Batch and
Duration
machinelearning.apple.com
·
1d
🎓
Model Distillation
Leaning Into the Coding Interview:
Lean
4 vs
Dafny
cage-match
ntaylor.ca
·
3h
·
Discuss:
Lobsters
,
Hacker News
🔍
Type Checkers
Show HN:
Metaxy
–
versioning
for multimodal data pipelines
docs.metaxy.io
·
23h
·
Discuss:
Hacker News
🔄
ONNX
AI-Powered Knowledge Graph Generator &
APTs
, (Thu,
Feb
12th)
isc.sans.edu
·
1d
🔄
ONNX
Power of Agent
assisted
coding and learning to
achieve
goals faster and cheaper
osm2pgsql.org
·
8h
·
Discuss:
DEV
🤖
AI Coding Tools
How to Cut Your AI API Costs by 60-90% With Smart Model
Routing
dev.to
·
22h
·
Discuss:
DEV
🤖
AI Coding Tools
Writing a
ONNX
Neural Network Inference Engine from Scratch in C to run image classification with
MobileNetV2
flexw.github.io
·
6d
·
Discuss:
r/C_Programming
🔄
ONNX
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
2d
📊
Gradient Accumulation
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help