⚡ ONNX Runtime
Model Deployment, Cross-framework, Inference Engine, Optimization
Building a Production ML Inference Stack with KServe, vLLM, and Karmada
dev.to · 1d · Discuss: DEV
🚀 MLOps
Show HN: A Deployable Cross-Platform SIMD RNG Library for C++ (With Bnchmks)
news.ycombinator.com · 32m · Discuss: Hacker News
🔄 SIMD Programming
Computer Vision Agent
npmjs.com · 9h · Discuss: Hacker News
🧮 cuDNN
lightonai/next-plaid: NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
github.com · 16h
🔄 ONNX
Compiling High-Level Neural Network Specifications into VNN-LIB Queries
arxiv.org · 1d
🔄 ONNX
Tiny Recursion Models (TRM): How Tiny Networks With Recursion Beat Large Models on Hard Puzzles
pub.towardsai.net · 17h
📊 Gradient Accumulation
LLM Optimization: From Research to Production
dev.to · 6h · Discuss: DEV
🚀 MLOps
ExaBiome
exascaleproject.org · 1h
🔄 ONNX
AI Study Platforms
trendhunter.com · 8h
🤖 AI Coding Tools
borodark/exmc: Probabilistic programming in BEAM
github.com · 3d
🔄 ONNX
🎲 Fine-Tuning an AI
zwischenzugs.com · 6h
📜 TorchScript
Introducing Dedicated Container Inference: Delivering 2.6x faster inference for custom AI models
together.ai · 2d
🔄 ONNX
Large language model-enhanced home energy management with dynamic user preference elicitation and hierarchical data-sharing
sciencedirect.com · 14m
🔄 ONNX
SnowBall: Iterative Context Processing When It Won't Fit in the LLM Window
enji.ai · 15h · Discuss: Hacker News
💡 LSP
Show HN: PolyMCP – Orchestrate AI agents across Python tools and MCP servers
news.ycombinator.com · 1d · Discuss: Hacker News
🚀 MLOps
Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon
zenodo.org · 1d · Discuss: r/programming
✂️ CUTLASS
BalatroBench Benchmarks Large Language Models Playing Balatro
balatrobench.com · 1d · Discuss: Hacker News
🔄 ONNX
Show HN: Free financial calculators built and deployed by an AI agent
smallbiz-finance.surge.sh · 30m · Discuss: Hacker News
🤖 AI Coding Tools
Olmix: A framework for data mixing throughout LM development
allenai.org · 1d
🔄 ONNX
GPU-Serving Two-Tower Models for Lightweight Ads Engagement Prediction
medium.com · 21h
⚡ Flash Attention