Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ ONNX Runtime
Specific
Model Deployment, Cross-framework, Inference Engine, Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
146517
posts in
10.9
ms
Show HN: Meta-agent: self-improving agent
harnesses
from live
traces
🤖
AI Coding Tools
github.com
·
2d
·
Hacker News
Beyond End-to-End:
Dynamic
Chain Optimization for Private LLM
Adaptation
on the Edge
🔗
NCCL
arxiv.org
·
15h
Building AI
Visibility
Infrastructure: The Technical Architecture Behind
Jonomor
🤖
AI Coding Tools
jonomor.com
·
3d
·
DEV
Fast Isn’t Fast Enough:
Redefining
Metrics
for Edge AI
🎯
Tensor Cores
semiengineering.com
·
12h
Collecting
diverse near-optimal samples via
nested
Thompson sampling
📊
Gradient Accumulation
nature.com
·
2d
Select-then-Solve:
Paradigm
Routing
as Inference-Time Optimization for LLM Agents
🔄
ONNX
arxiv.org
·
15h
MysticCodingCat/CUDA-Native-HUBO
: A GPU-native solver for 3-way combinatorial optimization (HUBO). Achieving
digital-annealer-level
performance on a single RTX 3060 Ti
⚡
CUDA Programming Patterns
github.com
·
14h
·
Hacker News
Apriel-Reasoner
: RL Post-Training for General-Purpose and Efficient Reasoning
🎓
Model Distillation
arxiv.org
·
6d
RoboPhD
: Evolving Diverse Complex Agents Under Tight Evaluation
Budgets
📜
TorchScript
arxiv.org
·
2d
Coz
: Causal
profiling
that measures optimization potential
📊
Profiling Tools
github.com
·
20h
·
Hacker News
SPRIG
:
Improving
Large Language Model Performance by System Prompt Optimization
💡
LSP
arxiv.org
·
2d
NED-Tree
: Bridging the Semantic Gap with Nonlinear Element
Decomposition
Tree for LLM Nonlinear Optimization Modeling
🔄
ONNX
arxiv.org
·
6d
DP-OPD
:
Differentially
Private On-Policy Distillation for Language Models
🎓
Model Distillation
arxiv.org
·
2d
AgentOpt
v0.1 Technical Report:
Client-Side
Optimization for LLM-Based Agent
🚀
MLOps
arxiv.org
·
15h
Joint
Optimization of Reasoning and Dual-Memory for Self-Learning
Diagnostic
Agent
🎓
Model Distillation
arxiv.org
·
15h
FlatAttention
: Dataflow and Fabric
Collectives
Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators
✂️
CUTLASS
arxiv.org
·
6d
Agentic Code Optimization via
Compiler-LLM
Cooperation
🚀
Compiler Optimization
arxiv.org
·
2d
Incentive-Aware
Multi-Fidelity
Optimization for Generative Advertising in Large Language Models
🏎️
TensorRT
arxiv.org
·
15h
A Family of Open Time-Series
Foundation
Models for the Radio Access Network
🏎️
TensorRT
arxiv.org
·
2d
Efficient and
Principled
Scientific Discovery through
Bayesian
Optimization: A Tutorial
🎓
Model Distillation
arxiv.org
·
6d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help