⚙️ MLOps
Specific: model deployment, ML pipelines, inference, model serving
Scoured 7143 posts in 15.4 ms
AI Observability for Large Language Model Systems: A Multi-Layer Analysis of Monitoring Approaches from Confidence Calibration to Infrastructure Tracing
🛡️ AI Safety · arxiv.org · 20h
DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
📱 Edge AI Optimization · lmsys.org · 5d · Hacker News
carlovalenti/TRiP: A complete transformer engine in C — inference, training, chat, vision.
✨ Gemini · github.com · 1d · Hacker News, r/C_Programming
Building Semantic Version Control in Rust
⚙️ Compilers · therohansharma.com · 5d · Hacker News
Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models
⚡ Edge AI · arxiv.org · 20h
How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5 397B on DigitalOcean NVIDIA HGX™ B300 GPU Droplets
📱 Edge AI Optimization · digitalocean.com · 2d
RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts
📱 Edge AI Optimization · arxiv.org · 20h
Rcarmo/gte-go: Golang inference for the GTE Small embedding model
🤖 LLM · github.com · 5d · Hacker News
Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models
🤖 LLM · arxiv.org · 20h
FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients
✨ LLMs · arxiv.org · 20h
PAINT: Partial-Solution Adaptive Interpolated Training for Self-Distilled Reasoners
⚗️ Knowledge Distillation · arxiv.org · 20h
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
✨ LLMs · arxiv.org · 1d
Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control
🛡️ AI Safety · arxiv.org · 20h
A Survey on Split Learning for LLM Fine-Tuning: Models, Systems, and Privacy Optimizations
✨ LLMs · arxiv.org · 2d
Efficient, VRAM-Constrained xLM Inference on Clients
📱 Edge AI Optimization · arxiv.org · 20h
Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study
📱 Edge AI Optimization · arxiv.org · 1d
LLM Psychosis: A Theoretical and Diagnostic Framework for Reality-Boundary Failures in Large Language Models
✨ LLMs · arxiv.org · 20h
Optimization of Model Splitting, Placement, and Chaining for Multi-hop Split Learning and Inference
📱 Edge AI Optimization · arxiv.org · 1d
LAF-Based Evaluation and UTTL-Based Learning Strategies with MIATTs
🧠 Machine Learning · arxiv.org · 6d
ClawGym: A Scalable Framework for Building Effective Claw Agents
🕹️ Agentic AI · arxiv.org · 20h