Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚙️ MLOps
Specific
model deployment, ML pipelines, inference, model serving
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4892
posts in
10.8
ms
The case for Model-as-a-Service over
self-managed
inference
✨
LLMs
news.ycombinator.com
·
3d
·
Hacker News
Benchmarking
LLMs with
Marimo
Pair
🕳
LLM Vulnerabilities
ericmjl.github.io
·
15h
·
Hacker News
An
empirical
study of
LoRA-based
fine-tuning of large language models for automated test case generation
✨
LLMs
arxiv.org
·
1d
milanm/AutoGrad-Engine
: A complete GPT language model (training and inference) in ~600 lines of pure C#, zero dependencies
🤖
LLM
github.com
·
22h
·
Hacker News
Own your AI.
Optimized
down to the
kernel
⚡
Edge AI
runinfra.ai
·
2d
·
Hacker News
Inference
Arena
– new
benchmark
of local inference and training
📱
Edge AI Optimization
kvark.github.io
·
4d
·
Hacker News
Optimizing
our inference back end with custom load
balancing
🌍
Distributed Systems
photoroom.com
·
1d
·
Hacker News
benchmarking
inference
of popular models on consumer hardware
📱
Edge AI Optimization
inferena.tech
·
5d
·
Hacker News
EU's
Exposed
AI Infrastructure
🤖
AI
insecurestack.substack.com
·
2d
·
Substack
Show HN: Pre-training,
fine-tuning
, and
evals
platform
✨
Gemini
oumi.ai
·
6d
·
Hacker News
LLM
inference
engine from
scratch
in C++
✨
LLMs
anirudhsathiya.com
·
4d
·
Hacker News
How Meta Used AI to Map
Tribal
Knowledge in Large-Scale Data
Pipelines
🕵️
AI Agents
engineering.fb.com
·
3d
Peer-to-Peer
acceleration
for AI model distribution with
Dragonfly
🇨🇳
Chinese AI
cncf.io
·
4d
Unlocking
LoRA
Moe
RL for Qwen3.5
🤖
AI
osmosis.ai
·
6d
·
Hacker News
How we built a real-world
evaluation
platform for autonomous
SRE
agents at scale
🔧
Agent Tooling
datadoghq.com
·
3d
·
Hacker News
CAKE
: Cloud Architecture Knowledge
Evaluation
of Large Language Models
🤖
LLM
arxiv.org
·
2d
AgentOpt
v0.1 Technical Report:
Client-Side
Optimization for LLM-Based Agent
🔧
Agent Tooling
arxiv.org
·
1d
Scaling tool
orchestration
data will
emerge
different intelligence and LLMs
🕵️
AI Agents
news.ycombinator.com
·
6d
·
Hacker News
vLLM
introduces memory
optimizations
for long-context inference
🤖
LLM
github.com
·
5d
·
Hacker News
ALTO: Adaptive
LoRA
Tuning and
Orchestration
for Heterogeneous
LoRA
Training Workloads
📱
Edge AI Optimization
arxiv.org
·
2d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help