Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMOps
⚙ LLMOps
Specific
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
264
posts in
13.3
ms
🚀
MLOps
teachmecoolstuff.com
·
6d
6 days ago
Fine
Tuning
a Tiny Local
LLM
to Categorize Questions
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine Tuning a Tiny Local LLM to Categorize Questions
🦀
Rust
medium.com
·
1d
1 day ago
Fighting the Amnesia Tax: The Hidden Cost of
Open-Weight
LLM
Serving
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fighting the Amnesia Tax: The Hidden Cost of Open-Weight LLM Serving
🤖
AI
GitHub
·
22h
22 hours ago
fix(
ollama
): bound
model-discovery
JSON response reads (#96027)
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for fix(ollama): bound model-discovery JSON response reads (#96027)
🚀
MLOps
buildaharness.com
·
23h
23 hours ago
open-source, modular harness layer for AI agents
Covers
Ollama
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for open-source, modular harness layer for AI agents
🚀
MLOps
vimal-dwarampudi.medium.com
·
6d
6 days ago
LLMOps
: Operationalizing Large Language
Models
in Production
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMOps: Operationalizing Large Language Models in Production
🚀
MLOps
primeintellect.ai
·
2d
2 days ago
RL at 1T Scale: prime-rl Performance Deep Dive
Covers
6 stories
See all stories this covers
including
Kimi K2.7-Code: open-source coding model with better token efficiency
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for RL at 1T Scale: prime-rl Performance Deep Dive
🤖
AI
Hackster.io
·
4d
4 days ago
Offline AI Voice Assistant on Raspberry
Pi
4 with Gemma
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Offline AI Voice Assistant on Raspberry Pi 4 with Gemma
🚀
MLOps
IBM Research
·
1d
1 day ago
Running AI on mixed hardware for speed and affordability
Covers
Introduction to llm-d Open-source Kubernetes-native Framework for Distributed LLM Inference | Ep 140 #cloudnativefm
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Running AI on mixed hardware for speed and affordability
🤖
AI
pi.dev
·
5d
5 days ago
Pi
0.79.9
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Pi 0.79.9
🦀
Rust
arXiv
·
1d
1 day ago
The Serialized Bridge: Understanding and Recovering
LLM
Serving
Performance under Blackwell GPU Confidential Computing
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Serialized Bridge: Understanding and Recovering LLM Serving Performance under Blackwell GPU Confidential Computing
🚀
MLOps
konxios.com
·
5d
5 days ago
Show HN: Konxios a local first AI OS that connects LM Studio,
Ollama
and cloud
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Konxios a local first AI OS that connects LM Studio, Ollama and cloud
🚀
MLOps
zafran.io
·
1d
1 day ago
DifyTap: Zafran discovers how attackers can silently wiretap AI data across tenants on a platform powering 1M+ apps
Covered by
4 sources
See all sources covering this story
including
SecurityWeek
,
The Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for DifyTap: Zafran discovers how attackers can silently wiretap AI data across tenants on a platform powering 1M+ apps
🐍
Python
GitHub
·
11h
11 hours ago
interviewstreet/hiring-agent
Covers
Ollama
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for interviewstreet/hiring-agent
🤖
AI
Hugging Face
·
1d
1 day ago
Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
Covers
2 stories
See all stories this covers
including
vllm-project/vllm
Covered by
3 sources
See all sources covering this story
including
GitHub
,
indiehacker.news
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
🚀
MLOps
robert-mcdermott.github.io
·
2d
2 days ago
Phlox: A full-featured AI platform you own
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Phlox: A full-featured AI platform you own
🚀
MLOps
medium.com
·
3d
3 days ago
Build Your Own Private ChatGPT: Local
RAG
App (Part 1 —
Ollama
Setup)
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Build Your Own Private ChatGPT: Local RAG App (Part 1 — Ollama Setup)
🚀
MLOps
medium.com
·
6d
6 days ago
Hands-On Guide to LangChain: Build an End‑to‑End
LLM
Pipeline
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Hands-On Guide to LangChain: Build an End‑to‑End LLM Pipeline
🛠
Building Indie Products
pipevoice.app
Content type:
Video
·
1d
1 day ago
PipeVoice: The Free local alternative to wispr flow
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for PipeVoice: The Free local alternative to wispr flow
🚀
MLOps
Hacker News
·
5d
5 days ago
Changes that cut our
LLM
pipeline
costs more than
model-switching
did
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Changes that cut our LLM pipeline costs more than model-switching did
🚀
MLOps
Red Hat Developer
·
2d
2 days ago
Connect EvalHub to protected production
model
servers
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Connect EvalHub to protected production model servers
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report