Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMOps
⚙ LLMOps
Specific
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
307
posts in
12.7
ms
🤖
AI
GitHub
·
4d
4 days ago
fix(
ollama
): preserve configured API during discovery (#93729)
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for fix(ollama): preserve configured API during discovery (#93729)
🤖
AI
fitservers.com
·
1d
1 day ago
The Complete Guide to Deploying DeepSeek R1 on a Dedicated Server
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Complete Guide to Deploying DeepSeek R1 on a Dedicated Server
Less-relevant results
🚀
MLOps
Cocoanetics
·
10h
10 hours ago
Responses Bug in LM Studio
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Responses Bug in LM Studio
🚀
MLOps
medium.com
·
2d
2 days ago
Don’t Use
Ollama
for Local
LLMs
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Don’t Use Ollama for Local LLMs
🚀
MLOps
pyimagesearch.com
·
6d
6 days ago
RAG
Observability
with Langfuse,
vLLM
, and FAISS
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for RAG Observability with Langfuse, vLLM, and FAISS
🚀
MLOps
medium.com
·
1d
1 day ago
vLLM
, Function Calling, and World
Models
explained
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for vLLM, Function Calling, and World Models explained
🦀
Rust
arxiv.org
·
5d
5 days ago
Tropical: Enhancing SLO Attainment in Disaggregated
LLM
Serving
via SLO-Aware Multiplexing
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tropical: Enhancing SLO Attainment in Disaggregated LLM Serving via SLO-Aware Multiplexing
🚀
MLOps
mstar.stanford.edu
·
2d
2 days ago
M* (M-Star): A Modular, Extensible,
Serving
System for Multimodal
Models
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for M* (M-Star): A Modular, Extensible, Serving System for Multimodal Models
🤖
AI
hackster.io
·
23h
23 hours ago
Offline AI Voice Assistant on Raspberry
Pi
4 with Gemma
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Offline AI Voice Assistant on Raspberry Pi 4 with Gemma
🐍
Python
Anyscale blog posts
·
3d
3 days ago
High Performance Distributed
Inference
with Ray
Serve
LLM
Covered by
Google Cloud Blog
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for High Performance Distributed Inference with Ray Serve LLM
🤖
AI
lemmy.world
·
1d
1 day ago
Wrote up a full guide for running AI locally on Windows (LM Studio +
Ollama
+ Open WebUI)
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Wrote up a full guide for running AI locally on Windows (LM Studio + Ollama + Open WebUI)
🚀
MLOps
nazarboyko.com
·
6d
6 days ago
Running Local
LLMs
With
Ollama
For Private Development
Discussed on
DEV
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Running Local LLMs With Ollama For Private Development
🚀
MLOps
abhishek.it
·
2d
2 days ago
Running GLM-5.2 5x faster at 500tps with limitation
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Running GLM-5.2 5x faster at 500tps with limitation
🤖
AI
Red Hat Developer Blog
·
6d
6 days ago
llama.cpp vs.
vLLM
: Choosing the right local
LLM
inference
engine
Covers
7 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for llama.cpp vs. vLLM: Choosing the right local LLM inference engine
🐍
Python
pypi.org
·
5d
5 days ago
Show HN: Subagent-fleet – AI coding subagents across local
Ollama
machines
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Subagent-fleet – AI coding subagents across local Ollama machines
🦀
Rust
langchain.com
·
5d
5 days ago
A self-improving agent loop (Sponsor)
Covered by
tldr.tech
,
Steve Sun
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for A self-improving agent loop (Sponsor)
🚀
MLOps
teachmecoolstuff.com
·
2d
2 days ago
Fine
Tuning
a Tiny Local
LLM
to Categorize Questions
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine Tuning a Tiny Local LLM to Categorize Questions
🚀
MLOps
vimal-dwarampudi.medium.com
·
2d
2 days ago
LLMOps
: Operationalizing Large Language
Models
in Production
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMOps: Operationalizing Large Language Models in Production
🐍
Python
youtube.com
Content type:
Video
·
6d
6 days ago
How to Build a High-Performance
RAG
Pipeline
with
Ollama
, Python and TypeScript
Discussed on
DEV
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How to Build a High-Performance RAG Pipeline with Ollama, Python and TypeScript
🤖
AI
GitHub
·
20h
20 hours ago
Show HN: Alloy – a PyTorch backend and
inference
engine for Apple Silicon
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report