Scour
LLMOps
Scoured 266 posts in 15.6 ms
🚀 MLOps · medium.com · 2 days ago
How Much Does It Actually Cost to Run a Local LLM? (€ per Million Tokens, Measured)
Discussed on DEV
🚀 MLOps · GitHub · 2 hours ago
For users with 4x-8x 6000 PROs, how is your experience with bigger models lately? (GLM 5.2, Kimi 2.7, DeepSeek V4 Pro)
Discussed on r/LocalLLaMA
🛠 Building Indie Products · tronbrowser.dev · 4 hours ago
TronBrowser is an open-source, privacy-first, AI-native web browser
Discussed on Hacker News
🚀 MLOps · abhishek.it · 6 days ago
Running GLM-5.2 5x faster at 500tps with limitation
Discussed on Hacker News
🦀 Rust · Malware Analysis, News and Indicators · 2 days ago
Malware analysis: part 9. AI-assisted deobfuscation: control flow flattening. Simple C example
Covers 2 stories, including Ollama
🪂 Forward Deploy Engineers · cobusgreyling.medium.com · 6 days ago
Fleet Engineering
🤖 AI · NVIDIA Technical Blog · 1 day ago
Boost Inference Performance up to 15x on NVIDIA Blackwell Using DFlash Speculative Decoding
Covers 3 stories, including NVIDIA/TensorRT-LLM
🤖 AI · fitservers.com · 5 days ago
The Complete Guide to Deploying DeepSeek R1 on a Dedicated Server
🤖 AI · YouTube · Content type: Video · 6 days ago
Token Injection: Crashing LLM Inference With Special Tokens
🚀 MLOps · arXiv · 2 days ago
CELEUS: Certifiable and Efficient LLM Evaluation via E-Processes
🦀 Rust · medium.com · 6 days ago
LocalForge: Building an On-Device AI Security Gateway for Git Commits
🚀 MLOps · zafran.io · 22 hours ago
DifyTap: Zafran discovers how attackers can silently wiretap AI data across tenants on a platform powering 1M+ apps
Covered by 4 sources, including SecurityWeek, The Hacker News
🚀 MLOps · medium.com · 2 days ago
How I Design AI Projects Using LangChain, LangSmith, and LangServe
🤖 AI · GitHub · 14 hours ago
fix(ollama): bound model-discovery JSON response reads (#96027)
🚀 MLOps · Red Hat Developer · 3 days ago
Designing distributed AI inference: Core concepts and scaling dimensions
🤖 AI · Hugging Face · 1 day ago
Qwen-AgentWorld-35B-A3B: a 3B-active MoE trained to simulate MCP, terminal, SWE, Android, web and OS environments
Covers 2 stories, including vllm-project/vllm
Covered by 3 sources, including GitHub, indiehacker.news
Discussed on r/LocalLLaMA
🛠 Building Indie Products · pipevoice.app · Content type: Video · 19 hours ago
PipeVoice: The Free local alternative to wispr flow
Discussed on Hacker News
🚀 MLOps · medium.com · 2 days ago
Build Your Own Private ChatGPT: Local RAG App (Part 1 — Ollama Setup)
🚀 MLOps · Cocoanetics · 3 days ago
Responses Bug in LM Studio
🦀 Rust · medium.com · 1 day ago
Fighting the Amnesia Tax: The Hidden Cost of Open-Weight LLM Serving