Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMOps
⚙️ LLMOps
LLM operations, model deployment, ML lifecycle, LLMOps
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
193
posts in
8.3
ms
How I benchmarked a 100% local
RAG
pipeline
to 9/9 (zero API keys)
💻
AI Engineering
buy.polar.sh
·
4d
4 days ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
MarkSentry – zero-trust document-to-Markdown for
RAG
pipelines
💻
AI Engineering
sunilgentyala.github.io
·
2d
2 days ago
·
Hacker News
Actions for MarkSentry – zero-trust document-to-Markdown for RAG pipelines
The Hidden Reasons Your
RAG
Pipeline
Stops Working at Scale
💻
AI Engineering
Content type:
Blog
medium.com
·
18h
18 hours ago
Actions for The Hidden Reasons Your RAG Pipeline Stops Working at Scale
langchain-ai/langchain
langchain-core
==1.4.6
🤖
AI Agents
Content type:
Code
github.com
·
1d
1 day ago
Actions for langchain-ai/langchain langchain-core==1.4.6
Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
💻
AI Engineering
Content type:
Blog
8
articles covering this post
socket.dev
·
4d
4 days ago
·
Hacker News
·
Cited by 8 articles
Actions for Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🌐
Open Source AI
phoronix.com
·
2d
2 days ago
·
r/artificial
·
Cited by 1 article
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
MiniPIC: Flexible Position-Independent Caching in <100LOC
🌐
Open Source AI
Content type:
Academic
arxiv.org
·
21h
21 hours ago
Actions for MiniPIC: Flexible Position-Independent Caching in <100LOC
Latest technical articles & videos.
💻
AI Engineering
certdepot.net
·
6d
6 days ago
Actions for Latest technical articles & videos.
Friday Five — June 12, 2026
🌐
Open Source AI
redhat.com
·
1d
1 day ago
Actions for Friday Five — June 12, 2026
NVIDIA A100 vs RTX 4090 for AI Workloads: The Cost Per FLOP Reality
🌐
Open Source AI
Content type:
Blog
fitservers.com
·
4d
4 days ago
Actions for NVIDIA A100 vs RTX 4090 for AI Workloads: The Cost Per FLOP Reality
vLLM
Transformers Backend: Bridging Hugging Face Compatibility and High-Performance Inference
🌐
Open Source AI
Content type:
Blog
odsc.medium.com
·
1d
1 day ago
Actions for vLLM Transformers Backend: Bridging Hugging Face Compatibility and High-Performance Inference
Mi50 32GB / GFX906 -
vLLM
Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
🌐
Open Source AI
huggingface.co
·
1d
1 day ago
·
r/LocalLLaMA
Actions for Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
BoxAgnts Tool System (1) — Design Motivation & Architecture Overview
🤖
AI Agents
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for BoxAgnts Tool System (1) — Design Motivation & Architecture Overview
Building Agents Without Harness-Engineering
🤖
AI Agents
rajitkhanna.com
·
2d
2 days ago
·
Hacker News
Actions for Building Agents Without Harness-Engineering
How to Run an
LLM
Locally: Ultimate Guide to Local AI 2026
🧠
LLMs
Content type:
Blog
cswithsanjay.blogspot.com
·
22h
22 hours ago
Actions for How to Run an LLM Locally: Ultimate Guide to Local AI 2026
New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
💻
AI Engineering
Content type:
Discussion
news.ycombinator.com
·
3d
3 days ago
·
Hacker News
Actions for New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
Intelligent inference scheduling with
llm-d
on Red Hat AI
💻
AI Engineering
developers.redhat.com
·
2d
2 days ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
I Processed 2.4 Billion Tokens Across 52 AI
Models
for $0.52. Here's the Full Breakdown.
🤖
AI Agents
saintlex.sbs
·
1d
1 day ago
·
DEV
Actions for I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.
RAG
Pipeline
Explained: From Query to Answer, Step by Step
💻
AI Engineering
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for RAG Pipeline Explained: From Query to Answer, Step by Step
Show HN: BeamWeaver –
LangChain/DeepAgents-style
agents and workflows for Elixir
🤖
AI Agents
Content type:
Code
github.com
·
13h
13 hours ago
·
Hacker News
·
Cited by 1 article
Actions for Show HN: BeamWeaver – LangChain/DeepAgents-style agents and workflows for Elixir
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help