Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Engineering
🤖 AI Engineering
AI systems, LLM apps, AI pipelines, model deployment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
238
posts in
7.5
ms
Breaking the Ice: Analyzing Cold Start
Latency
in
vLLM
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Breaking the Ice: Analyzing Cold Start Latency in vLLM
Philosophy
✍️
Prompt Engineering
Content type:
Reference
docs.langchain.com
·
4d
4 days ago
Actions for Philosophy
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🧠
LLMs
zozo123.github.io
·
7h
7 hours ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
LangChain
Explained: Understanding
Models
, Prompts, Chains, Memory, Indexes, and Agents
📚
RAG
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local
AI
🔬
AI Research
Content type:
Blog
blogs.nvidia.com
·
1h
1 hour ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
Nvidia DGX Spark GB10 –
AI
Models
and Guide with
vLLM
and Autonomous Script
🧠
LLMs
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
A Fun & Absurd Introduction to
Vector
Databases
• Alexander Chatzizacharias
📚
RAG
Content type:
Video
youtu.be
·
5h
5 hours ago
·
r/programming
Actions for A Fun & Absurd Introduction to Vector Databases • Alexander Chatzizacharias
New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
🗄️
Vector Databases
Content type:
Discussion
news.ycombinator.com
·
1d
1 day ago
·
Hacker News
Actions for New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
a desktop GUI to browse, search, & visualize your
vector
databases
📚
RAG
vectorlens.dev
·
3h
3 hours ago
·
Hacker News
Actions for a desktop GUI to browse, search, & visualize your vector databases
New comment by jasonlayton4323 in "Ask HN: Who wants to be hired? (June 2026)"
✍️
Prompt Engineering
drive.google.com
·
5d
5 days ago
·
Hacker News
Actions for New comment by jasonlayton4323 in "Ask HN: Who wants to be hired? (June 2026)"
Enterprises Are Quietly Moving Their
AI
Back On-Premises. Here Is Why.
🗄️
Vector Databases
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Enterprises Are Quietly Moving Their AI Back On-Premises. Here Is Why.
DiffusionGemma: The Developer Guide- Google Developers Blog
🔬
AI Research
Content type:
Blog
developers.googleblog.com
·
18h
18 hours ago
·
r/LocalLLaMA
Actions for DiffusionGemma: The Developer Guide- Google Developers Blog
How I benchmarked a 100% local
RAG
pipeline
to 9/9 (zero API keys)
📚
RAG
buy.polar.sh
·
1d
1 day ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Agentic
AI
frameworks compared:
LangChain
, LangGraph, AutoGen
✍️
Prompt Engineering
Content type:
Blog
udacity.com
·
4d
4 days ago
Actions for Agentic AI frameworks compared: LangChain, LangGraph, AutoGen
MarkSentry – zero-trust document-to-Markdown for
RAG
pipelines
🔒
Zero Trust
sunilgentyala.github.io
·
1h
1 hour ago
·
Hacker News
Actions for MarkSentry – zero-trust document-to-Markdown for RAG pipelines
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🧠
LLMs
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
AMD's Lemonade SDK For Local
AI
Adds NVIDIA CUDA Support
🧠
LLMs
phoronix.com
·
1h
1 hour ago
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic
Infrastructure
📊
Observability
devops.com
·
5d
5 days ago
Actions for The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure
2x GH200 for
LLM
inference
, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
🧠
LLMs
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
Speculators v0.5.0: DFlash support and online training
🧠
LLMs
developers.redhat.com
·
6d
6 days ago
Actions for Speculators v0.5.0: DFlash support and online training
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help