Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI Engineering
🤖 AI Engineering
AI systems, LLM apps, AI pipelines, model deployment
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
233
posts in
7.4
ms
Breaking the Ice: Analyzing Cold Start
Latency
in
vLLM
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Breaking the Ice: Analyzing Cold Start Latency in vLLM
Philosophy
✍️
Prompt Engineering
Content type:
Reference
docs.langchain.com
·
4d
4 days ago
Actions for Philosophy
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🧠
LLMs
zozo123.github.io
·
1h
1 hour ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
ashp15205/guardian-runtime: A
zero-latency
, local-first runtime firewall for
LLMs
. Intercept every prompt and response locally to stop
data
leaks and runaway token costs.
✍️
Prompt Engineering
Content type:
Code
github.com
·
19h
19 hours ago
·
Hacker News
Actions for ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
LangChain
Explained: Understanding
Models
, Prompts, Chains, Memory, Indexes, and Agents
📚
RAG
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents
New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
🗄️
Vector Databases
Content type:
Discussion
news.ycombinator.com
·
18h
18 hours ago
·
Hacker News
Actions for New comment by yorktanaka2024 in "Ask HN: Who wants to be hired? (June 2026)"
New comment by jasonlayton4323 in "Ask HN: Who wants to be hired? (June 2026)"
✍️
Prompt Engineering
drive.google.com
·
5d
5 days ago
·
Hacker News
Actions for New comment by jasonlayton4323 in "Ask HN: Who wants to be hired? (June 2026)"
Enterprises Are Quietly Moving Their
AI
Back On-Premises. Here Is Why.
🗄️
Vector Databases
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for Enterprises Are Quietly Moving Their AI Back On-Premises. Here Is Why.
Full Observability for Pinecone: Introducing an Open-Source Monitoring Stack for SaaS and BYOC
🗄️
Vector Databases
Content type:
Blog
pinecone.io
·
23h
23 hours ago
Actions for Full Observability for Pinecone: Introducing an Open-Source Monitoring Stack for SaaS and BYOC
Agentic
AI
frameworks compared:
LangChain
, LangGraph, AutoGen
✍️
Prompt Engineering
Content type:
Blog
udacity.com
·
4d
4 days ago
Actions for Agentic AI frameworks compared: LangChain, LangGraph, AutoGen
How I benchmarked a 100% local
RAG
pipeline
to 9/9 (zero API keys)
📚
RAG
buy.polar.sh
·
1d
1 day ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🧠
LLMs
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic
Infrastructure
📊
Observability
devops.com
·
5d
5 days ago
Actions for The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure
Presentation: Beyond Prompting: Context
Engineering
and Memory Management for
AI
Systems
at Scale
✍️
Prompt Engineering
Content type:
News
infoq.com
·
28m
28 minutes ago
Actions for Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
2x GH200 for
LLM
inference
, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
🧠
LLMs
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
Speculators v0.5.0: DFlash support and online training
🧠
LLMs
developers.redhat.com
·
6d
6 days ago
Actions for Speculators v0.5.0: DFlash support and online training
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🧠
LLMs
huggingface.co
·
2d
2 days ago
·
r/LocalLLaMA
Actions for google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
Quiz: Embeddings and
Vector
Databases
With ChromaDB
🗄️
Vector Databases
realpython.com
·
1d
1 day ago
Actions for Quiz: Embeddings and Vector Databases With ChromaDB
LLM
Inference
Engineering
Room — Part 3: The Orchestration Layer
🧠
LLMs
Content type:
Blog
vimal-dwarampudi.medium.com
·
6d
6 days ago
Actions for LLM Inference Engineering Room — Part 3: The Orchestration Layer
Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
✍️
Prompt Engineering
Content type:
Blog
socket.dev
·
1d
1 day ago
·
Hacker News
Actions for Mini Shai-Hulud, Miasma, and Hades Worms Target Bioinformatics and MCP Developers via Malicious PyPI Wheels
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help