AI Engineering
LLM, RAG, AI systems, prompt engineering, inference
Scoured 621 posts in 11.8 ms
How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
☕
Java
buy.polar.sh
·
1d
1 day ago
·
DEV
ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
✍️
Prompt Engineering
Content type:
Code
github.com
·
20h
20 hours ago
·
Hacker News
Speculators v0.5.0: DFlash support and online training
🧠
Machine Learning
developers.redhat.com
·
6d
6 days ago
Azure OpenAI Architecture: The Decisions That Actually Matter (Part 1)
🔍
RAG
techcommunity.microsoft.com
·
2d
2 days ago
A system programmer’s guide to LLM inference
🧠
LLMs
Content type:
Blog
blog.xiangpeng.systems
·
2d
2 days ago
·
Hacker News
Agentic AI frameworks compared: LangChain, LangGraph, AutoGen
🔍
RAG
Content type:
Blog
udacity.com
·
4d
4 days ago
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference
🧠
LLMs
Content type:
Academic
arxiv.org
·
10h
10 hours ago
Report: GKE Inference Gateway delivers up to 92% faster AI responses
✍️
Prompt Engineering
Content type:
Blog
cloud.google.com
·
1d
1 day ago
·
Hacker News
LLM Inference Engineering Room — Part 3: The Orchestration Layer
🧠
LLMs
Content type:
Blog
vimal-dwarampudi.medium.com
·
6d
6 days ago
Location: Arlington Heights, IL, USA (Chicago Area) Remote: Yes Willing to reloc...
🔍
RAG
Content type:
Discussion
news.ycombinator.com
·
19h
19 hours ago
·
Hacker News
The AI Agents Stack (2026 Edition)
🔍
RAG
Content type:
Blog
oreilly.com
·
2d
2 days ago
Hashtag Jakarta EE #336
🔍
RAG
agilejava.eu
·
3d
3 days ago
New comment by Revanthkodati in "Ask HN: Who wants to be hired? (June 2026)"
🧠
Machine Learning
drive.google.com
·
1d
1 day ago
·
Hacker News
146th airhacks tv: Rust, Java 25, AI Agents, BCE, Web Components, zunit, zb
☕
Java
Content type:
Blog
adambien.blog
·
10h
10 hours ago
If Claude Fable stops helping you, you'll never know
🧠
Machine Learning
Content type:
Blog
jonready.com
·
16h
16 hours ago
·
Lobsters, Hacker News
Show HN: Ext-Infer
🔍
RAG
infer.displace.tech
·
3d
3 days ago
·
Hacker News
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
🐹
Golang
huggingface.co
·
2d
2 days ago
·
r/LocalLLaMA
Built my first proper agentic AI project
🔍
RAG
Content type:
Code
github.com
·
4h
4 hours ago
·
DEV
ICYMI: Inside the Microsoft Agent Framework: How we designed a layered SDK
🔍
RAG
Content type:
Blog
devblogs.microsoft.com
·
21h
21 hours ago
MongoDB as a Vector Database for AI Agents-MongoDB
🔍
RAG
foojay.io
·
6d
6 days ago