programming, security, AI, llms, science, finance
Scoured 2189 posts in 19.1 ms
harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
☕ Espresso · Content type: Code · github.com · 4 days ago · Hacker News
LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents
🔷 Go, typescript · Content type: Blog · towardsai.net · 2 days ago
LangChain vs LlamaIndex 2026: Response Time on 10 RAG Tasks
☕ Espresso · Content type: Blog, Discussion · tildalice.io · 7 hours ago
Dynamic ReACT Loop with Conductor
🔷 Go, typescript · conductor-oss.github.io · 3 hours ago · Hacker News
Report: GKE Inference Gateway delivers up to 92% faster AI responses
☕ Espresso · Content type: Blog · cloud.google.com · 1 day ago · Hacker News
147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
🔷 Go, typescript · Content type: Blog · adambien.blog · 19 hours ago
Philosophy
🔷 Go, typescript · Content type: Reference · docs.langchain.com · 4 days ago
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference
🔷 Go, typescript · Content type: Academic · arxiv.org · 18 hours ago
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
☕ Espresso · Content type: News · newsletter.semianalysis.com · 1 day ago · Hacker News
LLM Routing: From Strategy Selection to Production Architecture
🥓 Charcuterie · Content type: Blog · blog.n8n.io · 7 hours ago
Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
☕ Espresso · everylocalai.com · 2 hours ago · DEV
Why LLMs (still) lack taste
☕ Coffee Roasting · beyondtheprior.com · 1 day ago · Hacker News
Making Local LLM Go Brrr
☕ Espresso · seanpedersen.github.io · 6 days ago
Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
🔷 Go, typescript · zozo123.github.io · 12 hours ago · Hacker News
AI inference: what it is and why it matters for product managers
🔷 Go, typescript · marcabraham.com · 2 days ago
The Inference Alpha: Maximizing Frontier Models on AMD
☕ Espresso · Content type: Blog · digitalocean.com · 8 hours ago
Using Scikit-LLM with Open-Source LLMs
🔷 Go, typescript · machinelearningmastery.com · 6 days ago
A system programmer’s guide to LLM inference
☕ Coffee Roasting · Content type: Blog · blog.xiangpeng.systems · 2 days ago · Hacker News
DiffusionGemma: The Developer Guide- Google Developers Blog
☕ Espresso · Content type: Blog · developers.googleblog.com · 22 hours ago · r/LocalLLaMA
Apple WWDC On-Device AI Deep Dive - Google Docs
☕ Espresso · gist.is · 45 minutes ago · Hacker News