Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🧠 LLMs
Specific
large language models, GPT, Claude, Gemini, foundation models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
157
posts in
5.3
ms
Philosophy
🤖
AI Agents
Content type:
Reference
docs.langchain.com
·
5d
5 days ago
Actions for Philosophy
Energy-Efficient On-Device
RAG
on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite
🪟
Context Windows
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite
ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for
LLMs
. Intercept every prompt and response locally to stop data leaks and runaway token costs.
🪟
Context Windows
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
,
Hacker News
Actions for ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
Apple WWDC On-Device
AI
Deep Dive - Google Docs
🤖
Data science
gist.is
·
22h
22 hours ago
·
Hacker News
Actions for Apple WWDC On-Device AI Deep Dive - Google Docs
Why
LLMs
(still) lack taste
🤖
LLM
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Introducing the Third
Generation
of Apple’s
Foundation
Models
🤖
Machine Learning
machinelearning.apple.com
·
3d
3 days ago
·
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple’s Foundation Models
CommBench: Can
LLMs
Write Correct and Efficient GPU Communication Code?
⚡
CUDA
uccl-project.github.io
·
13h
13 hours ago
·
Hacker News
Actions for CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
How to Build an Agentic
RAG
with RubyLLM and Rails
🔍
Information Retrieval
Content type:
Blog
panasiti.me
·
1d
1 day ago
·
Hacker News
Actions for How to Build an Agentic RAG with RubyLLM and Rails
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🧠
LLM Inference
zozo123.github.io
·
1d
1 day ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Building Agents Without Harness-Engineering
🤖
AI Agents
rajitkhanna.com
·
22h
22 hours ago
·
Hacker News
Actions for Building Agents Without Harness-Engineering
Show HN: Audit any
AI/data
pairing with Veritrooper
🪟
Context Windows
veritrooper.com
·
6d
6 days ago
·
Hacker News
Actions for Show HN: Audit any AI/data pairing with Veritrooper
Auto complete tickets using
Claude
Code loop on telegram with linear MCP
🤖
AI Agents
Content type:
Blog
niptao.com
·
1d
1 day ago
·
Hacker News
Actions for Auto complete tickets using Claude Code loop on telegram with linear MCP
How we fight GPU scarcity without compromise
🧠
LLM Inference
Content type:
Blog
equixly.com
·
6d
6 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🧠
LLM Inference
Content type:
News
newsletter.semianalysis.com
·
2d
2 days ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Fine-tuning Multi-modal
LLMs
with ART: Art-based Reinforcement Training
🎯
Fine-tuning
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
Research Proposal: Decoupled
RISC-LLM
Architectures
via Circadian Synaptic Consolidation
🪟
Context Windows
aermia.com
·
4d
4 days ago
·
Hacker News
Actions for Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
DiffusionGemma 26B A4B results on my 5090
🧠
LLM Inference
huggingface.co
·
1d
1 day ago
·
r/LocalLLaMA
Actions for DiffusionGemma 26B A4B results on my 5090
Show HN: Ext-Infer
🪟
Context Windows
infer.displace.tech
·
4d
4 days ago
·
Hacker News
Actions for Show HN: Ext-Infer
NetX-lab/Frontier: Frontier: A Discrete-Event Simulator for
Modern
LLM
Serving
🧠
LLM Inference
Content type:
Code
github.com
·
13h
13 hours ago
·
Hacker News
Actions for NetX-lab/Frontier: Frontier: A Discrete-Event Simulator for Modern LLM Serving
Less-relevant results
The Missing Link Between Agents and Applications
🤖
AI Agents
Content type:
Blog
langchain.com
·
1d
1 day ago
·
Hacker News
Actions for The Missing Link Between Agents and Applications
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help