Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🧠 LLMs
Specific
large language models, GPT, Claude, Gemini, foundation models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
525
posts in
8.3
ms
Embeddings in
Generative
AI
: The Hidden Technology That Makes
AI
Actually Useful
🔍
Information Retrieval
Content type:
Blog
medium.com
·
13h
13 hours ago
Actions for Embeddings in Generative AI: The Hidden Technology That Makes AI Actually Useful
How to Build an Agentic
RAG
with RubyLLM and Rails
🔍
Information Retrieval
Content type:
Blog
panasiti.me
·
1d
1 day ago
·
Hacker News
Actions for How to Build an Agentic RAG with RubyLLM and Rails
Unlocking dependable responses with
Gemini
Enterprise Agent Platform’s Agentic
RAG
🪟
Context Windows
Content type:
Blog
research.google
·
6d
6 days ago
Actions for Unlocking dependable responses with Gemini Enterprise Agent Platform’s Agentic RAG
Why Your
LLM
Gets Dumber With More Context
🪟
Context Windows
siliconopera.com
·
8h
8 hours ago
Actions for Why Your LLM Gets Dumber With More Context
massimo92/spark: CLI tool for serving
LLMs
with
vLLM
on NVIDIA DGX Spark. One file, zero friction.
🧠
LLM Inference
Content type:
Code
github.com
·
4h
4 hours ago
·
Hacker News
Actions for massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🧠
LLM Inference
zozo123.github.io
·
1d
1 day ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Energy-Efficient On-Device
RAG
on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite
🪟
Context Windows
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite
An autopsy of
Claude
Code's deep research
🤖
AI Agents
nibzard.com
·
3d
3 days ago
Actions for An autopsy of Claude Code's deep research
Location: Arlington Heights, IL, USA (Chicago Area) Remote: Yes Willing to reloc...
🐍
Python
Content type:
Discussion
news.ycombinator.com
·
2d
2 days ago
·
Hacker News
Actions for Location: Arlington Heights, IL, USA (Chicago Area) Remote: Yes Willing to reloc...
CommBench: Can
LLMs
Write Correct and Efficient GPU Communication Code?
⚡
CUDA
uccl-project.github.io
·
16h
16 hours ago
·
Hacker News
Actions for CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
Introducing the Third
Generation
of Apple’s
Foundation
Models
🤖
Machine Learning
machinelearning.apple.com
·
3d
3 days ago
·
Hacker News
,
r/apple
Actions for Introducing the Third Generation of Apple’s Foundation Models
Why Most
RAG
Systems Slow Down After the First 3 Months
🪟
Context Windows
Content type:
Blog
blog.stackademic.com
·
19h
19 hours ago
Actions for Why Most RAG Systems Slow Down After the First 3 Months
Built and launched a research-reading and highlighting tool with
Claude
over a few months. Here are the things
AI
was surprisingly good (and bad) at.
🤖
LLM
highlyt.app
·
2d
2 days ago
·
r/ClaudeAI
Actions for Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.
Google's new open-weights
model
brings
image-generation
tricks to
AI
text
generation
🤖
Data science
Content type:
News
theregister.com
·
5h
5 hours ago
Actions for Google's new open-weights model brings image-generation tricks to AI text generation
New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"
🤖
Machine Learning
drive.google.com
·
6d
6 days ago
·
Hacker News
Actions for New comment by Ayaz_Saifi in "Ask HN: Who wants to be hired? (June 2026)"
#068 - Apple runs Siri on Google's
Gemini
, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps
🧠
LLM Inference
indiehacker.news
·
2d
2 days ago
Actions for #068 - Apple runs Siri on Google's Gemini, OpenAI files a secret IPO at $852B, Xiaomi clocks 1,000 tps
An
AI-Powered
Trisomy 21 Research Assistant
🪟
Context Windows
Content type:
Academic
biorxiv.org
·
6h
6 hours ago
Actions for An AI-Powered Trisomy 21 Research Assistant
You Probably Don’t Need a Vector Database - If Your Data Already Lives in BigQuery
🪟
Context Windows
Content type:
Blog
medium.com
·
4h
4 hours ago
Actions for You Probably Don’t Need a Vector Database - If Your Data Already Lives in BigQuery
Your
AI
agent reads the fine print: building a
RAG
pipeline over EU regulations with Elasticsearch and OGX
🔍
Information Retrieval
Content type:
Blog
elastic.co
·
2d
2 days ago
Actions for Your AI agent reads the fine print: building a RAG pipeline over EU regulations with Elasticsearch and OGX
Takeway from AWS
Generative
AI
Lens
🤖
AI Agents
Content type:
Reference
docs.aws.amazon.com
·
17h
17 hours ago
·
DEV
Actions for Takeway from AWS Generative AI Lens
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help