Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language models, GPT, foundation models, inference
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
623
posts in
8.1
ms
New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
🧠
Agentic AI
Content type:
Discussion
news.ycombinator.com
·
11h
11 hours ago
·
Hacker News
Actions for New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
Context
compression
finally
works in production: new research cuts
LLM
input 16x without the accuracy hit
🚀
MLOps
venturebeat.com
·
4h
4 hours ago
Actions for Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
Apple's
Foundation
Models
can now use third-party
LLMs
(Claude, Gemini) [video]
🚀
MLOps
developer.apple.com
·
3d
3 days ago
·
Hacker News
Actions for Apple's Foundation Models can now use third-party LLMs (Claude, Gemini) [video]
Token4Token — pay-per-token
inference
on Gnosis + Swarm
☁️
Cloud Infrastructure
t4t.eth.link
·
2d
2 days ago
·
Hacker News
Actions for Token4Token — pay-per-token inference on Gnosis + Swarm
Show HN: In-browser real
LLM
token counter and cost estimation
🚀
MLOps
holaclaw.ai
·
6h
6 hours ago
·
Hacker News
Actions for Show HN: In-browser real LLM token counter and cost estimation
LLM
Cheat Sheet
🤖
AI/ML
Content type:
Blog
drkpxl.bearblog.dev
·
4h
4 hours ago
Actions for LLM Cheat Sheet
Why
LLMs
(still) lack taste
☁️
Cloud Infrastructure
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🚀
MLOps
huggingface.co
·
3d
3 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
I Built a
RAG
System in 2025. The “
RAG
Is Dead” Posts Keep Telling Me to Delete It.
🔬
eBPF
ai.gopubby.com
·
19h
19 hours ago
Actions for I Built a RAG System in 2025. The “RAG Is Dead” Posts Keep Telling Me to Delete It.
Google's new open-weights
model
brings image-generation tricks to AI text generation
🤖
AI/ML
Content type:
News
theregister.com
·
3h
3 hours ago
Actions for Google's new open-weights model brings image-generation tricks to AI text generation
Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM
🖥️
Hypervisors
everylocalai.com
·
1d
1 day ago
·
DEV
Actions for Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM
Fine-tuning
Multi-modal
LLMs
with ART: Art-based Reinforcement Training
🤖
AI/ML
Content type:
Academic
arxiv.org
·
17h
17 hours ago
Actions for Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
What Are Tokens in
LLMs
?
🤖
AI/ML
Content type:
Blog
bearisland.dev
·
4d
4 days ago
·
Hacker News
Actions for What Are Tokens in LLMs?
Context
windows
in AI: why every token is a budget decision
🚀
MLOps
Content type:
Blog
redis.io
·
1d
1 day ago
Actions for Context windows in AI: why every token is a budget decision
Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes
🚀
MLOps
venturebeat.com
·
6h
6 hours ago
Actions for Google's DiffusionGemma generates 256 tokens in parallel and self-corrects as it goes
6. Air-Gapped Claude Code - The Claude Code SRE Handbook
☁️
Cloud Infrastructure
har-ki.github.io
·
5h
5 hours ago
·
Hacker News
Actions for 6. Air-Gapped Claude Code - The Claude Code SRE Handbook
fix(
ollama
): use provider thinking default in SDK session factory (#9… · openclaw/openclaw@4f3c2cd
🔬
eBPF
Content type:
Code
github.com
·
10h
10 hours ago
Actions for fix(ollama): use provider thinking default in SDK session factory (#9… · openclaw/openclaw@4f3c2cd
Fixing a stuck
Ollama
runner and building a GPU watchdog
🔬
eBPF
patrickmccanna.net
·
3d
3 days ago
·
Hacker News
Actions for Fixing a stuck Ollama runner and building a GPU watchdog
What's in the Box? A Field Guide to AI
Models
🤖
AI/ML
Content type:
Blog
iankduncan.com
·
2d
2 days ago
Actions for What's in the Box? A Field Guide to AI Models
Timing Trick Cuts Energy Used in
LLM
Training by Up to 14 Percent
🤖
AI/ML
Content type:
News
spectrum.ieee.org
·
1d
1 day ago
·
Hacker News
Actions for Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help