Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Large Language Models (LLMs)
🧠 Large Language Models (LLMs)
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
719
posts in
7.4
ms
Markov Chains: The Grandparents of
LLMs
✨
Model optimizations in LLMs
dmanco.dev
·
2d
2 days ago
·
Hacker News
Actions for Markov Chains: The Grandparents of LLMs
Show HN: In-browser real
LLM
token counter and cost estimation
💬
Prompt optimizations for LLM serving
holaclaw.ai
·
11h
11 hours ago
·
Hacker News
Actions for Show HN: In-browser real LLM token counter and cost estimation
Ask HN: Any Local
LLM
can I run without GPU for Local Agentic workflow AI?
🤖
Agents using LLMs
Content type:
Discussion
news.ycombinator.com
·
19h
19 hours ago
·
Hacker News
Actions for Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
LLM
are universal simulators
✨
Model optimizations in LLMs
invertedpassion.com
·
3d
3 days ago
·
Hacker News
Actions for LLM are universal simulators
Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
🔍
Retrieval-augmented generation
Content type:
Audio
oreilly.com
·
1d
1 day ago
Actions for Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
Context
compression
finally
works in production: new research cuts
LLM
input 16x without the accuracy hit
🔍
Retrieval-augmented generation
venturebeat.com
·
9h
9 hours ago
Actions for Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
Every
LLM
Tool Call Needs an Output Budget
🤖
Agents using LLMs
Content type:
Blog
axamy.com
·
5h
5 hours ago
·
Hacker News
Actions for Every LLM Tool Call Needs an Output Budget
Google open-sources speedy DiffusionGemma text diffusion
model
🔍
Retrieval-augmented generation
siliconangle.com
·
1d
1 day ago
Actions for Google open-sources speedy DiffusionGemma text diffusion model
local
llm
on laptop 780M GPU using
llama
+ gemma 4 qat
🔢
Quantization of LLMs
Content type:
Blog
alper.bearblog.dev
·
5d
5 days ago
Actions for local llm on laptop 780M GPU using llama + gemma 4 qat
Google's new open-weights
model
brings image-generation tricks to AI text generation
📊
AI Performance Profiling
Content type:
News
theregister.com
·
8h
8 hours ago
Actions for Google's new open-weights model brings image-generation tricks to AI text generation
147th airhacks tv: Local
LLMs
, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
🔢
Quantization of LLMs
Content type:
Blog
adambien.blog
·
1d
1 day ago
Actions for 147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
🤖
Agents using LLMs
Content type:
Discussion
news.ycombinator.com
·
16h
16 hours ago
·
Hacker News
Actions for New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an
LLM
?
💬
Prompt optimizations for LLM serving
Content type:
Blog
medium.com
·
4d
4 days ago
Actions for How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?
AI
context
windows
: Why
context
quality beats
context
size
🔍
Retrieval-augmented generation
Content type:
Blog
redis.io
·
1d
1 day ago
Actions for AI context windows: Why context quality beats context size
If
LLMs
are all persona, whose persona are they?
✨
Model optimizations in LLMs
persona.earthpilot.ai
·
23h
23 hours ago
·
Hacker News
Actions for If LLMs are all persona, whose persona are they?
Report: GKE
Inference
Gateway delivers up to 92% faster AI responses
🔧
Systems-level optimizations for LLM serving
Content type:
Blog
cloud.google.com
·
3d
3 days ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
Don't let the
LLM
speak, just probe it (8 minute read)
💬
Prompt optimizations for LLM serving
Content type:
Blog
blog.j11y.io
·
1d
1 day ago
·
Hacker News
Actions for Don't let the LLM speak, just probe it (8 minute read)
langchain-ai/langchain
langchain-core
==1.4.6
🔍
Retrieval-augmented generation
Content type:
Code
github.com
·
19h
19 hours ago
Actions for langchain-ai/langchain langchain-core==1.4.6
Tokenminning: Because Tokenmaxxing Is a Bad Idea
💬
Prompt optimizations for LLM serving
tokenminning.com
·
2d
2 days ago
·
Hacker News
Actions for Tokenminning: Because Tokenmaxxing Is a Bad Idea
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic engineering
✨
Model optimizations in LLMs
Content type:
Academic
biorxiv.org
·
3d
3 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help