Large Language Models (LLMs)
Scoured 718 posts in 9.4 ms
Markov Chains: The Grandparents of LLMs
✨ Model optimizations in LLMs
dmanco.dev · 2 days ago · Hacker News
Show HN: In-browser real LLM token counter and cost estimation
💬 Prompt optimizations for LLM serving
holaclaw.ai · 12 hours ago · Hacker News
Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
🤖 Agents using LLMs
Content type: Discussion
news.ycombinator.com · 21 hours ago · Hacker News
NVIDIA A100 vs RTX 4090 for AI Workloads: The Cost Per FLOP Reality
⚙️ AI Infrastructure Automation
Content type: Blog
fitservers.com · 3 days ago
Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
🔍 Retrieval-augmented generation
Content type: Audio
oreilly.com · 1 day ago
Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit
🔍 Retrieval-augmented generation
venturebeat.com · 10 hours ago
Google open-sources speedy DiffusionGemma text diffusion model
🔍 Retrieval-augmented generation
siliconangle.com · 1 day ago
LLM are universal simulators
✨ Model optimizations in LLMs
invertedpassion.com · 3 days ago · Hacker News
Google's new open-weights model brings image-generation tricks to AI text generation
📊 AI Performance Profiling
Content type: News
theregister.com · 9 hours ago
147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
🔢 Quantization of LLMs
Content type: Blog
adambien.blog · 2 days ago
local llm on laptop 780M GPU using llama + gemma 4 qat
🔢 Quantization of LLMs
Content type: Blog
alper.bearblog.dev · 5 days ago
New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
🤖 Agents using LLMs
Content type: Discussion
news.ycombinator.com · 17 hours ago · Hacker News
Don't let the LLM speak, just probe it (8 minute read)
💬 Prompt optimizations for LLM serving
Content type: Blog
blog.j11y.io · 1 day ago · Hacker News
How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?
💬 Prompt optimizations for LLM serving
Content type: Blog
medium.com · 4 days ago
AI context windows: Why context quality beats context size
🔍 Retrieval-augmented generation
Content type: Blog
redis.io · 1 day ago
[NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!
🔧 Systems-level optimizations for LLM serving
huggingface.co · 15 hours ago · r/LocalLLaMA
langchain-ai/langchain langchain-core==1.4.6
🔍 Retrieval-augmented generation
Content type: Code
github.com · 20 hours ago
Report: GKE Inference Gateway delivers up to 92% faster AI responses
🔧 Systems-level optimizations for LLM serving
Content type: Blog
cloud.google.com · 3 days ago · Hacker News
Tokenminning: Because Tokenmaxxing Is a Bad Idea
💬 Prompt optimizations for LLM serving
tokenminning.com · 2 days ago · Hacker News
AI The Truly Environmentally Friendly Way
⚙️ AI Infrastructure Automation
hackaday.com · 20 hours ago