Scour
🧠 LLMs
large language models, GPT, Claude, transformer, ChatGPT
Scoured 119 posts in 9.2 ms
harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
🤗 Open Source AI · Content type: Code · github.com · 4 days ago · Hacker News
A free diagnostic for the Claude Certified Architect exam
✍️ Prompt Engineering · Content type: Discussion, Tutorial · claudecertifiedarchitects.com · 1 day ago · Hacker News
Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
🤗 Open Source AI · zozo123.github.io · 10 hours ago · Hacker News
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
🟢 NVIDIA · Content type: Blog · blogs.nvidia.com · 4 hours ago
Why LLMs (still) lack taste
⚙️ DevOps · beyondtheprior.com · 1 day ago · Hacker News
How LLMs work | Practical Leaders
🧠 Transformers · practical-leaders.com · 6 days ago · Hacker News
our workplace LLM mass delusion
✍️ Prompt Engineering · Content type: Blog · blog.avas.space · 9 hours ago · Hacker News
Melanie Mitchell: What We Get Wrong About AI
✍️ Prompt Engineering · yalereview.org · 2 days ago · Substack, Hacker News
DiffusionGemma: 4x Faster Text Generation
🤗 Open Source AI · Content type: News, Blog · blog.google · 4 hours ago · Hacker News, r/LocalLLaMA, r/singularity
DiffusionGemma: The Developer Guide- Google Developers Blog
🤗 Open Source AI · Content type: Blog · developers.googleblog.com · 20 hours ago · r/LocalLLaMA
How we fight GPU scarcity without compromise
✍️ Prompt Engineering · Content type: Blog · equixly.com · 5 days ago · Hacker News
Why Do LLMs Corrupt Your Documents When You Delegate?
🔬 ML Research · kdnuggets.com · 2 days ago
Presentation: Beyond Prompting: Context Engineering and Memory Management for AI Systems at Scale
✍️ Prompt Engineering · Content type: News · infoq.com · 8 hours ago
Context Engineering Is Eating Prompt Engineering
✍️ Prompt Engineering · Content type: Blog · medium.com · 2 days ago
Why Claude Produces High-Quality Output: A Developer’s Guide to Token Efficiency and Hallucination…
✍️ Prompt Engineering · Content type: Blog · medium.com · 5 days ago
Machinic Psychopharmacology: Do LLMs Self-Medicate?
🤗 Open Source AI · lesswrong.com · 6 hours ago · Hacker News
Multimedia Building Blocks
🤗 Open Source AI · Content type: Blog · huggingface.co · 1 day ago
LLM Inference Engineering Room — Part 3: The Orchestration Layer
🤗 Open Source AI · Content type: Blog · vimal-dwarampudi.medium.com · 1 week ago
How to Become an AWS AI Architect, The Honest Roadmap, the Projects, and Landing the Job
☁️ Cloud Computing · hackernoon.com · 14 hours ago
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🤗 Open Source AI · Content type: News · newsletter.semianalysis.com · 1 day ago · Hacker News