Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Large Language Models (LLMs)
🧠 Large Language Models (LLMs)
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
718
posts in
7.5
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🔧
Systems-level optimizations for LLM serving
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
How
LLMs
are Actually Trained
✨
Model optimizations in LLMs
Content type:
News
Content type:
Blog
blog.algomaster.io
·
21h
21 hours ago
Actions for How LLMs are Actually Trained
The Neutral Mask: How
RLHF
Provides Shallow Alignment while Leaving Partisan Structure Intact in a
Large
Language
Model
✨
Model optimizations in LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
Making a Vintage
LLM
from Scratch
💬
Prompt optimizations for LLM serving
crlf.link
·
17h
17 hours ago
·
Hacker News
Actions for Making a Vintage LLM from Scratch
LangChain
vs
LlamaIndex
2026: Response Time on 10 RAG Tasks
🔍
Retrieval-augmented generation
Content type:
Blog
Content type:
Discussion
tildalice.io
·
1d
1 day ago
Actions for LangChain vs LlamaIndex 2026: Response Time on 10 RAG Tasks
How ChatGPT Actually Works (Beginner Friendly)
🤖
Agents using LLMs
Content type:
Blog
medium.com
·
2h
2 hours ago
Actions for How ChatGPT Actually Works (Beginner Friendly)
LangChain
Explained: Understanding
Models
, Prompts, Chains, Memory, Indexes, and Agents
🔍
Retrieval-augmented generation
Content type:
Blog
towardsai.net
·
3d
3 days ago
Actions for LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents
Orchestrate your
LLM
pipeline. Locally
✨
Model optimizations in LLMs
llmforge.app
·
9h
9 hours ago
·
Hacker News
Actions for Orchestrate your LLM pipeline. Locally
Context
windows
in AI: why every token is a budget decision
🔍
Retrieval-augmented generation
Content type:
Blog
redis.io
·
1d
1 day ago
Actions for Context windows in AI: why every token is a budget decision
Why Your
LLM
Gets Dumber With More
Context
🔍
Retrieval-augmented generation
siliconopera.com
·
11h
11 hours ago
Actions for Why Your LLM Gets Dumber With More Context
Philosophy
🔍
Retrieval-augmented generation
Content type:
Reference
docs.langchain.com
·
5d
5 days ago
Actions for Philosophy
lightmetal: GPU
LLM
Inference
From a Single Java 25 JAR
🔢
Quantization of LLMs
Content type:
Blog
adambien.blog
·
2d
2 days ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
✨
Model optimizations in LLMs
xda-developers.com
·
10h
10 hours ago
Actions for Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
LLM
Routing: From Strategy Selection to Production
Architecture
📊
AI Performance Profiling
Content type:
Blog
blog.n8n.io
·
1d
1 day ago
Actions for LLM Routing: From Strategy Selection to Production Architecture
DiffusionGemma: Discrete diffusion in a
large
language
model
🔧
Systems-level optimizations for LLM serving
idlemachines.co.uk
·
4h
4 hours ago
·
Hacker News
Actions for DiffusionGemma: Discrete diffusion in a large language model
Research Proposal: Decoupled
RISC-LLM
Architectures
via Circadian Synaptic Consolidation
🔍
Retrieval-augmented generation
aermia.com
·
5d
5 days ago
·
Hacker News
Actions for Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
Prompt Caching Explained: The AI Concept That Can Save Millions of Tokens
🔍
Retrieval-augmented generation
Content type:
Blog
sweta-nit.medium.com
·
17h
17 hours ago
Actions for Prompt Caching Explained: The AI Concept That Can Save Millions of Tokens
My Notes on the Progression from
Context
to Prompt to Harness engineering in making
GPT
LLMs
Useful: (TUESDAY) MAMLMs
🔍
Retrieval-augmented generation
Content type:
News
Content type:
Blog
braddelong.substack.com
·
2d
2 days ago
·
Substack
Actions for My Notes on the Progression from Context to Prompt to Harness engineering in making GPT LLMs Useful: (TUESDAY) MAMLMs
LLM
Cheat Sheet
🔍
Retrieval-augmented generation
Content type:
Blog
drkpxl.bearblog.dev
·
9h
9 hours ago
Actions for LLM Cheat Sheet
Why
LLMs
(still) lack taste
💬
Prompt optimizations for LLM serving
beyondtheprior.com
·
3d
3 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help