Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🤖 LLMs
Specific
large language models, GPT, Claude, AI models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
1294
posts in
17.8
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🤖
AI
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Intelligent
inference
scheduling with
llm-d
on Red Hat
AI
⚙️
Systems Programming
developers.redhat.com
·
16h
16 hours ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
Cross-LLM
Consistency in
Inference
: Evidence from Shared Interactions
🔧
Compilers
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Cross-LLM Consistency in Inference: Evidence from Shared Interactions
Fine-tuning
Large
Language Models (LLMs) using PEFT
🤖
AI
Content type:
Blog
medium.com
·
14h
14 hours ago
Actions for Fine-tuning Large Language Models (LLMs) using PEFT
Report: GKE
Inference
Gateway delivers up to 92% faster
AI
responses
🤖
AI
Content type:
Blog
cloud.google.com
·
2d
2 days ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
How
LLMs
work | Practical Leaders
🤖
AI
practical-leaders.com
·
6d
6 days ago
·
Hacker News
Actions for How LLMs work | Practical Leaders
How to Build
Financial
Services
AI
Agents with
Claude
🤖
AI
Content type:
Blog
odsc.medium.com
·
17h
17 hours ago
Actions for How to Build Financial Services AI Agents with Claude
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic
engineering
🔧
Compilers
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
How Effective Are
LLM
Trading Agents?
🤖
AI
Content type:
News
Content type:
Blog
harbourfrontquant.substack.com
·
11h
11 hours ago
·
Substack
Actions for How Effective Are LLM Trading Agents?
RAG Pipeline Explained: From Query to Answer, Step by Step
🗄️
Databases
Content type:
Blog
medium.com
·
2d
2 days ago
Actions for RAG Pipeline Explained: From Query to Answer, Step by Step
Law Professors Prefer
AI
over Peer Answers
🤖
AI
Content type:
Academic
law.stanford.edu
·
4d
4 days ago
·
Hacker News
Actions for Law Professors Prefer AI over Peer Answers
How
Large
Language
Models
Are Creating New Security Challenges
🔧
Compilers
Content type:
Blog
medium.com
·
9h
9 hours ago
Actions for How Large Language Models Are Creating New Security Challenges
Why
LLMs
(still) lack taste
🤖
AI
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
AI
Evaluation: How to Test
LLM
Applications Properly
🤖
AI
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for AI Evaluation: How to Test LLM Applications Properly
LLM
Routing: From Strategy Selection to Production Architecture
🤖
AI
Content type:
Blog
blog.n8n.io
·
1d
1 day ago
Actions for LLM Routing: From Strategy Selection to Production Architecture
High Bandwidth Flash | A New Memory for
AI
Data Centers and Edge Computing | Sandisk
🤖
AI
ncnonline.net
·
2d
2 days ago
Actions for High Bandwidth Flash | A New Memory for AI Data Centers and Edge Computing | Sandisk
How
LLMs
Work?
🤖
AI
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for How LLMs Work?
Stop Wasting GPU Budget: Autoscaling
AI
Inference
on Kubernetes with KEDA
⚙️
Systems Programming
cloudnativenow.com
·
2d
2 days ago
Actions for Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA
How we fight GPU scarcity without compromise
💻
Computer Science
Content type:
Blog
equixly.com
·
6d
6 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
The
Inference
Alpha: Maximizing Frontier
Models
on AMD
🤖
AI
Content type:
Blog
digitalocean.com
·
1d
1 day ago
Actions for The Inference Alpha: Maximizing Frontier Models on AMD
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help