Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🤖 LLMs
Specific
large language models, GPT, Claude, AI models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
1306
posts in
5.5
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🤖
AI
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Intelligent
inference
scheduling with
llm-d
on Red Hat
AI
⚙️
Systems Programming
developers.redhat.com
·
19h
19 hours ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
Cross-LLM
Consistency in
Inference
: Evidence from Shared Interactions
🔧
Compilers
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Cross-LLM Consistency in Inference: Evidence from Shared Interactions
AI
chatbots mimic fear, sadness and stress, then calm down after mindfulness exercise
🤖
AI
medicalxpress.com
·
2h
2 hours ago
Actions for AI chatbots mimic fear, sadness and stress, then calm down after mindfulness exercise
Fine-tuning
Large
Language Models (LLMs) using PEFT
🤖
AI
Content type:
Blog
medium.com
·
17h
17 hours ago
Actions for Fine-tuning Large Language Models (LLMs) using PEFT
Report: GKE
Inference
Gateway delivers up to 92% faster
AI
responses
🤖
AI
Content type:
Blog
cloud.google.com
·
2d
2 days ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
How
LLMs
work | Practical Leaders
🤖
AI
practical-leaders.com
·
6d
6 days ago
·
Hacker News
Actions for How LLMs work | Practical Leaders
How to Build
Financial
Services
AI
Agents with
Claude
🤖
AI
Content type:
Blog
odsc.medium.com
·
20h
20 hours ago
Actions for How to Build Financial Services AI Agents with Claude
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic
engineering
🔧
Compilers
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
Orchestrate your
LLM
pipeline. Locally
🤖
AI
llmforge.app
·
2h
2 hours ago
·
Hacker News
Actions for Orchestrate your LLM pipeline. Locally
How Effective Are
LLM
Trading Agents?
🤖
AI
Content type:
News
Content type:
Blog
harbourfrontquant.substack.com
·
14h
14 hours ago
·
Substack
Actions for How Effective Are LLM Trading Agents?
Why
LLMs
(still) lack taste
🤖
AI
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Law Professors Prefer
AI
over Peer Answers
🤖
AI
Content type:
Academic
law.stanford.edu
·
5d
5 days ago
·
Hacker News
Actions for Law Professors Prefer AI over Peer Answers
How
Large
Language
Models
Are Creating New Security Challenges
🔧
Compilers
Content type:
Blog
medium.com
·
12h
12 hours ago
Actions for How Large Language Models Are Creating New Security Challenges
LLM
Routing: From Strategy Selection to Production Architecture
🤖
AI
Content type:
Blog
blog.n8n.io
·
1d
1 day ago
Actions for LLM Routing: From Strategy Selection to Production Architecture
AI
Evaluation: How to Test
LLM
Applications Properly
🤖
AI
Content type:
Blog
medium.com
·
6d
6 days ago
Actions for AI Evaluation: How to Test LLM Applications Properly
High Bandwidth Flash | A New Memory for
AI
Data Centers and Edge Computing | Sandisk
🤖
AI
ncnonline.net
·
2d
2 days ago
Actions for High Bandwidth Flash | A New Memory for AI Data Centers and Edge Computing | Sandisk
RAG Pipeline Explained: From Query to Answer, Step by Step
🗄️
Databases
Content type:
Blog
medium.com
·
3d
3 days ago
Actions for RAG Pipeline Explained: From Query to Answer, Step by Step
Stop Wasting GPU Budget: Autoscaling
AI
Inference
on Kubernetes with KEDA
⚙️
Systems Programming
cloudnativenow.com
·
2d
2 days ago
Actions for Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA
The
Inference
Alpha: Maximizing Frontier
Models
on AMD
🤖
AI
Content type:
Blog
digitalocean.com
·
1d
1 day ago
Actions for The Inference Alpha: Maximizing Frontier Models on AMD
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help