Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
🤖 LLMs
Specific
large language models, GPT, Claude, AI models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
369
posts in
16.4
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🤖
AI
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
How to Run an
LLM
Locally: Ultimate Guide to Local
AI
2026
🤖
AI
Content type:
Blog
cswithsanjay.blogspot.com
·
18h
18 hours ago
Actions for How to Run an LLM Locally: Ultimate Guide to Local AI 2026
Fine-tuning
Multi-modal
LLMs
with ART: Art-based Reinforcement Training
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
From Chatbot Hallucinations to Deterministic Agents: Forcing Local
LLMs
to Run Production-Grade…
🤖
AI
Content type:
Blog
medium.com
·
9h
9 hours ago
Actions for From Chatbot Hallucinations to Deterministic Agents: Forcing Local LLMs to Run Production-Grade…
Intelligent
inference
scheduling with
llm-d
on Red Hat
AI
🤖
AI
developers.redhat.com
·
1d
1 day ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
A reporting checklist for
large
language
models
in behavioural science
🤖
AI
Content type:
Academic
nature.com
·
3d
3 days ago
Actions for A reporting checklist for large language models in behavioural science
WhatLLM.org: Compare
LLMs
by Benchmarks, Price & Speed
🤖
AI
Content type:
Discussion
Content type:
Reference
whatllm.org
·
16h
16 hours ago
Actions for WhatLLM.org: Compare LLMs by Benchmarks, Price & Speed
Introducing
LLM
as a Judge: Scaling search relevance evaluation with
AI
🤖
AI
Content type:
Blog
opensearch.org
·
1d
1 day ago
Actions for Introducing LLM as a Judge: Scaling search relevance evaluation with AI
DiffusionGemma: 4x Faster Text Generation
🤖
AI
Content type:
News
Content type:
Blog
19
sources covering this post
blog.google
·
2d
2 days ago
·
Hacker News
,
r/LocalLLaMA
,
r/singularity
·
Cited by 21 articles
Actions for DiffusionGemma: 4x Faster Text Generation
12B Gemma 4 QAT Deployment with NVIDIA L4, Cloud Run, MCP, and Antigravity CLI
🤖
AI
Content type:
Blog
medium.com
·
19h
19 hours ago
Actions for 12B Gemma 4 QAT Deployment with NVIDIA L4, Cloud Run, MCP, and Antigravity CLI
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic engineering
🤖
AI
Content type:
Academic
biorxiv.org
·
3d
3 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
How
LLMs
are Actually Trained
🤖
AI
Content type:
News
Content type:
Blog
blog.algomaster.io
·
1d
1 day ago
Actions for How LLMs are Actually Trained
Why
Transformer
Models
Get Costlier as
Context
Grows
🤖
AI
siliconopera.com
·
11h
11 hours ago
Actions for Why Transformer Models Get Costlier as Context Grows
Report: GKE
Inference
Gateway delivers up to 92% faster
AI
responses
🤖
AI
Content type:
Blog
cloud.google.com
·
3d
3 days ago
·
Hacker News
·
Cited by 1 article
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
Ollama
0.30 GPU Boost: Faster local Qwen
inference
on NVIDIA
🎮
Game Engines
everylocalai.com
·
2d
2 days ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
How ChatGPT Actually Works (Beginner Friendly)
🤖
AI
Content type:
Blog
medium.com
·
22h
22 hours ago
Actions for How ChatGPT Actually Works (Beginner Friendly)
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
📊
Formal Methods
zozo123.github.io
·
2d
2 days ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
What's in the Box? A Field Guide to
AI
Models
🤖
AI
Content type:
Blog
iankduncan.com
·
3d
3 days ago
Actions for What's in the Box? A Field Guide to AI Models
Run ChatGPT,
Claude
, Gemini and Perplexity Side-by-Side
🤖
AI
aiverdict.github.io
·
21h
21 hours ago
·
Hacker News
Actions for Run ChatGPT, Claude, Gemini and Perplexity Side-by-Side
6. Air-Gapped
Claude
Code - The
Claude
Code SRE Handbook
🤖
AI
har-ki.github.io
·
1d
1 day ago
·
Hacker News
Actions for 6. Air-Gapped Claude Code - The Claude Code SRE Handbook
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help