Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language model, LLM, foundation model, transformer
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
632
posts in
22.0
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🔌
Embedded Systems
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Ollama
0.30 GPU Boost: Faster local Qwen inference on NVIDIA
✨
Neural Radiance Fields
everylocalai.com
·
1d
1 day ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
Should
LLM
Agents Decide in Social Simulations? Comparing Finite-State and
LLM-Based
Decision Policies
🛡️
AI Safety
Content type:
Academic
arxiv.org
·
20h
20 hours ago
Actions for Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies
Introducing
LLM
as a Judge: Scaling search relevance evaluation with
AI
👁️
Computer Vision
Content type:
Blog
opensearch.org
·
2h
2 hours ago
Actions for Introducing LLM as a Judge: Scaling search relevance evaluation with AI
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic engineering
🌐
AGI
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
How
LLMs
are Actually Trained
🛡️
AI Safety
Content type:
News
Content type:
Blog
blog.algomaster.io
·
19h
19 hours ago
Actions for How LLMs are Actually Trained
local
llm
on laptop 780M GPU using
llama
+ gemma 4 qat
🔌
Embedded Systems
Content type:
Blog
alper.bearblog.dev
·
5d
5 days ago
Actions for local llm on laptop 780M GPU using llama + gemma 4 qat
Orchestrate your
LLM
pipeline. Locally
🧠
AI Research
llmforge.app
·
8h
8 hours ago
·
Hacker News
Actions for Orchestrate your LLM pipeline. Locally
What
Ollama
Reveals About Local
AI
, Agents, and Open
Models
🛡️
AI Safety
Content type:
Blog
odsc.medium.com
·
1d
1 day ago
Actions for What Ollama Reveals About Local AI, Agents, and Open Models
How ChatGPT Actually Works (Beginner Friendly)
🏳️🌈
LGBT Tech
Content type:
Blog
medium.com
·
48m
48 minutes ago
Actions for How ChatGPT Actually Works (Beginner Friendly)
Two old GPUs I salvaged are doing more
AI
work than a brand new $2000 card, and I won't be upgrading anytime soon
🔓
Open Source
xda-developers.com
·
8h
8 hours ago
Actions for Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
Intelligent inference scheduling with
llm-d
on Red Hat
AI
🔌
Embedded Systems
developers.redhat.com
·
1d
1 day ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
WWDC 2026:
Foundation
Models
(& Anarlog)
🏳️🌈
LGBT Tech
skushagra.com
·
3d
3 days ago
Actions for WWDC 2026: Foundation Models (& Anarlog)
6. Air-Gapped Claude Code - The Claude Code SRE Handbook
🔌
Embedded Systems
har-ki.github.io
·
8h
8 hours ago
·
Hacker News
Actions for 6. Air-Gapped Claude Code - The Claude Code SRE Handbook
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
🔌
Embedded Systems
Content type:
Blog
adambien.blog
·
2d
2 days ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
How we fight GPU scarcity without compromise
🔌
Embedded Systems
Content type:
Blog
equixly.com
·
6d
6 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Ask HN: Any Local
LLM
can I run without GPU for Local Agentic workflow
AI
?
✨
Neural Radiance Fields
Content type:
Discussion
news.ycombinator.com
·
17h
17 hours ago
·
Hacker News
Actions for Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
Generative
AI
in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
🌐
AGI
Content type:
Audio
oreilly.com
·
1d
1 day ago
Actions for Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
Why Your
LLM
Gets Dumber With More Context
🛡️
AI Safety
siliconopera.com
·
9h
9 hours ago
Actions for Why Your LLM Gets Dumber With More Context
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
🔌
Embedded Systems
Content type:
Blog
bric.pe.kr
·
3d
3 days ago
·
DEV
Actions for MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help