Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language model, LLM, foundation model, transformer
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
631
posts in
7.0
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🔌
Embedded Systems
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
Ollama
0.30 GPU Boost: Faster local Qwen inference on NVIDIA
✨
Neural Radiance Fields
everylocalai.com
·
1d
1 day ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
Should
LLM
Agents Decide in Social Simulations? Comparing Finite-State and
LLM-Based
Decision Policies
🛡️
AI Safety
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Should LLM Agents Decide in Social Simulations? Comparing Finite-State and LLM-Based Decision Policies
Orchestrate your
LLM
pipeline. Locally
🧠
AI Research
llmforge.app
·
5h
5 hours ago
·
Hacker News
Actions for Orchestrate your LLM pipeline. Locally
WWDC 2026:
Foundation
Models
(& Anarlog)
🏳️🌈
LGBT Tech
skushagra.com
·
2d
2 days ago
Actions for WWDC 2026: Foundation Models (& Anarlog)
What
Ollama
Reveals About Local
AI
, Agents, and Open
Models
🛡️
AI Safety
Content type:
Blog
odsc.medium.com
·
23h
23 hours ago
Actions for What Ollama Reveals About Local AI, Agents, and Open Models
Improved performance and
model
support with GGUF
🔌
Embedded Systems
Content type:
Blog
ollama.com
·
6d
6 days ago
Actions for Improved performance and model support with GGUF
Intelligent inference scheduling with
llm-d
on Red Hat
AI
🔌
Embedded Systems
developers.redhat.com
·
22h
22 hours ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
🔌
Embedded Systems
Content type:
Blog
bric.pe.kr
·
2d
2 days ago
·
DEV
Actions for MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
Generative
AI
in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
🌐
AGI
Content type:
Audio
oreilly.com
·
22h
22 hours ago
Actions for Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst
Comprehensive evaluation of
LLM
capabilities for interpretation and analysis of genome-scale metabolic
models
in metabolic engineering
🌐
AGI
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering
local
llm
on laptop 780M GPU using
llama
+ gemma 4 qat
🔌
Embedded Systems
Content type:
Blog
alper.bearblog.dev
·
5d
5 days ago
Actions for local llm on laptop 780M GPU using llama + gemma 4 qat
6. Air-Gapped Claude Code - The Claude Code SRE Handbook
🔌
Embedded Systems
har-ki.github.io
·
5h
5 hours ago
·
Hacker News
Actions for 6. Air-Gapped Claude Code - The Claude Code SRE Handbook
Two old GPUs I salvaged are doing more
AI
work than a brand new $2000 card, and I won't be upgrading anytime soon
🔓
Open Source
xda-developers.com
·
6h
6 hours ago
Actions for Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
Why
LLMs
(still) lack taste
🛡️
AI Safety
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Google's new open-weights
model
brings
image-generation
tricks to
AI
text
generation
✨
Neural Radiance Fields
Content type:
News
theregister.com
·
3h
3 hours ago
Actions for Google's new open-weights model brings image-generation tricks to AI text generation
How we fight GPU scarcity without compromise
🔌
Embedded Systems
Content type:
Blog
equixly.com
·
6d
6 days ago
·
Hacker News
Actions for How we fight GPU scarcity without compromise
Ask HN: Any Local
LLM
can I run without GPU for Local Agentic workflow
AI
?
✨
Neural Radiance Fields
Content type:
Discussion
news.ycombinator.com
·
15h
15 hours ago
·
Hacker News
Actions for Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
🔌
Embedded Systems
Content type:
Blog
adambien.blog
·
2d
2 days ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Why Your
LLM
Gets Dumber With More Context
🛡️
AI Safety
siliconopera.com
·
7h
7 hours ago
Actions for Why Your LLM Gets Dumber With More Context
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help