Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ Fast AI Inference
Cerebras, Groq, fast LLM tokens
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
221
posts in
25.4
ms
Llama.cpp
now has an official website:
llama.app
🤖
AI
llama.app
·
6d
·
Hacker News
Build Personal
AI
Agents on Windows PCs with New Tools from Microsoft and Nvidia
🤖
AI
Blog
developer.nvidia.com
·
2d
·
Hacker News
Dell's
AI
Surge, SpaceX Spending, & SoftBanks's New Play
🔬
AI Labs
briefing.forwardfuture.ai
·
3d
How to Run Gemma 4 12B Locally - The Best
AI
For Consumer Laptops
🤖
AI
Video
youtube.com
·
1d
Location: Oslo, Norway (CET) Remote: Yes (EU/US time zones) Willing to relocate:...
🦀
Rust
Discussion
news.ycombinator.com
·
3d
·
Hacker News
paralleliq/piqc: Kubernetes scanner that discovers LLMs running on
vLLM
and extracts their deployment and runtime facts.
🏗️
LLM Infrastructure
Code
github.com
·
2d
·
Hacker News
We have built the first of it's kind interactive blog for matching open-source LLMs to GPUs.
🤖
AI
Blog
agentswarms.fyi
·
2d
·
r/ChatGPT
,
r/OpenAI
Micron Powers
AI
Everywhere at COMPUTEX 2026
🤖
AI
cdrinfo.com
·
3d
GGUF vs MLX: A Decision Guide, Not Another Benchmark
🤖
AI
muhammadraza.me
·
2d
[AINews] not much happened today
🤖
AI
News
latent.space
·
4h
Multi-Lora-Continual-Learning
📅
Resource Scheduling
trajectory.ai
·
6d
·
Hacker News
Holo3.1:
Fast
& Local Computer Use Agents
🤖
AI
Blog
huggingface.co
·
2d
Lodestar: An Online-Learning
LLM
Inference
Router
🏗️
LLM Infrastructure
Academic
arxiv.org
·
3d
Show HN: We built an
LLM
inference
engine in pure Python
🏗️
LLM Infrastructure
Code
github.com
·
2d
·
Hacker News
A Sovereign Brain on a Laptop: Local
LLM
+ Pi Agent + Markdown
🏠
Self-Hosting
sovereignbrain.me
·
3d
Google makes Gemma 4 12B a local
AI
bet for startups
🆕
New AI
startupfortune.com
·
1d
Location: Göttingen, Germany Remote: Yes (preferred; hybrid also fine) Willing t...
🤖
AI
Discussion
news.ycombinator.com
·
1d
·
Hacker News
Nemotron 3 Ultra announced: high-speed, leading US open weights intelligence
🆕
New AI
artificialanalysis.ai
·
4d
·
Hacker News
Part 2 — Serve-Level Speed: System Design That Stabilizes P95/P99
🧠
LLM Inference
towardsai.net
·
1d
Experience with "nvidia/LocateAnything-3B"
🤖
AI
huggingface.co
·
6d
·
r/LocalLLaMA
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help