Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI
🤖 AI
Broad
Claude, local llms
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
232
posts in
25.1
ms
🌐
web development
GitHub
·
6d
6 days ago
RimantasZ/contextspy: Context profiler for
LLMs
and
AI
agents - used to introspect context contents and reduce token costs
Covers
5 stories
See all stories this covers
including
FastAPI
Covered by
indiehacker.news
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for RimantasZ/contextspy: Context profiler for LLMs and AI agents - used to introspect context contents and reduce token costs
🥟
Bun
lector.dev
·
2d
2 days ago
Show HN: Evaluating
Local
LLMs
as
language
translators for my app
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Evaluating Local LLMs as language translators for my app
🥟
Bun
akarouter.dev
·
15h
15 hours ago
Flat per-call
LLM
API gateway (20x cheaper than
Claude
Max)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Flat per-call LLM API gateway (20x cheaper than Claude Max)
🔧
technical deep dives
portal.neuralwatt.com
·
15h
15 hours ago
Neuralwatt: Energy-based pricing for
AI
inference
. Efficient prompts cost less
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Neuralwatt: Energy-based pricing for AI inference. Efficient prompts cost less
🔧
technical deep dives
ianbarber.blog
·
2d
2 days ago
LLMs
Are Complicated Now
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMs Are Complicated Now
🥟
Bun
huggingface.co
·
3d
3 days ago
225B-A23B
Covered by
news.smol.ai
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for 225B-A23B
🌐
web development
ludion.ai
·
4h
4 hours ago
WebGPU feature detection was not enough to run small
LLMs
on phones
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for WebGPU feature detection was not enough to run small LLMs on phones
⚡
performance optimization
youtube.com
Content type:
Video
·
2d
2 days ago
Musician correctly predicts rise of
local
LLMs
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Musician correctly predicts rise of local LLMs
💻
personal programming project explainers
didon.app
Content type:
Video
·
20h
20 hours ago
Show HN: Didon –
AI
workday reports for productivity analysis
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Didon – AI workday reports for productivity analysis
🥟
Bun
Anyscale blog posts
·
3d
3 days ago
High Performance Distributed
Inference
with Ray Serve
LLM
Covered by
Google Cloud Blog
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for High Performance Distributed Inference with Ray Serve LLM
🥟
Bun
youtu.be
Content type:
Video
·
1d
1 day ago
Two Word docs talking to each other via
local
LLMs
— what real use cases would you actually want?
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Two Word docs talking to each other via local LLMs — what real use cases would you actually want?
🔧
technical deep dives
Electrek
·
16h
16 hours ago
Tesla plans to sell modular
AI
data center hardware called ‘Megapod’
Covered by
hardware.slashdot.org
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tesla plans to sell modular AI data center hardware called ‘Megapod’
💻
personal programming project explainers
fareedkhan-dev.github.io
·
1d
1 day ago
Train
LLM
from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🔧
technical deep dives
rocm.blogs.amd.com
·
5d
5 days ago
Unlocking Extreme AMD Instinct
Inference
with Software-Hardware Co-Optimization
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Unlocking Extreme AMD Instinct Inference with Software-Hardware Co-Optimization
⚡
performance optimization
GitHub
·
7h
7 hours ago
Show HN: Phileas –
Local-first
long-term memory for the
AI
you chat with
Covers
2 stories
See all stories this covers
including
Model Context Protocol And OAuth
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Phileas – Local-first long-term memory for the AI you chat with
🥟
Bun
unsloth.ai
·
2d
2 days ago
GLM-5.2 – How to Run
Locally
Covers
2 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by
news.smol.ai
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GLM-5.2 – How to Run Locally
🔧
technical deep dives
lesbarclays.substack.com
·
5d
5 days ago
What Is the Return on Tokens?
Covers
6 stories
See all stories this covers
including
Statement on the US government directive to suspend access to Fable 5 and Mythos 5
Discussed on
Substack
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for What Is the Return on Tokens?
💻
personal programming project explainers
speechmark.co
·
2d
2 days ago
On-device meeting notes for Mac (no bot, no cloud)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On-device meeting notes for Mac (no bot, no cloud)
🥟
Bun
konxios.com
·
2d
2 days ago
Show HN: Konxios a
local
first
AI
OS that connects LM Studio,
Ollama
and cloud
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Konxios a local first AI OS that connects LM Studio, Ollama and cloud
🌐
web development
brightray.ai
·
4d
4 days ago
Built Uber aggregator that tracks top
AI
researchers and leaders
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Built Uber aggregator that tracks top AI researchers and leaders
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report