Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language models, GPT, Claude, foundation models
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
207
posts in
22.5
ms
🔧
Hardware
ludion.ai
·
1d
1 day ago
WebGPU feature detection was not enough to run small
LLMs
on phones
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for WebGPU feature detection was not enough to run small LLMs on phones
🤖
AI
ianbarber.blog
·
3d
3 days ago
LLMs
Are Complicated Now
Covered by
tldr.tech
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMs Are Complicated Now
📚
LMS
aggressivelyparaphrasing.me
·
2d
2 days ago
Effective Use-Cases for
LLMs
Covers
Extinction-level capitalism
Discussed on
Hacker News
and
Lobsters
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Effective Use-Cases for LLMs
🔧
Hardware
auriko.ai
·
4d
4 days ago
Quantifying
LLM
Cost Savings from Cache-Aware
Inference
Routing
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Quantifying LLM Cost Savings from Cache-Aware Inference Routing
Less-relevant results
🎓
eLearning
Baseten
·
18h
18 hours ago
Baseten raised a $1.5B Series F and achieved a $13B valuation
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Baseten raised a $1.5B Series F and achieved a $13B valuation
🔧
Hardware
XDA
·
4d
4 days ago
I tested Google's new
Gemma
4 12B on my 8GB GPU, and now I don't want to go back to smaller
models
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for I tested Google's new Gemma 4 12B on my 8GB GPU, and now I don't want to go back to smaller models
📚
LMS
GitHub
·
14h
14 hours ago
Show HN: Loqi, a "local-first" translation tool using
Ollama/llama.cpp
Covers
Ollama
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp
🧠
Knowledge Management
alshe.substack.com
·
1d
1 day ago
I Canceled My French Tutor and Built an
LLM
Tool That Does It Better
Discussed on
Substack
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for I Canceled My French Tutor and Built an LLM Tool That Does It Better
🔧
Hardware
Anyscale blog posts
·
4d
4 days ago
High Performance Distributed
Inference
with Ray Serve
LLM
Covered by
Google Cloud Blog
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for High Performance Distributed Inference with Ray Serve LLM
🏫
AI in Education
nextweekai.com
·
4d
4 days ago
How to Build ChatGPT from Scratch: Understanding
LLMs
Step by Step
Covered by
JavaScript Development Space
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How to Build ChatGPT from Scratch: Understanding LLMs Step by Step
📚
LMS
marble.onl
·
1d
1 day ago
There is minimal downside to switching to open
models
Covered by
4 sources
See all sources covering this story
including
tldr.tech
,
daemonology.net
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for There is minimal downside to switching to open models
🧠
Agentic AI
aircityshops.com
·
13h
13 hours ago
Zero Weights Graph
Language
Engine (MSE-GLM)
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Zero Weights Graph Language Engine (MSE-GLM)
🧠
Agentic AI
av.codes blog
·
5d
5 days ago
On local
inference
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On local inference
🧠
Agentic AI
Fergus's blog
·
1d
1 day ago
Adaptive speculative decoding: picking draft lengths at runtime
Covers
4 stories
See all stories this covers
including
Looking for a self-hosted alternative to Modal.com for running ML workloads
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Adaptive speculative decoding: picking draft lengths at runtime
🧠
Agentic AI
Hacker News
·
1d
1 day ago
The AI Conundrum: We are living in highly subsidized, interesting times
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The AI Conundrum: We are living in highly subsidized, interesting times
📚
LMS
Hugging Face
·
6d
6 days ago
NovaVest/VN-Noxa-v1-7B-Beta-Low
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for NovaVest/VN-Noxa-v1-7B-Beta-Low
📚
LMS
wattfare.com
·
6d
6 days ago
LLM
API that's paid by users, not dev
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM API that's paid by users, not dev
🤖
AI
venturebeat.com
·
6d
6 days ago
Z.ai’s open-weights GLM-5.2 beats
GPT-5.5
on multiple long-horizon coding benchmarks for 1/6th the cost
Covers
8 stories
See all stories this covers
including
GLM-5.2 (6 minute read)
Covered by
4 sources
See all sources covering this story
including
vettedconsumer.com
,
AI Changes Everything
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
🏫
AI in Education
teachmecoolstuff.com
·
4d
4 days ago
Fine
Tuning
a Tiny Local
LLM
to Categorize Questions
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine Tuning a Tiny Local LLM to Categorize Questions
🤖
AI
GitHub
·
2d
2 days ago
Show HN: Alloy – a PyTorch backend and
inference
engine for Apple Silicon
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report