Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language models, GPT, Claude, foundation models
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
215
posts in
25.7
ms
🎓
eLearning
baseten.co
·
11h
11 hours ago
Baseten raised a $1.5B Series F and achieved a $13B valuation
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Baseten raised a $1.5B Series F and achieved a $13B valuation
📚
LMS
marble.onl
·
1d
1 day ago
There is minimal downside to switching to open
models
Covered by
tldr.tech
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for There is minimal downside to switching to open models
🤖
AI
ianbarber.blog
·
3d
3 days ago
LLMs
Are Complicated Now
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLMs Are Complicated Now
🤖
AI
lmsys.org
·
6d
6 days ago
DFlash and Spec V2 Decoding (14 minute read)
Covers
6 stories
See all stories this covers
including
Looking for a self-hosted alternative to Modal.com for running ML workloads
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for DFlash and Spec V2 Decoding (14 minute read)
🤖
AI
GitHub
·
2d
2 days ago
Show HN: Alloy – a PyTorch backend and
inference
engine for Apple Silicon
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
🧠
Agentic AI
moorcheh.ai
·
9h
9 hours ago
Information-Theoretic Vector Search Is Having Its Moment
Covered by
GitHub
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Information-Theoretic Vector Search Is Having Its Moment
🔧
Hardware
auriko.ai
·
4d
4 days ago
Quantifying
LLM
Cost Savings from Cache-Aware
Inference
Routing
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Quantifying LLM Cost Savings from Cache-Aware Inference Routing
🔧
Hardware
XDA
·
4d
4 days ago
I tested Google's new
Gemma
4 12B on my 8GB GPU, and now I don't want to go back to smaller
models
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for I tested Google's new Gemma 4 12B on my 8GB GPU, and now I don't want to go back to smaller models
🧠
Agentic AI
Hacker News
·
1d
1 day ago
The AI Conundrum: We are living in highly subsidized, interesting times
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The AI Conundrum: We are living in highly subsidized, interesting times
🔧
Hardware
Anyscale blog posts
·
4d
4 days ago
High Performance Distributed
Inference
with Ray Serve
LLM
Covered by
Google Cloud Blog
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for High Performance Distributed Inference with Ray Serve LLM
🧠
Agentic AI
aircityshops.com
·
6h
6 hours ago
Zero Weights Graph
Language
Engine (MSE-GLM)
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Zero Weights Graph Language Engine (MSE-GLM)
🏫
AI in Education
nextweekai.com
·
3d
3 days ago
How to Build ChatGPT from Scratch: Understanding
LLMs
Step by Step
Covered by
JavaScript Development Space
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How to Build ChatGPT from Scratch: Understanding LLMs Step by Step
🧠
Agentic AI
av.codes blog
·
4d
4 days ago
On local
inference
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for On local inference
📚
LMS
huggingface.co
·
6d
6 days ago
NovaVest/VN-Noxa-v1-7B-Beta-Low
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for NovaVest/VN-Noxa-v1-7B-Beta-Low
📚
LMS
wattfare.com
·
6d
6 days ago
LLM
API that's paid by users, not dev
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for LLM API that's paid by users, not dev
📚
LMS
GitHub
·
6h
6 hours ago
Show HN: Loqi, a "local-first" translation tool using
Ollama/llama.cpp
Covers
Ollama
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp
🤖
AI
venturebeat.com
·
6d
6 days ago
Z.ai’s open-weights GLM-5.2 beats
GPT-5.5
on multiple long-horizon coding benchmarks for 1/6th the cost
Covers
8 stories
See all stories this covers
including
GLM-5.2 (6 minute read)
Covered by
4 sources
See all sources covering this story
including
vettedconsumer.com
,
AI Changes Everything
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
🏫
AI in Education
teachmecoolstuff.com
·
4d
4 days ago
Fine
Tuning
a Tiny Local
LLM
to Categorize Questions
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine Tuning a Tiny Local LLM to Categorize Questions
📚
LMS
arxiv.org
·
5d
5 days ago
The Benchmark Illusion: Pruned
LLMs
Can Pass Multiple Choice but Fail to Answer
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The Benchmark Illusion: Pruned LLMs Can Pass Multiple Choice but Fail to Answer
🤖
AI
unsloth.ai
·
3d
3 days ago
GLM-5.2 – How to Run Locally
Covers
2 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Covered by
news.smol.ai
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for GLM-5.2 – How to Run Locally
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report