Skip to main content
Scour
Discover
Docs
Login
Sign Up
Discover
About
Docs
Changelog
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🤖 Qwen
GitHub
·
4d
4 days ago
Native Inference Engine for macOS 14 or newer
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Native Inference Engine for macOS 14 or newer
aiweekly.co
·
6d
6 days ago
Lin Junyang AI Lab Closes Round at $2B Valuation
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Lin Junyang AI Lab Closes Round at $2B Valuation
Anyscale blog posts
·
3d
3 days ago
High Performance Distributed Inference with Ray Serve
LLM
Covered by
Google Cloud Blog
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for High Performance Distributed Inference with Ray Serve LLM
brightray.ai
·
4d
4 days ago
Built Uber aggregator that tracks top AI researchers and leaders
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Built Uber aggregator that tracks top AI researchers and leaders
huggingface.co
·
5d
5 days ago
bartowski/command-a-plus-05-2026-GGUF
Covers
4 stories
See all stories this covers
including
GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for bartowski/command-a-plus-05-2026-GGUF
arxiv.org
·
5d
5 days ago
Qwen-RobotWorld
Technical Report: Unifying Embodied World
Modeling
through
Language-Conditioned
Video Generation
Covered by
tldr.tech
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Qwen-RobotWorld Technical Report: Unifying Embodied World Modeling through Language-Conditioned Video Generation
teachmecoolstuff.com
·
3d
3 days ago
Fine Tuning a Tiny Local
LLM
to Categorize Questions
Discussed on
Hacker News
and
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Fine Tuning a Tiny Local LLM to Categorize Questions
Alex Ellis' Blog
·
5d
5 days ago
Local
Qwen
isn't a worse Opus, it's a different tool
Covered by
4 sources
See all sources covering this story
including
lemmy.ml
,
tldr.tech
Discussed on
Hacker News
,
Lobsters
, and
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Local Qwen isn't a worse Opus, it's a different tool
lmsys.org
·
6d
6 days ago
DFlash and Spec V2 Decoding (14 minute read)
Covers
5 stories
See all stories this covers
including
Looking for a self-hosted alternative to Modal.com for running ML workloads
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for DFlash and Spec V2 Decoding (14 minute read)
parsiya.net
·
4d
4 days ago
Brain the Size of a Planet: Are LLMs Thonking too Hard? (30 minute read)
Covers
Defense at AI speed: Microsoft’s new multi-model agentic security system tops leading industry benchmark
Covered by
tldr.tech
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Brain the Size of a Planet: Are LLMs Thonking too Hard? (30 minute read)
GitHub
·
1d
1 day ago
Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
anbeeld.com
·
1d
1 day ago
7900XTX 24GB vram, can finally fit Q6K+MTP with
Qwen
3.6 27B at 131k context
Discussed on
r/LocalLLaMA
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for 7900XTX 24GB vram, can finally fit Q6K+MTP with Qwen 3.6 27B at 131k context
lector.dev
·
2d
2 days ago
Show HN: Evaluating Local LLMs as
language
translators for my app
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Evaluating Local LLMs as language translators for my app
substack.productmind.co
·
6d
6 days ago
The US just treated an
LLM
as a munition
Covers
Statement on the US government directive to suspend access to Fable 5 and Mythos 5
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The US just treated an LLM as a munition
autonomy-landing-page.vercel.app
·
1d
1 day ago
Show HN: Autonomy – Self-Harness/Self-Directed AI Agent Core Under Development
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Autonomy – Self-Harness/Self-Directed AI Agent Core Under Development
agide.dev
·
1d
1 day ago
Ag.ide Index, rank, and refactor your repo's worst code
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Ag.ide Index, rank, and refactor your repo's worst code
hackernoon.com
·
5d
5 days ago
How I Built a Pipeline to Restore Old B&W Photos to 4K Color Using
Open-Source
AI
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for How I Built a Pipeline to Restore Old B&W Photos to 4K Color Using Open-Source AI
GitHub
·
6d
6 days ago
ahwurm/localharness:
Model-agnostic
agent harness for local LLMs — configure agents in YAML and run them on your own hardware (vLLM, Ollama, LM Studio, llama.cpp).
Covers
uv
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for ahwurm/localharness: Model-agnostic agent harness for local LLMs — configure agents in YAML and run them on your own hardware (vLLM, Ollama, LM Studio, llama.cpp).
Hacker News
·
4h
4 hours ago
The AI Conundrum: We are living in highly subsidized, interesting times
Discussed on
Hacker News
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The AI Conundrum: We are living in highly subsidized, interesting times
venturebeat.com
·
5d
5 days ago
Z.ai’s
open-weights
GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
Covers
8 stories
See all stories this covers
including
GLM-5.2 (6 minute read)
Covered by
4 sources
See all sources covering this story
including
vettedconsumer.com
,
AI Changes Everything
Love
Like
Not for me
Save
Add to your feed
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report