Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
ai models
馃 ai models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
14
posts in
6.9
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
聽
馃
language models
聽
Content type:
Code
github.com
路
5d
5 days ago
路
Hacker News
,
r/LLM
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
DiffusionGemma: 4x Faster Text
Generation
聽
馃挰
LLM
聽
Content type:
News
聽
Content type:
Blog
blog.google
路
1d
1 day ago
路
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
Initial impressions of Claude Fable 5
聽
馃摑
Git
simonwillison.net
路
1d
1 day ago
路
Hacker News
Actions for Initial impressions of Claude Fable 5
Show HN:
Ext-Infer
聽
馃
Ollama
infer.displace.tech
路
4d
4 days ago
路
Hacker News
Actions for Show HN: Ext-Infer
A wild idea: Abstract reality using ontology
聽
馃
language models
聽
Content type:
Discussion
news.ycombinator.com
路
5d
5 days ago
路
Hacker News
Actions for A wild idea: Abstract reality using ontology
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
聽
馃
Ollama
deemwar-products.github.io
路
6d
6 days ago
路
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
Show HN: Audit any
AI/data
pairing with Veritrooper
聽
馃啎
New AI
veritrooper.com
路
6d
6 days ago
路
Hacker News
Actions for Show HN: Audit any AI/data pairing with Veritrooper
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
聽
馃惂
Linux
local-llm.utop.workers.dev
路
4d
4 days ago
路
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
How to Train Your Goblin
聽
馃啎
New AI
goblins.mchen.workers.dev
路
4d
4 days ago
路
Hacker News
,
Hacker News
Actions for How to Train Your Goblin
GGUF vs GPTQ vs AWQ: The Plain-English Guide to
LLM
Quantization (and Which One to Pick)
聽
馃
Ollama
vettedconsumer.com
路
5d
5 days ago
路
Hacker News
Actions for GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
vishal-dehurdle/state-harness: Runtime safety
net
for
LLM
agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness
聽
馃啎
New AI
聽
Content type:
Code
github.com
路
2d
2 days ago
路
Hacker News
,
Hacker News
Actions for vishal-dehurdle/state-harness: Runtime safety net for LLM agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness
How to Set Up Codebase Indexing in Kilo Code
聽
馃
Ollama
聽
Content type:
News
聽
Content type:
Blog
blog.kilo.ai
路
5d
5 days ago
Actions for How to Set Up Codebase Indexing in Kilo Code
Unsloth Gemma 4 QAT
聽
馃
Ollama
unsloth.ai
路
6d
6 days ago
Actions for Unsloth Gemma 4 QAT
ninoxAI/nightwatch: Open-source, local-first, read-only
AI
SRE: clusters alert storms, investigates root cause over your live systems, proposes human-gated fixes.
聽
馃
AI Agent
聽
Content type:
Code
github.com
路
3d
3 days ago
路
Hacker News
Actions for ninoxAI/nightwatch: Open-source, local-first, read-only AI SRE: clusters alert storms, investigates root cause over your live systems, proposes human-gated fixes.
No more posts from comwena's subscribed feeds.
Scour all
25258
feeds
Learn more about Feeds
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help