🤖 ai models - comwena

💬LLM News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

Initial impressions of Claude Fable 5

📝Git

simonwillison.net··Hacker News

Show HN: Ext-Infer

🦙Ollama

infer.displace.tech··Hacker News

A wild idea: Abstract reality using ontology

🤖language models Discussion

news.ycombinator.com··Hacker News

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

🦙Ollama

deemwar-products.github.io··Hacker News

Show HN: Audit any AI/data pairing with Veritrooper

🆕New AI

veritrooper.com··Hacker News

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

🐧Linux

local-llm.utop.workers.dev··Hacker News

How to Train Your Goblin

🆕New AI

goblins.mchen.workers.dev··Hacker News, Hacker News

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

🦙Ollama

vettedconsumer.com··Hacker News

vishal-dehurdle/state-harness: Runtime safety net for LLM agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness

🆕New AI Code

github.com··Hacker News, Hacker News

How to Set Up Codebase Indexing in Kilo Code

🦙Ollama News Blog

blog.kilo.ai·

Unsloth Gemma 4 QAT

🦙Ollama

unsloth.ai·

ninoxAI/nightwatch: Open-source, local-first, read-only AI SRE: clusters alert storms, investigates root cause over your live systems, proposes human-gated fixes.

🤖AI Agent Code

github.com··Hacker News

No more posts from comwena's subscribed feeds.

Scour all 25258 feeds Learn more about Feeds

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

DiffusionGemma: 4x Faster Text Generation

Initial impressions of Claude Fable 5

Show HN: Ext-Infer

A wild idea: Abstract reality using ontology

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

Show HN: Audit any AI/data pairing with Veritrooper

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

How to Train Your Goblin

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

vishal-dehurdle/state-harness: Runtime safety net for LLM agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness

How to Set Up Codebase Indexing in Kilo Code

Unsloth Gemma 4 QAT

ninoxAI/nightwatch: Open-source, local-first, read-only AI SRE: clusters alert storms, investigates root cause over your live systems, proposes human-gated fixes.