AI Infrastructure

Show HN: Zerostack, an open coding agent optimized for memory footprint

 🧠LLM Engineering

A drop-in replacement chat template for google/gemma-4-31B-it tuned for open-source agentic coding harnesses.

 🧠LLM Engineering

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🤖AI  Content type: News  Content type: Blog
blog.google··Hacker News

Introducing Granite Libraries and Project Granite Switch

 🧠LLM Engineering  Content type: Blog

TjWheeler/deep-memory: A GraphRAG implementation with a Vocabulary system to optimise AI integration

 🏠HomeLab  Content type: Code
github.com··Hacker News

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News

If Claude Fable stops helping you, you'll never know

 🧠LLM Engineering  Content type: Blog

Does anyone know what PCIe mode was used for these benchmarks?

 🧠LLM Engineering  Content type: Code
github.com··r/LocalLLaMA

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖AI  Content type: Code
github.com··r/LocalLLaMA

mirkolenz/llmhop: Tiny, stateless Go router that dispatches OpenAI-compatible requests to single-model vLLM and sglang backends with zero external dependencies

 🧠LLM Engineering  Content type: Code
github.com··Hacker News

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 🍎Apple  Content type: Code
github.com··Hacker News

chipmates/agoracosmica: A Living Library You Can Talk To. Open-source educational platform with 30 historical figures from philosophy, science, art, mysticism, and activism. Stories, dialogues, AI conversation, multi-figure councils. Nonprofit, BYOK, self-hostable, no behavioral tracking.

 🖥️Self-hosted apps  Content type: Code
github.com··Hacker News

ninoxAI/nightwatch: Open-source, local-first, read-only AI SRE: clusters alert storms, investigates root cause over your live systems, proposes human-gated fixes.

 🖥️Self-hosted apps  Content type: Code
github.com··Hacker News

Shrivastava-Aditya/boolean-algebra-engine: Deterministic boolean algebra engine — evaluates expressions, detects contradictions, audits logic rules. MCP server, NL layer, REST API, CLI, Streamlit UI.

 🧠LLM Engineering  Content type: Code
github.com··Hacker News, r/LLM

Less-relevant results

Show HN: CLI for scoring OpenAPI for LLM legibility

 🧠LLM Engineering  Content type: Code
github.com··Hacker News
