LLMs


Show HN: RiskKernel, kill -9 an AI agent and resume it without paying twice

 🏠Self-hosting

Tool to convert technical PDFs into RAG-ready chunks and Obsidian vaults

 🪨Obsidian

TheArcForge/Hades: Unity-aware AI infrastructure for Claude Code — a knowledge graph + 88 MCP tools that let your AI agent know your project, not just grep its files.

 🕸️Knowledge Graphs  Content type: Code
github.com · Hacker News

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

 🤖Machine Learning  Content type: Academic
arxiv.org

GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)

 🧠LLM Inference

Florian Brand, Prime Intellect research engineer, adopts Gemma 4 E4B 6-bit quantized as his primary local Mac LLM

 💬Natural Language Processing  Content type: News
digg.com · Hacker News

Anthropic and OpenAI both said context is the bottleneck for data agents. Here's what they didn't say.

 🪟Context Windows  Content type: Blog

When More Documents Hurt RAG: Mitigating Vector Search Dilution with Domain-Scoped, Model-Agnostic Retrieval

 🔍Information Retrieval  Content type: Academic
arxiv.org

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

 🧠LLM Inference

scottpurdy/llmbuffer: LLM conversation buffer with cache optimization and dynamic context.

 🪟Context Windows  Content type: Code

Sales Is the Customer Clock

 🪟Context Windows
hari.computer · Hacker News

An interactive introduction to the terrific experience of rendering Arabic and its technical debt

 🪟Context Windows  Content type: Blog

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 🧠LLM Inference  Content type: News  Content type: Blog
blog.google · Hacker News

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

 🪟Context Windows  Content type: Academic
arxiv.org

Stack Overflow didn't just help AI learn to code

 🤖LLM

memory OS for AI agents (ranks, compresses and evolves agents memory)

 🔍Information Retrieval

The Wrong Epsilon to the Brain

 🪟Context Windows

Jott2121/agent-gate: MCP server that adds a fail-closed quality gate and hash-chained receipt ledger to any AI agent workflow.

 🐍Python  Content type: Code
github.com · Hacker News

The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection

 🪟Context Windows  Content type: Academic
arxiv.org

Tokenminning: Because Tokenmaxxing Is a Bad Idea

 🪟Context Windows

