LLM

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 💬LLMs  Content type: Code
github.com · Hacker News

What is Agentic RAG? Building Multi-Agent Agentic RAG Systems

 🤖Large Language Models
pub.towardsai.net

Slack bot for the whole team, not per-seat

 💬NLP  Content type: Discussion
plugand.ai · Hacker News

LLM-Based Code Documentation Generation and Multi-Judge Evaluation

 🤖Large Language Models  Content type: Academic
arxiv.org

These open-source tools do what Claude charges for, and some do it better

 🧩Logseq
xda-developers.com

Initial impressions of Claude Fable 5

 🕸️WebAssembly
simonwillison.net · Hacker News

LLM Inference Handbook 2026

 💬NLP
pub.towardsai.net

TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication

 🤖Large Language Models  Content type: Academic
arxiv.org

SaqlainXoas/llm-system-patterns: A docs-first guide to LLM system design — hybrid search, embedding pipelines, reranking, and LLM-as-judge patterns.

 🤖Large Language Models  Content type: Code

Fine-Tuning vs. RAG vs. Prompting: the Definitive Decision Framework for 2026

 💬LLMs
pub.towardsai.net

Flaws in the LLM Automation Narrative

 💬NLP  Content type: Academic
arxiv.org

LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…

 🤖Large Language Models
pub.towardsai.net

A handy llama-server launcher with easy model and configuration customisation

 💬NLP  Content type: Code
github.com · r/LocalLLaMA

An LLM-Native Psychometric Instrument Does Not Predict LLM Behavior: Evidence Across 25 Models

 💬NLP  Content type: Academic
arxiv.org

The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model

 💬LLMs  Content type: Academic
arxiv.org

Optimizing Local LLM Inference on Constrained Hardware

 🤖Large Language Models
pub.towardsai.net

AIchain Skill: A Prompt as a Reusable Object

 🤖Large Language Models  Content type: Code
github.com · DEV

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

 💬LLMs
pub.towardsai.net

Rosetta Memory: Adaptive Memory for Cross-LLM Agents

 🤖Agentic AI  Content type: Academic
arxiv.org

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 🤖LLM Inference  Content type: Code
github.com · r/LocalLLaMA
