Context Windows

Feeds to Scour
SubscribedAll
Scoured 301 posts in 11.8 ms

rag-explained-how-it-works

 🗂️Vector Databases  Content type: Blog
dev.to··DEV

Beyond Basic RAG (Part 3): Agentic RAG, CRAG, Self-RAG and GraphRAG Explained | M012 | Mehul Ligade

 🗂️Vector Databases
pub.towardsai.net
·

RAG-Based Testing Series — Part 1: What Is RAG & Why Your Old Testing Playbook Won't Work Here

 🗂️Vector Databases
linkedin.com··DEV

Why LLMs (still) lack taste

 📝NLP
Less-relevant results

manavgup/context-analyzer: Context window usage analyzer for Claude Code — MCP server + interactive dashboard

 💬Prompt Engineering  Content type: Code
github.com··Hacker News

How LLMs work | Practical Leaders

 💬Prompt Engineering

Quiz: Embeddings and Vector Databases With ChromaDB

 🗂️Vector Databases
realpython.com·

Why my first RAG system hallucinated (and how I fixed it)

 🗂️Vector Databases  Content type: Blog
dev.to··DEV

LLM are universal simulators

 💬Prompt Engineering

Dense Contexts Are Hard Contexts: Lexical Density Limits Effective Context in LLMs

 💬Prompt Engineering  Content type: Academic
arxiv.org··Hacker News

Kimi Code: Next-Gen AI Code Agent for Terminal & IDE

 💻CLI Tools
kimi.com
·

Show HN: Lore – LLM proxy for coding agent context and memory management

 💬Prompt Engineering
withlore.ai··Hacker News

Larger context windows and configurable reasoning levels for GitHub Copilot - GitHub Changelog

 💬Prompt Engineering  Content type: Blog
github.blog··Hacker News

Stop Whispering to the Model, Start Furnishing Its Brain

 💬Prompt Engineering  Content type: Blog
dev.to··DEV

Choosing the Right Vector Database for RAG and AI Applications

 🗂️Vector Databases  Content type: Blog
analyticsvidhya.com·

JeevanJoshi2061/titan_engine_core: Constant-memory sequence modeling engine combining selective holographic-compression (ASH-C) with a coordinate pointer network (HEP-DNA). Bypasses the linear KV Cache bottleneck on consumer GPUs.

 🌐Open Source  Content type: Code
github.com··Hacker News

Initial impressions of Claude Fable 5

 🐍Python
simonwillison.net··Hacker News

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

 💬Prompt Engineering  Content type: Code
github.com··Hacker News

Why I stopped using LLMs to generate code (and what I use instead)

 💬Prompt Engineering  Content type: Blog
dev.to··DEV

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🧩LLM Integration

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help