🪟 Context Windows - minezone

Less-relevant results

manavgup/context-analyzer: Context window usage analyzer for Claude Code — MCP server + interactive dashboard

💬Prompt Engineering Code

github.com··Hacker News

How LLMs work | Practical Leaders

💬Prompt Engineering

practical-leaders.com··Hacker News

Quiz: Embeddings and Vector Databases With ChromaDB

🗂️Vector Databases

realpython.com·

Why my first RAG system hallucinated (and how I fixed it)

🗂️Vector Databases Blog

dev.to··DEV

LLM are universal simulators

💬Prompt Engineering

invertedpassion.com··Hacker News

Dense Contexts Are Hard Contexts: Lexical Density Limits Effective Context in LLMs

💬Prompt Engineering Academic

arxiv.org··Hacker News

Kimi Code: Next-Gen AI Code Agent for Terminal & IDE

💻CLI Tools

kimi.com

Show HN: Lore – LLM proxy for coding agent context and memory management

💬Prompt Engineering

withlore.ai··Hacker News

Larger context windows and configurable reasoning levels for GitHub Copilot - GitHub Changelog

💬Prompt Engineering Blog

github.blog··Hacker News

Stop Whispering to the Model, Start Furnishing Its Brain

💬Prompt Engineering Blog

dev.to··DEV

Choosing the Right Vector Database for RAG and AI Applications

🗂️Vector Databases Blog

analyticsvidhya.com·

JeevanJoshi2061/titan_engine_core: Constant-memory sequence modeling engine combining selective holographic-compression (ASH-C) with a coordinate pointer network (HEP-DNA). Bypasses the linear KV Cache bottleneck on consumer GPUs.

🌐Open Source Code

github.com··Hacker News

Initial impressions of Claude Fable 5

🐍Python

simonwillison.net··Hacker News

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

💬Prompt Engineering Code

github.com··Hacker News

Why I stopped using LLMs to generate code (and what I use instead)

💬Prompt Engineering Blog

dev.to··DEV

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

🧩LLM Integration

local-llm.utop.workers.dev··Hacker News

rag-explained-how-it-works

Beyond Basic RAG (Part 3): Agentic RAG, CRAG, Self-RAG and GraphRAG Explained | M012 | Mehul Ligade

RAG-Based Testing Series — Part 1: What Is RAG & Why Your Old Testing Playbook Won't Work Here

Why LLMs (still) lack taste

manavgup/context-analyzer: Context window usage analyzer for Claude Code — MCP server + interactive dashboard

How LLMs work | Practical Leaders

Quiz: Embeddings and Vector Databases With ChromaDB

Why my first RAG system hallucinated (and how I fixed it)

LLM are universal simulators

Dense Contexts Are Hard Contexts: Lexical Density Limits Effective Context in LLMs

Kimi Code: Next-Gen AI Code Agent for Terminal & IDE

Show HN: Lore – LLM proxy for coding agent context and memory management

Larger context windows and configurable reasoning levels for GitHub Copilot - GitHub Changelog

Stop Whispering to the Model, Start Furnishing Its Brain

Choosing the Right Vector Database for RAG and AI Applications

JeevanJoshi2061/titan_engine_core: Constant-memory sequence modeling engine combining selective holographic-compression (ASH-C) with a coordinate pointer network (HEP-DNA). Bypasses the linear KV Cache bottleneck on consumer GPUs.

Initial impressions of Claude Fable 5

ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.

Why I stopped using LLMs to generate code (and what I use instead)

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU