🧠 Large Language Models (LLMs) - pleto · Scour

Markov Chains: The Grandparents of LLMs

✨Model optimizations in LLMs

dmanco.dev · Hacker News

Show HN: In-browser real LLM token counter and cost estimation

💬Prompt optimizations for LLM serving

holaclaw.ai · Hacker News

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

🤖Agents using LLMs Discussion

news.ycombinator.com · Hacker News

LLM are universal simulators

✨Model optimizations in LLMs

invertedpassion.com · Hacker News

Generative AI in the Real World: Agentic Systems Fundamentals with Maarten Grootendorst

🔍Retrieval-augmented generation Audio

Context compression finally works in production: new research cuts LLM input 16x without the accuracy hit

🔍Retrieval-augmented generation

venturebeat.com

Every LLM Tool Call Needs an Output Budget

🤖Agents using LLMs Blog

axamy.com · Hacker News

Google open-sources speedy DiffusionGemma text diffusion model

🔍Retrieval-augmented generation

siliconangle.com

local llm on laptop 780M GPU using llama + gemma 4 qat

🔢Quantization of LLMs Blog

alper.bearblog.dev

Google's new open-weights model brings image-generation tricks to AI text generation

📊AI Performance Profiling News

theregister.com

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🔢Quantization of LLMs Blog

adambien.blog

New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"

🤖Agents using LLMs Discussion

news.ycombinator.com · Hacker News

How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?

💬Prompt optimizations for LLM serving Blog

AI context windows: Why context quality beats context size

🔍Retrieval-augmented generation Blog

If LLMs are all persona, whose persona are they?

✨Model optimizations in LLMs

persona.earthpilot.ai · Hacker News

Report: GKE Inference Gateway delivers up to 92% faster AI responses

🔧Systems-level optimizations for LLM serving Blog

cloud.google.com · Hacker News

Don't let the LLM speak, just probe it (8 minute read)

💬Prompt optimizations for LLM serving Blog

blog.j11y.io · Hacker News

langchain-ai/langchain langchain-core==1.4.6

🔍Retrieval-augmented generation Code

Tokenminning: Because Tokenmaxxing Is a Bad Idea

💬Prompt optimizations for LLM serving

tokenminning.com · Hacker News

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

✨Model optimizations in LLMs Academic
