🧠 LLMs - nate_dkz · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🧠LLM Code

github.com··Hacker News

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

🤖Large Language Models

pub.towardsai.net

·

Why LLMs (still) lack taste

beyondtheprior.com··Hacker News

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

🧠LLM News

spectrum.ieee.org

··Hacker News

FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention

🤖AI Academic

arxiv.org··Hacker News

How LLMs work | Practical Leaders

practical-leaders.com··Hacker News

A Plea to the Labs: Let the Models Diagnose.

🧠LLM Blog

tangent.bearblog.dev··Hacker News

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🤖AI Tools News

newsletter.semianalysis.com

··Hacker News

How we fight GPU scarcity without compromise

🧠LLM Blog

equixly.com··Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

gist.is··Hacker News

DiffusionGemma: 4x Faster Text Generation

🤖AI Tools News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

know the mother tongue of your LLMs

mothertoken.inigoimaz.com··Hacker News

What Are Tokens in LLMs?

🧠LLM Blog

bearisland.dev··Hacker News

Token4Token — pay-per-token inference on Gnosis + Swarm

t4t.eth.link··Hacker News

Less-relevant results

Machinic Psychopharmacology: Do LLMs Self-Medicate?

lesswrong.com··Hacker News

A system programmer’s guide to LLM inference

💬Natural Language Processing Blog

blog.xiangpeng.systems··Hacker News

Melanie Mitchell: What We Get Wrong About AI

yalereview.org··Substack, Hacker News, Hacker News

Slack bot for the whole team, not per-seat

🧠LLM Discussion

plugand.ai··Hacker News

LLM Research Papers: The 2026 List (January to May)

🤖AI News

magazine.sebastianraschka.com

··Hacker News

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

💻Operating Systems Blog

tilert.ai··Hacker News

Log in to enable infinite scrolling