🤖 AI - jimman · Scour

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

⚡Model Efficiency Code

github.com··Hacker News, r/LLM

Making a Vintage LLM from Scratch

⚡Model Efficiency

crlf.link··Hacker News

Why LLMs (still) lack taste

⚡LLM Optimization

beyondtheprior.com··Hacker News

Orchestrate your LLM pipeline. Locally

⚡LLM Optimization

llmforge.app··Hacker News

Agentic Frameworks

✍️Prompt Engineering News Blog

astledsa.substack.com··Substack

Dynamic ReACT Loop with Conductor

conductor-oss.github.io··Hacker News

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

aermia.com··Hacker News

Apple WWDC On-Device AI Deep Dive - Google Docs

⚡Model Efficiency

gist.is··Hacker News

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

✍️Prompt Engineering

pub.towardsai.net

·

Profiling in PyTorch (Part 2): From Nn.Linear to a Fused MLP

⚡Model Efficiency Blog

huggingface.co··Hacker News

Siri AI at WWDC 2026

✍️Prompt Engineering

simonwillison.net··Hacker News

LLM Research Papers: The 2026 List (January to May)

⚡LLM Optimization News

magazine.sebastianraschka.com

··Hacker News

Researchers say they trained a foundation model from scratch for about $1,500

✍️Prompt Engineering

venturebeat.com··Hacker News

DiffusionGemma: 4x Faster Text Generation

⚡Model Efficiency News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

Alleged Fable sabotage of an ML project

✍️Prompt Engineering

xcancel.com··Hacker News

Mbodi AI (YC P25) Is Hiring Founding Machine Learning Engineer (Robotics)

✍️Prompt Engineering

ycombinator.com··Hacker News

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

⚡LLM Optimization News

newsletter.semianalysis.com

··Hacker News

Anthropic's Fable 5 Silent Sabotage Mode

✍️Prompt Engineering

everettdutton.com··Hacker News

Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM

⚡LLM Optimization

deemwar-products.github.io··Hacker News

It blocked us at 'hello!' Anthropic Fable 5 refusing innocuous prompts

✍️Prompt Engineering News

theregister.com··Hacker News

Log in to enable infinite scrolling