🧠 LLM Inference - emschwartz · Scour

BREAKING🚨: Stanford University just launched a FREE AI tool for researchers!

threadreaderapp.com·3d

Performance Tip of the Week #79: Make at most one tradeoff at a time

abseil.io·4d

⚙️Mechanical Sympathy

Mastering Unstructured data: The Blueprint For Efficient Solution

pub.towardsai.net·3d

🔤Tokenization

NVIDIA VibeTensor: AI Just Built Its Own Deep Learning Engine… And It Actually Works (AI Revolution

youtube.com·4d

Hardware Acceleration

jellyfin.org·4d

⚡Hardware Acceleration

Planning Work for Our Single-Threaded Brains

linkedin.com·4d

userface.ai·4d

How StrongDM’s AI team build serious software without even looking at the code

simonw.substack.com·4d·

Discuss: Substack

🏗️LLM Infrastructure

6 AI Agents, One Company

voxyz.space·3d

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

news.ycombinator.com·4d·

Discuss: Hacker News

LOTFormer: Doubly-Stochastic Linear Attention via Low-Rank Optimal Transport

arxiv.org·2d

🕸️Sparse Vectors

Decomposing Reasoning Efficiency in Large Language Models

arxiv.org·1d

🧮SMT Solvers

Making a Hardware Accelerated Live TV Player from Scratch in C: HLS Streaming, MPEG-TS Demuxing, H.264 Parsing, and Vulkan Video Decoding

blog.jaysmito.dev·3d·

Discuss: Hacker News, r/programming

📄File Formats

Hallucinations in GPT5 – Can models say "I don't know" (June 2025)

jobswithgpt.com·4d·

Discuss: Hacker News

Beyond agentic coding

haskellforall.com·4d·

Discuss: Lobsters, Hacker News, r/programming

👨‍💻AI Coding

For real game-theoretic reasoning, we need best response in imperfect information games

weyxie.bearblog.dev·3d·

Discuss: Hacker News

🛡️AI Security

We recreated the Anthropic C compiler agent

vizops.ai·3d·

Discuss: Hacker News

⚙️Language Runtimes

Heterogeneous Processing: A Strategy for Augmenting Moore's Law (2006)

linuxjournal.com·4d·

Discuss: Hacker News

🖥️Hardware Architecture

Generative Modeling via Drifting

lambertae.github.io·6d·

Discuss: Hacker News

📦Batch Embeddings

EBM vs. LLMs: Our Kona EBM a 96% vs. 2% Sudoku Benchmark

logicalintelligence.com·6d·

Discuss: Hacker News

🏆LLM Benchmarking
