🤖 AI - widget101 · Scour

SLUUG Talk: Demystifying Large Language Models on Linux

🎮Reinforcement Learning Code

github.com··DEV

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

👁️Observability

zozo123.github.io··Hacker News

Building a Multilayer Perceptron from Scratch: What It Taught Me About Neural Networks

📋Tokei Blog

Apple WWDC On-Device AI Deep Dive - Google Docs

🧠Memory Management

gist.is··Hacker News

Breaking the Ice: Analyzing Cold Start Latency in vLLM

📈Performance Profiling Academic

arxiv.org··Hacker News

Machine learning from scratch, what to build before using scikit-learn

🐍Scientific Python Tutorial

iwtlp.com··DEV

PyTorch from Scratch — Part 1: Tensors, Gradients & Activations

🦀Rust Scientific

Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.

📐Vector Embeddings

pub.towardsai.net

·

The Hardware That Makes AI Possible

🏗️Hardware Architecture

towardsdatascience.com·

DiffusionGemma: 4x Faster Text Generation

📈Time Series News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

Why LLMs (still) lack taste

🎮Reinforcement Learning

beyondtheprior.com··Hacker News

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation

aermia.com··Hacker News

Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

🔄CI/CD Blog

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

💾Cache Optimization News

spectrum.ieee.org

··Hacker News

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🐙GitHub News

newsletter.semianalysis.com

··Hacker News

How to Become an AWS AI Architect,The Honest Roadmap, the Projects, and Landing the Job

☁️AWS Infrastructure

hackernoon.com·

Best explanations of how LLMs work

🎮Reinforcement Learning Blog

vorushin.github.io··Hacker News

Siri AI at WWDC 2026

simonwillison.net··Hacker News

Token4Token — pay-per-token inference on Gnosis + Swarm

📊Columnar Engines

t4t.eth.link··Hacker News

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🏗️Hardware Architecture Code

github.com··Hacker News

Log in to enable infinite scrolling