🧠 Machine Learning - yfff · Scour

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

📐Linear Algebra Blog

tridao.me··Hacker News

Exploring the Design Space of Reward Backpropagation for Flow Matching

🤖AI Academic

Less-relevant results

Agentic RL: Token-In, Token-Out Done Right

🎮Reinforcement Learning

qgallouedec-tito.hf.space··Hacker News

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

📐Optimization Theory Academic

Designing Loops That Prompt Coding Agents: The Six I Actually Run

✍️Prompt Engineering

cameronwestland.com··Hacker News

Overcoming Rank Collapse in Feedback Alignment

🤖AI Academic

See, Act, Correct: three levers for working with a code agent

🎮Reinforcement Learning Blog

blog.owulveryck.info··Hacker News

Growing Pains of Starting a Secret Society

📐Optimization Theory Blog

mrmarket.bearblog.dev··Hacker News

Flatland: The Adventures of Gradient Descent with Large Step Sizes

📐Optimization Theory Academic

Second-Order Path Kernel Interpolation Formulas in Machine Learning

📐Optimization Theory Academic

Stein Kernelized Molecular Dynamics for Active Learning of Interatomic Potentials

📐Optimization Theory Academic

Fourier fractal dimension to predict the generalization of deep neural networks

📐Optimization Theory Academic

Structured Adaptive Tensor Prediction for Streaming Data

📶Communications Academic

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

🤖Transformers Academic

Phantom transitions in language model fine-tuning

💬LLMs Academic

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

🤖Transformers Code

github.com··Hacker News

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

📐Optimization Theory Academic

Pseudospectral Bounds for Transient Amplification in Coupled Gradient Descent

📐Optimization Theory Academic

Reinforcement Learning for Flow-Matching Policies with Density Transport

🤖AI Academic

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

📐Optimization Theory Academic
