📈 Optimization - jyunzhang · Scour

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

🧠Deep Learning Academic

Machine learning from scratch, what to build before using scikit-learn

🤖Machine Learning Tutorial

iwtlp.com · DEV

Backpropagation Without the Magic: A First-Principles Derivation

🧠Deep Learning Blog


Physics-informed neural networks with Caputo-Fabrizio derivatives for nonlinear fractal-fractional delay equations and chaotic systems

🧠Deep Learning Academic

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

🤖Machine Learning Code

github.com · Hacker News

PyTorch Stochastic Gradient Optimization Technique

🤖Machine Learning

sitepoint.com

Asynchronous AI cuts computing energy by orders of magnitude while learning continuously

🧠Deep Learning

techxplore.com

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

📐Linear Algebra Blog

tridao.me · Hacker News

From SGD to Muon: An Incremental Tutorial (Fable-5)

📐Linear Algebra Blog

sankalp.bearblog.dev

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

🧠Deep Learning Academic

markusheimerl/gpt: A generative pretrained transformer implementation

🤖Transformers Code

github.com · Hacker News

A Theory on Flow Matching with Neural Networks

🧠Deep Learning Academic

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

🤖Machine Learning Academic

Exploring the Design Space of Reward Backpropagation for Flow Matching

🧠Deep Learning Academic

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

🎮Reinforcement Learning Academic

Quantifying Uncertainty In Wide Two-Layer Neural Networks: On The Law Of The Limiting Fluctuation Process

🧠Deep Learning Academic

Overcoming Rank Collapse in Feedback Alignment

🧠Deep Learning Academic

Second-Order Path Kernel Interpolation Formulas in Machine Learning

🧠Deep Learning Academic

Theory of learning of high-dimensional controlled non-linear dynamical systems (I): models and methods

🤖Machine Learning Academic

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

🤖LLMs Academic
