📈 Optimization - jhcha.oyo · Scour

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

🎲Probability Academic

Machine learning from scratch, what to build before using scikit-learn

🧠Neural Networks Tutorial

iwtlp.com · DEV

Backpropagation Without the Magic: A First-Principles Derivation

🤖AI Blog

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

🤖AI Code

github.com · Hacker News

PyTorch Stochastic Gradient Optimization Technique

sitepoint.com

From SGD to Muon: An Incremental Tutorial (Fable-5)

🧠Neural Networks Blog

sankalp.bearblog.dev

Gradient-informed Hamiltonian Monte Carlo for multicomponent CALPHAD model optimization and uncertainty quantification

⚛️Physics Academic

sciencedirect.com

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

⚡Transformers Academic

A Theory on Flow Matching with Neural Networks

🤖AI Academic

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

🤖Machine Learning Academic

markusheimerl/gpt: A generative pretrained transformer implementation

⚡Transformers Code

github.com · Hacker News

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

🧠Deep Learning Academic

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

🧠Deep Learning Academic

Exploring the Design Space of Reward Backpropagation for Flow Matching

🤖AI Academic

Flatland: The Adventures of Gradient Descent with Large Step Sizes

🧠Deep Learning Academic

Overcoming Rank Collapse in Feedback Alignment

🤖AI Academic

Second-Order Path Kernel Interpolation Formulas in Machine Learning

🤖Machine Learning Academic

Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

🎯RLHF Academic

Predictive Coding with Bayesian Priors via Proximal Gradients

🎲Probability Academic

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

🤖Machine Learning Academic
