Optimization

Convex Optimization, Loss Functions, Gradient Methods, Adam Optimizer

Feeds to Scour
SubscribedAll
Scoured 122 posts in 12.2 ms

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

 🎲Probability  Content type: Academic
arxiv.org·

Machine learning from scratch, what to build before using scikit-learn

 🧠Neural Networks  Content type: Tutorial
iwtlp.com··DEV

Backpropagation Without the Magic: A First-Principles Derivation

 🤖AI  Content type: Blog
medium.com
·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 🤖AI  Content type: Code
github.com··Hacker News

**PyTorch Stochastic Gradient Optimization Technique**

 🤖AI
sitepoint.com·

From SGD to Muon: An Incremental Tutorial (Fable-5)

 🧠Neural Networks  Content type: Blog

Gradient-informed Hamiltonian Monte Carlo for multicomponent CALPHAD model optimization and uncertainty quantification

 ⚛️Physics  Content type: Academic
sciencedirect.com·

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

 Transformers  Content type: Academic
arxiv.org·

A Theory on Flow Matching with Neural Networks

 🤖AI  Content type: Academic
arxiv.org·

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

 🤖Machine Learning  Content type: Academic
arxiv.org·

markusheimerl/gpt: A generative pretrained transformer implementation

 Transformers  Content type: Code
github.com··Hacker News

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

 🧠Deep Learning  Content type: Academic
arxiv.org·

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 🧠Deep Learning  Content type: Academic
arxiv.org·

Exploring the Design Space of Reward Backpropagation for Flow Matching

 🤖AI  Content type: Academic
arxiv.org·

Flatland: The Adventures of Gradient Descent with Large Step Sizes

 🧠Deep Learning  Content type: Academic
arxiv.org·

Overcoming Rank Collapse in Feedback Alignment

 🤖AI  Content type: Academic
arxiv.org·

Second-Order Path Kernel Interpolation Formulas in Machine Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

 🎯RLHF  Content type: Academic
arxiv.org·

Predictive Coding with Bayesian Priors via Proximal Gradients

 🎲Probability  Content type: Academic
arxiv.org·

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

 🤖Machine Learning  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help