Optimization

Convex Optimization, Loss Functions, Gradient Methods, Adam Optimizer

Feeds to Scour
SubscribedAll
Scoured 185 posts in 5.4 ms

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 🧠Deep Learning  Content type: Academic
arxiv.org·

Machine learning from scratch, what to build before using scikit-learn

 🤖Machine Learning  Content type: Tutorial
iwtlp.com··DEV

Backpropagation Without the Magic: A First-Principles Derivation

 🧠Deep Learning  Content type: Blog
medium.com
·

Physics-informed neural networks with caputo-fabrizio derivatives for nonlinear fractal-fractional delay equations and chaotic systems

 🧠Deep Learning  Content type: Academic
nature.com·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 🤖Machine Learning  Content type: Code
github.com··Hacker News

**PyTorch Stochastic Gradient Optimization Technique**

 🤖Machine Learning
sitepoint.com·

Asynchronous AI cuts computing energy by orders of magnitude while learning continuously

 🧠Deep Learning
techxplore.com·

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 📐Linear Algebra  Content type: Blog
tridao.me··Hacker News

From SGD to Muon: An Incremental Tutorial (Fable-5)

 📐Linear Algebra  Content type: Blog

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

 🧠Deep Learning  Content type: Academic
arxiv.org·

markusheimerl/gpt: A generative pretrained transformer implementation

 🤖Transformers  Content type: Code
github.com··Hacker News

A Theory on Flow Matching with Neural Networks

 🧠Deep Learning  Content type: Academic
arxiv.org·

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

 🤖Machine Learning  Content type: Academic
arxiv.org·

Exploring the Design Space of Reward Backpropagation for Flow Matching

 🧠Deep Learning  Content type: Academic
arxiv.org·

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Quantifying Uncertainty In Wide Two-Layer Neural Networks: On The Law Of The Limiting Fluctuation Process

 🧠Deep Learning  Content type: Academic
arxiv.org·

Overcoming Rank Collapse in Feedback Alignment

 🧠Deep Learning  Content type: Academic
arxiv.org·

Second-Order Path Kernel Interpolation Formulas in Machine Learning

 🧠Deep Learning  Content type: Academic
arxiv.org·

Theory of learning of high-dimensional controlled non-linear dynamical systems (I): models and methods

 🤖Machine Learning  Content type: Academic
arxiv.org·

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

 🤖LLMs  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help