Gradient Descent

Feeds to Scour
SubscribedAll
Scoured 106 posts in 6.2 ms

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

 🎪Convex Optimization  Content type: Academic
arxiv.org·

Backpropagation Without the Magic: A First-Principles Derivation

 📊Empirical Bayes  Content type: Blog
medium.com
·

Machine learning from scratch, what to build before using scikit-learn

 📈Linear Models  Content type: Tutorial
iwtlp.com··DEV

From SGD to Muon: An Incremental Tutorial (Fable-5)

 📐Matrix Factorization  Content type: Blog

**PyTorch Stochastic Gradient Optimization Technique**

 📈Linear Models
sitepoint.com·

Physics-informed neural networks with caputo-fabrizio derivatives for nonlinear fractal-fractional delay equations and chaotic systems

 📉ODE  Content type: Academic
nature.com·

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

 🎪Convex Optimization  Content type: News
spectrum.ieee.org
··Hacker News

The Smallest Brain You Can Build: A Perceptron in Python

 📊Empirical Bayes  Content type: Blog
ranpara.net··Hacker News

Welcome to Machine Learning With Manya: The Ultimate Adventure Map!

 📈Linear Models  Content type: Blog
medium.com·

markusheimerl/gpt: A generative pretrained transformer implementation

 🔢Embeddings  Content type: Code
github.com··Hacker News

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

 📊Empirical Bayes  Content type: Academic
arxiv.org·

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 📐Matrix Factorization  Content type: Blog
tridao.me··Hacker News

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🗺️UMAP

I Let an AI Agent Run 40 Experiments While I Slept

 📦Pseudobulk  Content type: Blog
oreilly.com·
Less-relevant results

The Untrainable

 📊Empirical Bayes  Content type: News  Content type: Blog

Vibe Diaries: Training Nanochat

 🔲Zarr
vibediary.dev··Hacker News

Human-Like Neural Nets by Catapulting

 🔢Embeddings
gwern.net··Hacker News

Exploring the Design Space of Reward Backpropagation for Flow Matching

 🗺️Manifold Learning  Content type: Academic
arxiv.org·

Asynchronous AI cuts computing energy by orders of magnitude while learning continuously

 🔢Embeddings
techxplore.com·

Learning Fuzzy Logic: Automatic Rule Discovery Through Differentiable Circuits

 🎪Convex Optimization
metafunctor.com··DEV

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help