Optimization

Gradient Descent, Convex Optimization, Stochastic Methods, Loss Functions

Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization

 🎯Optimization Theory  Content type: Academic
arxiv.org

Sakana AI's Recursive Self-Improvement (RSI) Lab

 🤖AI
sakana.ai · Hacker News

Forgis-Labs/HEPA: HEPA: Self-supervised horizon-conditioned event predictive architecture for time series. Spotlight at FMSD @ ICML 2026.

 🧠Deep Learning  Content type: Code
github.com · Hacker News

LLM are universal simulators

 🗣️Large Language Models

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🎯Optimization Theory

Aligning Superintelligent Humans

 🔥PyTorch
lesswrong.com

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 📐Linear Algebra  Content type: Blog
tridao.me · Hacker News

Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent

 🤖AI  Content type: News

The sample efficiency black hole

 🗣️Large Language Models  Content type: News
dwarkesh.com · Hacker News

I open-sourced my UFC prediction model, code, and database after 5 years of work

 🤖AI
mcinerney.ai · Hacker News

Why Compiler Engineers Rarely Use Strassen's Algorithm for Fast Matrix Multiplications

 💻Computer Science  Content type: News, Blog

From Jupyter Notebook to production: How to ship AI systems that actually work

 🐍Python

Less-relevant results

The Untrainable

 🎯Optimization Theory  Content type: News, Blog

Vibe Diaries: Training Nanochat

 🤖AI

The Loop That Improves Almost Anything

 Automatic Differentiation  Content type: Blog

A one-parameter model that gets 100% on ARC-AGI-2

 🧠Neural Networks

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

 🎯Optimization Theory  Content type: Academic
arxiv.org

Unpacking AI: The Hardware Behind AI

 🤖AI  Content type: News

Optimal Seating on the Airbus A380

 🐍Python  Content type: Blog

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 📈Statistical Learning  Content type: Code
github.com · Hacker News
