Optimization

Feeds to Scour
SubscribedAll
Scoured 40 posts in 6.9 ms

Variational Proximal Policy Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 🤖Machine Learning  Content type: Code
github.com··Hacker News

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 ⚡CUDA  Content type: Blog
tridao.me··Hacker News

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🧠Neural Networks
aarushgupta.io··Lobsters, Hacker News

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 🔬Deep Learning  Content type: Academic
arxiv.org·

Finding Optimal Tokenizers

 🔤Tokenization  Content type: Blog
blog.aqnichol.com··Hacker News

Capacity-Constrained Online Convex Optimization with Delayed Feedback

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·
Less-relevant results

The Untrainable

 🎯Fine-tuning  Content type: News  Content type: Blog

Second-Order Path Kernel Interpolation Formulas in Machine Learning

 🤖Machine Learning  Content type: Academic
arxiv.org·

Designing Loops That Prompt Coding Agents: The Six I Actually Run

 📞Function Calling
cameronwestland.com··Hacker News

Simplicity Suffices for Parameter Noise Injection in Stochastic Gradient Descent

 🤖Machine Learning  Content type: Academic
arxiv.org·

Growing Pains of Starting a Secret Society

 🌱Digital Gardens  Content type: Blog

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Last-Iterate Convergence of Optimistic Multiplicative Weight Update

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Mirror Descent Beyond Euclidean Stability: An Exponential Separation in Initialization Sensitivity

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Flatland: The Adventures of Gradient Descent with Large Step Sizes

 🤖Machine Learning  Content type: Academic
arxiv.org·

Fixed-Parameter Tractability of Private Synthetic Data Generation

 🧠LLM Inference  Content type: Academic
arxiv.org·

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

 🤖Machine Learning  Content type: Academic
arxiv.org·

Noise-Adaptive High-Probability Regret Bounds for Online Convex Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

LieIPM: Lie Group Interior Point Method for Direct Trajectory Optimization of Rigid Bodies

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help