Optimization Theory

Feeds to Scour
SubscribedAll
Scoured 63 posts in 12.5 ms

Flatland: The Adventures of Gradient Descent with Large Step Sizes

馃Machine LearningContent type: Academic
arxiv.org

Revisiting Privacy Amplification by Subsampling in Selective Release DPSGD

馃Machine LearningContent type: Academic
arxiv.org

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

馃Machine LearningContent type: Academic
arxiv.org

Gridless Full-Space DOA Estimation for STAR-RIS-Assisted Wireless Systems

馃摱CommunicationsContent type: Academic
arxiv.org

A prism hierarchy of learning regimes in large linear autoencoders

馃Machine LearningContent type: Academic
arxiv.org

Noise-Adaptive High-Probability Regret Bounds for Online Convex Optimization

馃幉Stochastic ProcessesContent type: Academic
arxiv.org

Large-scale empirical tuning and comparison of default optimizers for variational inference

馃敟PyTorchContent type: Academic
arxiv.org

Lagrange multipliers in Maximum likelihood estimations and Least squares problems with Constraints

馃搳StatisticsContent type: Academic
arxiv.org

When Do Fewer Coordinates Suffice in DP-SGD?

馃Machine LearningContent type: Academic
arxiv.org

Adaptive directional gradients for parameterised quantum circuits

馃Machine LearningContent type: Academic
arxiv.org

ANCHOR: Autoregressive Non-intrusive Chunk-Ordered Refinement for Joint Multi-Resolution Speech Quality Modeling

馃摗Signal ProcessingContent type: Academic
arxiv.org

Thresholded Local Hyper-Flow Diffusion

馃搲Loss LandscapesContent type: Academic
arxiv.org

OptMuon: Closed-Loop Orthogonalized Momentum Methods for Stochastic Optimization with Zero-Noise Optimality

馃搻Semidefinite ProgrammingContent type: Academic
arxiv.org

Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning

馃幃Reinforcement LearningContent type: Academic
arxiv.org

The Spectral Dynamics and Noise Geometry of Muon

馃Machine LearningContent type: Academic
arxiv.org

Improved Convergence Analysis of Topology Dependence in Decentralized SGD

馃搲Loss LandscapesContent type: Academic
arxiv.org

Variational Proximal Policy Optimization

馃幃Reinforcement LearningContent type: Academic
arxiv.org

Low-Rank Decay for Grokking in Scale-Invariant Transformers: A Spectral-Geometric View

馃TransformersContent type: Academic
arxiv.org

On the conditional equivalence of phase retrieval algorithms

馃Machine LearningContent type: Academic
arxiv.org

Learning Dynamics Reveal a Hierarchy of Weight-Induced Layerwise Gram Metrics

馃Machine LearningContent type: Academic
arxiv.org

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help