Optimization

Convex Optimization, Loss Functions, Gradient Methods, Adam Optimizer

Feeds to Scour
SubscribedAll
Scoured 87 posts in 9.3 ms

When Do Fewer Coordinates Suffice in DP-SGD?

 🤖Machine Learning  Content type: Academic
arxiv.org·

A Global Convergence Analysis of Consensus ALADIN for Convex Optimization

 ⚛️Physics  Content type: Academic
arxiv.org·

Near-Optimal Decentralized Stochastic Convex Optimization over Networks

 🕸️Network Effects  Content type: Academic
arxiv.org·

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

 🧠Deep Learning  Content type: Academic
arxiv.org·

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

 🤖Machine Learning  Content type: Academic
arxiv.org·

Hybridizing Equilibrium Propagation with Ising Machines for Efficient Energy-Based Learning

 🤖AI  Content type: Academic
arxiv.org·

Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway

 🤖Machine Learning  Content type: Academic
arxiv.org·

Statistical and Numerical Convergence in Stochastic Equilibrium

 📊Econometrics  Content type: Academic
arxiv.org·

Reinforcement Learning for Flow-Matching Policies with Density Transport

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Fog of Love: Engineering Virtuous Agent Behavior with Affinity-based Reinforcement Learning in a Game Environment

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Beyond Linear and Overcomplete Regimes: A Mean-Field Analysis of Bottleneck Autoencoders

 🤖Machine Learning  Content type: Academic
arxiv.org·

A Study of Parallel Continuous Local Search

 🧮Complexity Theory  Content type: Academic
arxiv.org·

Biweighted Poisson Subsampling for Convoluted Rank Regression with Massive Data

 📊Statistics  Content type: Academic
arxiv.org·

Prediction Under Imperfect Compression: A Theory of Approximate MDL

 Quantization  Content type: Academic
arxiv.org·

Closed-Form Spectral Regularization for Multi-Task Model Merging

 🧠Neural Networks  Content type: Academic
arxiv.org·

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

 🧠Neural Networks  Content type: Academic
arxiv.org·

Optimizing Energy-based Neural Network Training with Coherent Ising Machine

 🤖AI  Content type: Academic
arxiv.org·

Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Variational Proximal Policy Optimization

 🎯RLHF  Content type: Academic
arxiv.org·

Theory of learning of high-dimensional controlled non-linear dynamical systems (I): models and methods

 🧠Neural Networks  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help