Optimization

Gradient Descent, Convex Optimization, Stochastic Methods, Loss Functions

Feeds to Scour
SubscribedAll
Scoured 246 posts in 6.3 ms

An adaptive framework for the axisymmetric pulsar magnetosphere using physics-informed Kolmogorov-Arnold networks

 Automatic Differentiation  Content type: Academic
arxiv.org·

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

 🧠Neural Networks  Content type: Academic
arxiv.org·

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 🤖AI  Content type: Code
github.com··Hacker News

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 🧠Deep Learning  Content type: Academic
arxiv.org·

Structured Adaptive Tensor Prediction for Streaming Data

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Second-Order Path Kernel Interpolation Formulas in Machine Learning

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Overcoming Rank Collapse in Feedback Alignment

 🤖AI  Content type: Academic
arxiv.org·

Flatland: The Adventures of Gradient Descent with Large Step Sizes

 🧠Neural Networks  Content type: Academic
arxiv.org·

Learning Doubly Sparse Explicitly Conditioned Transforms

 📐Linear Algebra  Content type: Academic
arxiv.org·

Noise-Adaptive High-Probability Regret Bounds for Online Convex Optimization

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Variational Proximal Policy Optimization

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training

 🗣️Large Language Models  Content type: Academic
arxiv.org·

A Global Convergence Analysis of Consensus ALADIN for Convex Optimization

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Trading Utility for Dynamic Fairness in Multiple Resource Division with Sequential Demand

 🎯Decision Theory  Content type: Academic
arxiv.org·

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Fourier fractal dimension to predict the generalization of deep neural networks

 🧠Deep Learning  Content type: Academic
arxiv.org·

Conservation Laws from Data Symmetry in Neural Networks

 🌀Hamiltonian Mechanics  Content type: Academic
arxiv.org·

Projected Inverse Iteration: An Eigenvalue Approach to Ground-State Computation with Neural Quantum States

 🧠Neural Networks  Content type: Academic
arxiv.org·

Pseudospectral Bounds for Transient Amplification in Coupled Gradient Descent

 🎯Optimization Theory  Content type: Academic
arxiv.org·

Scaling Decision-Focused Learning to Large Problems with Lagrangian Decomposition

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·
Sign up or log in to see more results

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help