Optimization Theory

Feeds to Scour
SubscribedAll
Scoured 63 posts in 16.3 ms

Predictive Coding with Bayesian Priors via Proximal Gradients

 📉Proximal Gradient  Content type: Academic
arxiv.org·

Meltedd/scarecrow: An adversarial frame pattern optimizer for evading automated license plate recognition, personalized to your plate.

 🧠Machine Learning  Content type: Code
github.com··Hacker News

The Untrainable

 🧠Machine Learning  Content type: News  Content type: Blog

Ultrafast machine learning on FPGAs via Kolmogorov-Arnold Networks

 🧠Machine Learning

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

 📐Linear Algebra  Content type: Blog
tridao.me··Hacker News

A Theory on Flow Matching with Neural Networks

 🧠Machine Learning  Content type: Academic
arxiv.org·

Designing Loops That Prompt Coding Agents: The Six I Actually Run

 ✍️Prompt Engineering

An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization

 🧠Machine Learning  Content type: Academic
arxiv.org·

Growing Pains of Starting a Secret Society

 🧠Machine Learning  Content type: Blog

Duality for Optimal Multi-Item, Multi-Bidder Auction Design: Revenue Certificates through Deep Learning

 🧠Machine Learning  Content type: Academic
arxiv.org·

Pseudospectral Bounds for Transient Amplification in Coupled Gradient Descent

 🧠Machine Learning  Content type: Academic
arxiv.org·

PL-KKT-hPINN: Enforcing Nonlinear Equality Constraints on Neural Networks via Piecewise-Linear Projection

 🧠LLM  Content type: Academic
arxiv.org·

ml-from-scratch-book/code: Companion code for Machine Learning From Scratch — 10 core ML algorithms built from scratch with NumPy, compared with Scikit-learn and PyTorch.

 🧠Machine Learning  Content type: Code
github.com··Hacker News

A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training

 🤖Transformers  Content type: Academic
arxiv.org·

Near-Optimal Decentralized Stochastic Convex Optimization over Networks

 📐Semidefinite Programming  Content type: Academic
arxiv.org·

Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks

 🧠Machine Learning  Content type: Academic
arxiv.org·

Second-Order Path Kernel Interpolation Formulas in Machine Learning

 🧠Machine Learning  Content type: Academic
arxiv.org·

When Both Layers Learn: Training Dynamics of Representing Linear Models via ReLU Networks

 🧠Machine Learning  Content type: Academic
arxiv.org·

Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters

 🧠Machine Learning  Content type: Academic
arxiv.org·

Structured Adaptive Tensor Prediction for Streaming Data

 📶Communications  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help