Optimization

Gradient Descent, Convex Optimization, Stochastic Methods, Loss Functions

iblameandrew/open-deepthink: Grok-heavy at the price of API cost. You choose the model. An unlimited army to think about your problem.

 💬Prompt Engineering  Content type: Code
github.com · r/LocalLLaMA

Fourier fractal dimension to predict the generalization of deep neural networks

 🧮Embeddings  Content type: Academic
arxiv.org

Flatland: The Adventures of Gradient Descent with Large Step Sizes

 🧮Embeddings  Content type: Academic
arxiv.org

Predictive Coding with Bayesian Priors via Proximal Gradients

 🧮Embeddings  Content type: Academic
arxiv.org

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

 📞Function Calling  Content type: Academic
arxiv.org

Forward-Only Convolutional Neural Networks with Learnable Channel-Class Assignment

 👁️Computer Vision  Content type: Academic
arxiv.org

Efficient Time Series Clustering from Multiscale Reservoir Dynamics with Granular-Ball Anchoring Graph Optimization

 🗄️Vector Databases  Content type: Academic
arxiv.org

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

 🧮Embeddings  Content type: Academic
arxiv.org

Exploring the Design Space of Reward Backpropagation for Flow Matching

 💬Prompt Engineering  Content type: Academic
arxiv.org

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 🤖Transformers  Content type: Code
github.com · Hacker News

Joint Movable Antenna Positioning and RIS Partitioning for Sum-Rate Maximization

 🏗️Systems Design  Content type: Academic
arxiv.org

Overcoming Rank Collapse in Feedback Alignment

 🛡️Error Handling  Content type: Academic
arxiv.org

DP-MacAdam: Differentially Private Mechanism with Adaptive Clipping and Adaptive Momentum

 🛡️AI Security  Content type: Academic
arxiv.org

Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods

 💬Prompt Engineering  Content type: Academic
arxiv.org

A Theory on Flow Matching with Neural Networks

 👁️Computer Vision  Content type: Academic
arxiv.org

Gradient Descent with Large Step Size Restores Symmetry in Deep Linear Networks with Multi-Pathway

 🧮Embeddings  Content type: Academic
arxiv.org

Structured Adaptive Tensor Prediction for Streaming Data

 🧮Embeddings  Content type: Academic
arxiv.org

Reinforcement Learning for Flow-Matching Policies with Density Transport

 💬Prompt Engineering  Content type: Academic
arxiv.org

Variational Proximal Policy Optimization

 🎭Anthropic Claude  Content type: Academic
arxiv.org

Beyond Linear and Overcomplete Regimes: A Mean-Field Analysis of Bottleneck Autoencoders

 🧮Embeddings  Content type: Academic
arxiv.org