Machine Learning
Neural Networks, Training, Models, Deep Learning
Scoured 78 posts in 12.5 ms
See, Act, Correct: three levers for working with a code agent
Reinforcement Learning · Content type: Blog · blog.owulveryck.info · 6 days ago · Hacker News
Overcoming Rank Collapse in Feedback Alignment
AI · Content type: Academic · arxiv.org · 19 hours ago
Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon
Linear Algebra · Content type: Blog · tridao.me · 1 day ago · Hacker News
Stein Kernelized Molecular Dynamics for Active Learning of Interatomic Potentials
Optimization Theory · Content type: Academic · arxiv.org · 6 days ago
Agentic RL: Token-In, Token-Out Done Right
Reinforcement Learning · qgallouedec-tito.hf.space · 1 day ago · Hacker News
Designing Loops That Prompt Coding Agents: The Six I Actually Run
Prompt Engineering · cameronwestland.com · 1 day ago · Hacker News
Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods
Optimization Theory · Content type: Academic · arxiv.org · 2 days ago
Pseudospectral Bounds for Transient Amplification in Coupled Gradient Descent
Optimization Theory · Content type: Academic · arxiv.org · 6 days ago
Growing Pains of Starting a Secret Society
Optimization Theory · Content type: Blog · mrmarket.bearblog.dev · 1 day ago · Hacker News
Optimal Rates for Generalization of Gradient Descent Methods with Deep Neural Networks
Optimization Theory · Content type: Academic · arxiv.org · 2 days ago
An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization
Optimization Theory · Content type: Academic · arxiv.org · 6 days ago
Flatland: The Adventures of Gradient Descent with Large Step Sizes
Optimization Theory · Content type: Academic · arxiv.org · 2 days ago
Structured Adaptive Tensor Prediction for Streaming Data
Communications · Content type: Academic · arxiv.org · 19 hours ago
A prism hierarchy of learning regimes in large linear autoencoders
Optimization Theory · Content type: Academic · arxiv.org · 5 days ago
Second-Order Path Kernel Interpolation Formulas in Machine Learning
Optimization Theory · Content type: Academic · arxiv.org · 2 days ago
mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]
Transformers · Content type: Code · github.com · 5 days ago · Hacker News
Fourier fractal dimension to predict the generalization of deep neural networks
Optimization Theory · Content type: Academic · arxiv.org · 1 day ago
A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training
Transformers · Content type: Academic · arxiv.org · 19 hours ago
Phantom transitions in language model fine-tuning
LLMs · Content type: Academic · arxiv.org · 1 day ago
Revisiting Privacy Amplification by Subsampling in Selective Release DPSGD
Optimization Theory · Content type: Academic · arxiv.org · 6 days ago