Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Neural Networks
🧠 Neural Networks
Deep Learning, Backpropagation, Layers
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
100
posts in
8.8
ms
Gradient
descent
at the Edge of Stability: free energy model and kinetic description of the
two-layer
network
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network
Generalization in
Deep
Neural
Networks
: Minimax Rates for Gradient Methods
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Generalization in Deep Neural Networks: Minimax Rates for Gradient Methods
KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
⚡
Flash Attention
Content type:
Code
github.com
·
3d
3 days ago
·
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
Flatland: The Adventures of
Gradient
Descent
with Large Step Sizes
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Flatland: The Adventures of Gradient Descent with Large Step Sizes
Projected Inverse Iteration: An Eigenvalue Approach to Ground-State Computation with
Neural
Quantum States
🧠
Deep Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Projected Inverse Iteration: An Eigenvalue Approach to Ground-State Computation with Neural Quantum States
PC
Layer
: Polynomial Weight Preconditioning for Improving LLM Pre-Training
💬
LLMs
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training
Synthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of
Layer-Local
Training
🤖
AI
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Synthetic Benchmarks Overstate Forward-Forward Scaling: Real-Data Limits of Layer-Local Training
Learning
Dynamics Reveal a Hierarchy of Weight-Induced
Layerwise
Gram
Metrics
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Learning Dynamics Reveal a Hierarchy of Weight-Induced Layerwise Gram Metrics
An Ensembled Latent Factor Model via Differential Evolution and
Gradient
Descent
Optimization
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for An Ensembled Latent Factor Model via Differential Evolution and Gradient Descent Optimization
Multilevel Stochastic
Gradient
Descent
for Risk-Averse PDE-Constrained Optimization
📈
Optimization
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Multilevel Stochastic Gradient Descent for Risk-Averse PDE-Constrained Optimization
Second-Order Path Kernel Interpolation Formulas in Machine
Learning
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Second-Order Path Kernel Interpolation Formulas in Machine Learning
DBHN-Net
: Dual-Branch Hybrid
Neural
Network For Low-Complexity Monaural Speech Enhancement
🤖
AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for DBHN-Net: Dual-Branch Hybrid Neural Network For Low-Complexity Monaural Speech Enhancement
Predictive Coding with Bayesian Priors via Proximal
Gradients
🎲
Probability
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Predictive Coding with Bayesian Priors via Proximal Gradients
Quantifying Uncertainty In Wide
Two-Layer
Neural
Networks
: On The Law Of The Limiting Fluctuation Process
🤖
AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Quantifying Uncertainty In Wide Two-Layer Neural Networks: On The Law Of The Limiting Fluctuation Process
Fourier fractal dimension to predict the generalization of
deep
neural
networks
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Fourier fractal dimension to predict the generalization of deep neural networks
Pretraining Recurrent
Networks
without Recurrence
🤖
AI
Content type:
Academic
arxiv.org
·
5d
5 days ago
Actions for Pretraining Recurrent Networks without Recurrence
Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters
📈
Optimization
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Uniform Stability and Generalization Error of GD and SGD on Fixed-Point Parameters
Pseudospectral Bounds for Transient Amplification in Coupled
Gradient
Descent
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for Pseudospectral Bounds for Transient Amplification in Coupled Gradient Descent
Attention at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal
Transformer
Kernels
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Attention at the Theoretical Minimum: A Mathematics of Arrays Framework for Memory-Optimal Transformer Kernels
AI from concrete to abstract: demystifying
artificial
intelligence to the general public
🤖
AI
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for AI from concrete to abstract: demystifying artificial intelligence to the general public
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help