⛰️ Gradient Descent
Optimization, Learning Rate, Backpropagation, Convergence
Scoured 122648 posts in 1.81 s
Natural Hypergradient Descent: Algorithm Design, Convergence Analysis, and Parallel Implementation
arxiv.org · 12h
🎪 Convex Optimization
Don't give away to the gradient descent
carteakey.dev · 17h · Discuss: Hacker News
📊 Empirical Bayes
Gradient Residual Connections
arxiv.org · 1d
🎪 Convex Optimization
Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training
chipublib.idm.oclc.org · 1d
🔢 Embeddings
Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io · 1d · Discuss: Hacker News
🗺️ Manifold Learning
Gibbs Measures from Deep Shaped Multilayer Perceptrons
link.aps.org · 5h
📊 Empirical Bayes
Microgpt.py
gist.github.com · 20h · Discuss: Hacker News, Hacker News
🗂️ AnnData
A training principle for drifting models
breno.bearblog.dev · 6h
🎪 Convex Optimization
Learning Optimization Tools
trendhunter.com · 2d
🎪 Convex Optimization
Grassmannian Manifold Learning: Optimization and Deep Learning Architectures
hackernoon.com · 1d
🗺️ Manifold Learning
Active learning Kriging with functional dimension reduction for reliability analysis of stochastic dynamical systems
sciencedirect.com · 1h
🔗 Markov Chains
The 4 Mixture of Experts Architectures: How to Train 100B Models at 10B Cost
pub.towardsai.net · 4h
🔲 Zarr
Building a Robust Classifier with Stacked Generalization
dev.to · 2d · Discuss: DEV
🗺️ UMAP
UbiquitousLearning/mllm: Fast Multimodal LLM on Mobile Devices
github.com · 8h
🦠 Whole cell model
LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling
huggingface.co · 1h · Discuss: Hacker News
📄 FASTQ
How Andrej Karpathy Built a Working Transformer in 243 Lines of Code
analyticsvidhya.com · 4h
🔲 Zarr
EyesOff: Why Some Models Quantize Better Than Others
ym2132.github.io · 18h · Discuss: Hacker News
🗺️ UMAP
New Generative Paradigm: Drifting Model
mail.bycloud.ai · 1d
📊 Empirical Bayes
Hybrid meta-optimized GNN network to optimize pitch angle and active power of wind turbines for reducing fatigue load
sciencedirect.com · 1d
🎪 Convex Optimization
Wahba’s Problem and SO(3) Optimization: Rotation Learning in Geometric ML
hackernoon.com · 1d
🎪 Convex Optimization