科学空间|Scientific Spaces
kexue.fm
Making Alchemy a Bit More Scientific (5): Fine-Tuning the Learning Rate Based on Gradients
kexue.fm · 18w
Making Alchemy a Bit More Scientific (4): New Identity, New Learning Rate
kexue.fm · 20w
Why Does DeltaNet Need L2 Normalize?
kexue.fm · 20w
Making Alchemy a Bit More Scientific (3): Last-Iterate Loss Convergence of SGD
kexue.fm · 21w
Making Alchemy a Bit More Scientific (2): Extending the Conclusions to Unbounded Domains
kexue.fm · 22w
Weight Decay and Learning Rate from a Moving-Average Perspective
kexue.fm · 23w