Scour
🎯 Reinforcement Learning
RL, reward mechanisms, policy optimization, robot control
Scoured 204255 posts in 127.2 ms
PyTorch DevLog
🧠 LLM fine-tuning
docs.pytorch.org · 5d · Hacker News
Iran Names Internet Chief As Shutdown Reaches 75 Days
🛠️ Indie development
sanantoniopost.com · 3d
Peng's Q($\lambda$) for Conservative Value Estimation in Offline Reinforcement Learning
👁️ Multimodal AI
arxiv.org · 1d
Welcome
🔭 Tech trends
onionui.github.io · 21h
Qalibaf Says US 'Has No Alternative' But To Accept Iran's Terms
🛠️ Indie development
parisguardian.com · 4d
Tech News #294: 26-05-16
🔭 Tech trends
blog.outsider.ne.kr · 18h
Gradient Bang - A game played by conversing with large language models
👁️ Multimodal AI
producthunt.com · 21h
Reinforcement Learning, Agency and Taste
👁️ Multimodal AI
lesswrong.com · 4d
Elon Musk on the "payoku" (left-wingers): "The fundamental moral flaw of the left is that it has empathy for criminals but none for victims"
⏱️ Personal productivity
moeasia.net · 22h
PPO vs SAC Sparse Rewards: 3x Sample Efficiency Gap
👁️ Multimodal AI
tildalice.io · 3d
Jannik Sinner beats Daniil Medvedev to reach Italian Open final after overnight rain delay
✍ Biographies, handicrafts
nytimes.com · 12h
Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads
👁️ Multimodal AI
castform.com · 5d · Hacker News
Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation
👁️ Multimodal AI
arxiv.org · 2d
How Dirty Frag rose from the Copy Fail exploit
⏱️ Personal productivity
reversinglabs.com · 4d
🐎 💨
🚀 Tech founders
youtube.com · 13h
🎲 Study
✍ Biographies, handicrafts
jeankapsa.com · 21h
☕️ Commencing with AI
👁️ Multimodal AI
Morning Brew via kill-the-newsletter.com · 17h
Former NFL defensive lineman Josh Mauro died from fentanyl, cocaine, ethanol overdose
📷 Computer vision
nytimes.com · 8h
Some recent articles on language and linguistics
🎙️ Voice interaction
languagelog.ldc.upenn.edu · 5h
AIS: Adaptive Importance Sampling for Quantized RL
👁️ Multimodal AI
arxiv.org · 1d