Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎯 强化学习
RL, 奖励机制, 策略优化, 机器人控制
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
204051
posts in
37.6
ms
RubricEM
: Meta-RL with Rubric-guided Policy Decomposition beyond
Verifiable
Rewards
👁️
多模态AI
arxiv.org
·
4d
The hard core of alignment (is
robustifying
RL
)
⏱️
个人效率
lesswrong.com
·
2d
CatIF-RL
: Activity-Oriented Enzyme Sequence Design by
Steered
Inverse Protein Folding
⏱️
个人效率
biorxiv.org
·
15h
rl
for red
teaming
: training models to attack and defend themselves
⏱️
个人效率
castform.com
·
2d
·
Hacker News
Luke
Coffey
On How Tehran Has
Adapted
Kremlin Negotiation Tactics
🤖
人工智能、人形机器人、机器人商业化、具身智能、人机交互、AI创业相关
rferl.org
·
19h
SFT
, RL, and On-Policy Distillation Through a
Distributional
Lens (19 minute read)
👁️
多模态AI
nrehiew.github.io
·
6d
·
Hacker News
Your Daily
digest
for
AkademikLink
🔭
科技趋势
5 Dakikada Teknoloji Gündemi <team@aposto.com> via kill-the-newsletter.com
·
1d
Eric
Jang
– Building
AlphaGo
from scratch
🤖
手搓机器人、人生系统、有趣的AI工具
dwarkesh.com
·
1d
·
Hacker News
agreed
.
RL
is not (at least by itself) the way to alignment
⏱️
个人效率
twitter.macworks.dev
·
3d
GRIP-VLM
:
RL
for Efficient Vision-Language Models
👁️
多模态AI
startuphub.ai
·
2d
yikart/AiToEarn
: Let's use AI to Earn!
🛠️
独立开发
github.com
·
5d
Show HN: Watch a neural net discover
molecules
by
arguing
with itself
👁️
多模态AI
randman444.github.io
·
2d
·
Hacker News
Il
pieno
di
energia
!
📷
计算机视觉
maestroandrea.bearblog.dev
·
5d
Locked
Shields
2026:
RL
Joins Live-Fire Cyber Event
🔭
科技趋势
reversinglabs.com
·
2d
What rebuilding
AlphaGo
teaches
us about self-play, RL, and future of LLMs [video]
👁️
多模态AI
youtube.com
·
1d
·
Hacker News
,
Hacker News
DQN
vs
Rainbow
: 4.8x Score Gain From 6 Extensions
👁️
多模态AI
tildalice.io
·
5d
UAE
Denies
Netanyahu
Visited
During Iran War
🔭
科技趋势
beijingbulletin.com
·
2d
RL-Based
Retargeting
Method For
Transferring
Human Motion To Robots
👁️
多模态AI
80.lv
·
4d
Top House
Republican
Says No New US Ukraine
Supplemental
Likely, Backs More Russia Sanctions
⏱️
个人效率
rferl.org
·
18h
BalCapRL
: A Balanced Framework for RL-Based
MLLM
Image Captioning
👁️
多模态AI
machinelearning.apple.com
·
6d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help