Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
RL
🎮 RL
Specific
reinforcement learning, RLHF, reward model, policy gradient
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
453
posts in
7.6
ms
Semi-finalists confirmed in Secondary Schools Volleyball Competition
🦾
Robotics
cbc.bb
·
1d
1 day ago
Actions for Semi-finalists confirmed in Secondary Schools Volleyball Competition
Photos: Syracuse Views Through the Decades
🦾
Robotics
Content type:
Academic
news.syr.edu
·
2d
2 days ago
Actions for Photos: Syracuse Views Through the Decades
Reinforcement
Learning
Disrupts
Gradient-Based
Adversarial Optimization
🕵️
AI Agents
Content type:
Academic
arxiv.org
·
11h
11 hours ago
Actions for Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization
Some Interesting Papers on RLVR
🔓
Open-source Models
lesswrong.com
·
1d
1 day ago
Actions for Some Interesting Papers on RLVR
Posting for authoring
💡
AI Reasoning
turingpost.com
·
3d
3 days ago
Actions for Posting for authoring
Central College News
🦾
Robotics
Content type:
Academic
news.central.edu
·
4d
4 days ago
Actions for Central College News
Why LLMs (still) lack taste
🧠
LLMs
beyondtheprior.com
·
2d
2 days ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Reinforcement
learning
in linear embedding space unlocks generalizable control across soft robot configurations
🤖
Embodied AI
Content type:
Academic
nature.com
·
3d
3 days ago
Actions for Reinforcement learning in linear embedding space unlocks generalizable control across soft robot configurations
Macrodata Refiner – infrastructure for the robotics data loop
🦾
Robotics
macrodata.co
·
6h
6 hours ago
·
Hacker News
Actions for Macrodata Refiner – infrastructure for the robotics data loop
Deep
Reinforcement
Learning
for Adaptive Power Allocation in ISAC Systems with Mobile Target
🕵️
AI Agents
Content type:
Academic
arxiv.org
·
11h
11 hours ago
Actions for Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target
Combermere and Harrison College reach Under-15 basketball final
🦾
Robotics
cbc.bb
·
4d
4 days ago
Actions for Combermere and Harrison College reach Under-15 basketball final
What is MBPO? A Beginner’s Guide to Efficient
Reinforcement
Learning
🎭
Multimodal AI
Content type:
Blog
ujangriswanto08.medium.com
·
6d
6 days ago
Actions for What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning
Stack Overflow didn't just help AI
learn
to code
🧠
LLMs
zozo123.github.io
·
4d
4 days ago
·
Hacker News
Actions for Stack Overflow didn't just help AI learn to code
How AI chatbots become better
learning
coaches
🧠
LLMs
techxplore.com
·
14h
14 hours ago
Actions for How AI chatbots become better learning coaches
Bridging Multi-Vector and
Learned-Sparse
Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!
👁️
VLMs
Content type:
News
Content type:
Blog
recsys.substack.com
·
5d
5 days ago
·
Substack
Actions for Bridging Multi-Vector and Learned-Sparse Retrieval, A Diagnostic Framework for Robust Semantic IDs, and More!
Plan-and-Verify Video
Reward
Reasoning with Spatio-Temporal Scene Graph Grounding
💡
AI Reasoning
Content type:
Academic
arxiv.org
·
11h
11 hours ago
Actions for Plan-and-Verify Video Reward Reasoning with Spatio-Temporal Scene Graph Grounding
Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep
reinforcement
learning
🕵️
AI Agents
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Edge AI enabled MIMO MC-CDMA for 6G optimizing spectrum and energy efficiency with SIC and deep reinforcement learning
AI-powered living business intelligence network
💹
AI in Finance
atlasforgex.com
·
1d
1 day ago
·
Hacker News
Actions for AI-powered living business intelligence network
BYD Dahan Official Images Unveiled, Targeting High-End Market
🔓
Open-source Models
autonews.gasgoo.com
·
2d
2 days ago
Actions for BYD Dahan Official Images Unveiled, Targeting High-End Market
Phi-Actor-Critic
: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
🕵️
AI Agents
Content type:
Academic
arxiv.org
·
11h
11 hours ago
Actions for Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help