Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎮 Reinforcement Learning
Specific
RL, reward functions, policy gradient, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
187609
posts in
13.8
ms
Policy
Improvement
Reinforcement
Learning
🧠
LLMs
arxiv.org
·
3d
[R]
Dense
process rewards from LLM feedback for multi-agent credit
assignment
🕵️
AI Agents
reddit.com
·
17h
·
r/reinforcementlearning
The Data
Layer
Tax for Robot Learning
🤖
Machine Learning
rerun.io
·
1d
·
Hacker News
Staying
in Control with AI Agents
🕵️
AI Agents
ministryoftesting.com
·
4h
Extrapolating
optimal
selective
maintenance strategy in new environments: A meta-reinforcement learning approach
🕵️
AI Agents
sciencedirect.com
·
22h
Reinforcement
fine-tuning
with LLM-as-a-judge
🧠
LLMs
aws.amazon.com
·
1d
How does
Reinforcement
Learning
Affect
Models
🧠
LLMs
lesswrong.com
·
5d
ltjed.github.io/MAPPA
/
⚙️
Automation
ltjed.github.io
·
17h
Every Model Learned by Gradient
Descent
Is
Approximately
a Kernel Machine
🤖
Machine Learning
news.ycombinator.com
·
1d
·
Hacker News
How Do Self-Learning AI Agents
Differ
from
Traditional
Machine Learning Models and Current LLM-Based Agents?
🤖
AI
kucoin.com
·
4h
Why agentic AI
governance
is
falling
short – and what we can do about it
🕵️
AI Agents
siliconangle.com
·
16h
Thrml
-
Probabilistic
Compute Simulation on GPUs
🧠
LLMs
docs.thrml.ai
·
4h
·
Hacker News
Reinforced
Agent: Inference-Time Feedback for
Tool-Calling
Agents
🕵️
AI Agents
machinelearning.apple.com
·
1d
A new GitHub
repo
to detect reward hacking in
RL
models
🤖
Machine Learning
github.com
·
6d
·
Hacker News
https://
research.perplexity.ai/articles/designing-refining-and-maintaining-agent-skills-at-perplexity
🕵️
AI Agents
research.perplexity.ai
·
17h
Sukino
's Findings: A Practical Index to AI
Roleplay
🕵️
AI Agents
rentry.org
·
32m
There Will Be a
Scientific
Theory of Deep Learning
🤖
AI
mail.bycloud.ai
·
2d
Automating
Neurosurgery
with Robotics
⚙️
Automation
youtube.com
·
19h
·
r/singularity
How to build custom reasoning agents with a
fraction
of the
compute
🧠
LLMs
venturebeat.com
·
3d
A
game-theoretic
framework for multimodal information
utilization
under heterogeneous processing environments in neuroscience and perception science
📊
Data Science
frontiersin.org
·
1d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help