Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
186654
posts in
51.4
ms
Policy
Improvement
Reinforcement
Learning
✨
Generative AI
arxiv.org
·
2d
How does
Reinforcement
Learning
Affect
Models
✨
Generative AI
lesswrong.com
·
3d
Deep Learning Weekly: Issue 453
✨
Generative AI
deeplearningweekly.com
·
13h
Context
Engineering for Agents
🤖
AI
rlancemartin.github.io
·
5h
Three
principles
for AI Agent
Configuration
🤖
AI
ministryoftesting.com
·
2d
Is your AI strategy missing a "Safety Net"?🛡️
🤖
AI
turingpost.com
·
7h
Artificial Intelligence:
Foundations
of
Computational
Agents
🤖
AI
artint.info
·
3d
·
Hacker News
The Data
Layer
Tax for Robot Learning
🤖
Machine Learning
rerun.io
·
15h
·
Hacker News
WHAT SHOULD — AND SHOULD NOT —
EVOLVE
IN
SELF-IMPROVING
MULTI-AGENT SYSTEMS?
✨
Generative AI
interestingengineering.substack.com
·
2d
·
Substack
Red-teaming
a network of agents: Understanding what breaks when AI agents
interact
at scale
🤖
AI
microsoft.com
·
6h
Jaxpot
: Train self-play RL agents FAST by
parallelizing
environments on GPU
🤖
AI
bardsai.substack.com
·
2d
·
Substack
Agents,
Consciousness
, and the Future of AI
✨
Generative AI
youtube.com
·
4d
Long-running Agents
✨
Generative AI
addyo.substack.com
·
13h
·
Substack
The Policy Picks the Policy
🤖
AI
noise2signal.bearblog.dev
·
2d
Agent
Sandboxes
at Scale: A
Distributed
Systems Design for AI-Driven Development
🤖
AI
medium.com
·
15h
RL
, in
pictures
and videos
🤖
AI
suriya.cc
·
6d
The
Leap
to
Angentic
AI
✨
Generative AI
profbachman.substack.com
·
2d
·
Substack
Alibaba's
Metis
agent cuts
redundant
AI tool calls from 98% to 2% — and gets more accurate doing it
🤖
AI
venturebeat.com
·
7h
Secure AI and Agent Coding Policy
🤖
AI
galdren.com
·
1d
·
Hacker News
Stopping the quiet drift toward
excessive
agency with
re-permissioning
✨
Generative AI
csoonline.com
·
19h
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help