Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RL, reward functions, policy gradient, agents, simulation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
184317
posts in
56.0
ms
Dynamical
Priors
as a Training Objective in Reinforcement Learning
🤖
AI
arxiv.org
·
6d
How to build custom reasoning agents with a
fraction
of the
compute
🧠
LLMs
venturebeat.com
·
1d
The Data
Layer
Tax for Robot Learning
🧠
LLMs
rerun.io
·
2h
·
Hacker News
context-labs/HALO:
Hierarchal
Agent Loop
Optimizer
🧠
AI Agents
github.com
·
16h
·
Hacker News
Artificial Intelligence:
Foundations
of
Computational
Agents
🧠
AI Agents
artint.info
·
2d
·
Hacker News
Deep Learning Weekly: Issue 453
⚙️
MLOps
deeplearningweekly.com
·
25m
How does
Reinforcement
Learning
Affect
Models
🧠
LLMs
lesswrong.com
·
3d
DEEP
Robotics
🤖
Robotics
youtube.com
·
2d
·
r/singularity
WHAT SHOULD — AND SHOULD NOT —
EVOLVE
IN
SELF-IMPROVING
MULTI-AGENT SYSTEMS?
🧠
AI Agents
interestingengineering.substack.com
·
1d
·
Substack
RL
, in
pictures
and videos
🚗
Autonomous Systems
suriya.cc
·
5d
Three
principles
for AI Agent
Configuration
🧠
AI Agents
ministryoftesting.com
·
1d
Adaptive home energy management to
self-motivated
user
preferences
via iterative LLM-augmented reinforcement learning
🧠
LLMs
sciencedirect.com
·
5d
Jaxpot
: Train self-play RL agents FAST by
parallelizing
environments on GPU
🧠
AI Agents
bardsai.substack.com
·
2d
·
Substack
Learning to
Orchestrate
Agents in Natural Language with the
Conductor
🧠
LLMs
openreview.net
·
2d
·
Hacker News
Accelerate RL
rollouts
by up to 50% with distribution-aware
speculative
decoding
⚙️
MLOps
together.ai
·
6d
Building Better Software with AI Agents: Why
Fundamentals
Still
Matter
🧠
AI Agents
youtu.be
·
2d
·
DEV
The Policy Picks the Policy
🧠
AI Agents
noise2signal.bearblog.dev
·
1d
Show HN: A live
autonomous
economic network for AI agents
🧠
AI Agents
ainetwork-global.github.io
·
3d
·
Hacker News
Getting Up to Speed on Multi-Agent Systems, Part 5:
Debate
, State, and
Coordination
🕸️
Distributed Systems
christophermeiklejohn.com
·
2d
Deep Policy Iteration for High-Dimensional Mean-Field Games with
Regenerative
Reformulation
🕸️
Distributed Systems
arxiv.org
·
11h
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help