🤖 Reinforcement Learning Agents
Scoured 10221 posts in 141.5 ms
Multi-Agent Reinforcement Learning (MARL): Practical Guide to Cooperative and Competitive Learning
dev.to · 2d · Discuss: DEV
🤖 Swarm Robotics
On Economics of A(S)I Agents
lesswrong.com · 8h
🧠 AI
Rationality Measurement and Theory for Reinforcement Learning Agents
arxiv.org · 2d
🤖 AI agents
Building the Future with AI That Acts
devxt.com · 4h · Discuss: Hacker News
🧠 AI
Why AI Agents Make Different Decisions When They Think It's Real
dev.to · 4h · Discuss: DEV
🏗️ AI Infrastructure
Reinforcement World Model Learning for LLM-based Agents
arxiv.org · 1d
💻 Local LLMs
A Reputation System for Surveyors
tbr.bearblog.dev · 7h
🤖 AI agents
Agentic Coding and the Problem of Oracles
epkconsulting.substack.com · 8h · Discuss: Substack, r/programming
🤖 AI agents
i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy
github.com · 1d · Discuss: Hacker News
🏗️ AI Infrastructure
Continual learning and the post monolith AI era
baseten.co · 1d · Discuss: Hacker News
🏗️ AI Infrastructure
Mappa – Fine-tune ANY multi-agent LLM systems end-to-end with AI coaches
news.ycombinator.com · 3d · Discuss: Hacker News
🤖 AI agents
EP201: The Evolution of AI in Software Development
blog.bytebytego.com · 11h
🤖 AI Coding Tools
Distributed Reinforcement Learning for Scalable High-Performance Policy Optimization
towardsdatascience.com · 6d
🏗️ AI Infrastructure
The control layer for AI
blog.dottxt.ai · 1d · Discuss: Hacker News
🏗️ AI Infrastructure
Barn Owls Know When to Wait (iuSTDP part 2)
blog.typeobject.com · 6h · Discuss: Hacker News
🧠 Neuromorphic Hardware
The Rapid Transition from Coding Agents to Agents
gearsofmedicine.com · 3d
🤖 AI agents
Your Best Thinking Is Wasted on the Wrong Decisions
iankduncan.com · 6h · Discuss: Lobsters, Hacker News
⚡ Incremental Computation
The Agentic Inversion Principle
mikebrevoort.com · 1d · Discuss: Hacker News
🤖 AI agents
Turning Coding Tasks into Feedback Loops
feipeng.substack.com · 2d · Discuss: Substack
🤖 AI Coding Tools
Quantization-Aware Distillation
ternarysearch.blogspot.com · 56m · Discuss: Hacker News
💻 Local LLMs