Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RL, AI Agents, Game Playing, Policy Optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112480
posts in
1.88
s
Forge
: Scalable Agent
RL
Framework and Algorithm
minimax.io
·
1d
·
Discuss:
Hacker News
⛓️
LangChain
The
Bitter
Lesson Behind Building Agentic
RL
in Terminal Environments
faithful-almanac-add.notion.site
·
8h
✍️
Prompt Engineering
Provable
Offline Reinforcement Learning for Structured Cyclic
MDPs
arxiv.org
·
2d
🎓
RLHF
Explainable
Causal Reinforcement Learning for heritage language
revitalization
programs with inverse simulation verification
dev.to
·
20h
·
Discuss:
DEV
✍️
Prompt Engineering
OpenReview
for AI Agents
news.ycombinator.com
·
12h
·
Discuss:
Hacker News
🤖
AI
Power of Agent
assisted
coding and learning to
achieve
goals faster and cheaper
osm2pgsql.org
·
15h
·
Discuss:
DEV
📞
Function Calling
a
simulation
platform for human
behavior
simile.ai
·
11h
🎓
RLHF
DaVinci-Agency
: A
Shortcut
to Long-Horizon AI Agents
hackernoon.com
·
1d
🎭
Anthropic Claude
Wazir
Drop: a
tournament
winning board game AI engine
github.com
·
9h
·
Discuss:
Hacker News
🤖
AI
Distributionally
Robust Cooperative Multi-Agent Reinforcement Learning via Robust Value
Factorization
arxiv.org
·
2d
🎓
RLHF
Agentic AI Is Here — And
Governance
Is No Longer
Optional
dev.to
·
6h
·
Discuss:
DEV
🎭
Anthropic Claude
AI Learning
Platforms
trendhunter.com
·
17h
🧠
OpenAI
I built a platform that
enables
AI agents to
execute
complex tasks
manifest.new
·
6h
·
Discuss:
DEV
🎭
Anthropic Claude
NuPlan
: A closed-loop
ML-based
planning benchmark for autonomous vehicles
paperium.net
·
4h
·
Discuss:
DEV
🎭
Anthropic Claude
EP202
: MCP vs
RAG
vs AI Agents
blog.bytebytego.com
·
13h
🧠
OpenAI
MiniMax-AI/MiniMax-M2.5
github.com
·
16h
✍️
Prompt Engineering
AI Agents Now
ADAPT
To
Messy
Real-World Problems, Not Just Perfect Tests
quantumzeitgeist.com
·
2d
🎭
Anthropic Claude
Multi-armed
bandit
en.wikipedia.org
·
1d
🎓
RLHF
Brace
Yourself for the AI
Tsunami
wsj.com
·
1h
🤖
AI
Show HN: Fighting the War Against
Expensive
Reinforcement
Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
2d
·
Discuss:
Hacker News
🎓
RLHF
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help