Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
RL, Rewards, Agents, Policy, Q-learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7557
posts in
9.0
ms
QSIM
: Mitigating
Overestimation
in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning
arxiv.org
·
1d
🎯
RLHF
Reinforcement
Learning for LLMs
mesuvash.github.io
·
2d
·
Discuss:
Hacker News
🎯
RLHF
Simulation
for Agentic
Evaluation
yortuc.com
·
13h
·
Discuss:
Hacker News
🎯
AI Agents
Stochastic
Optimal Control with Side Information and
Bayesian
Learning
arxiv.org
·
2d
📊
Bayesian Statistics
A
multi-objective
graph reinforcement learning framework for urban public facility
location
problem
sciencedirect.com
·
1d
🤖
Game AI
A Coding Implementation to Build a
Hierarchical
Planner
AI Agent Using Open-Source LLMs with Tool Execution and Structured Multi-Agent Reasoning
marktechpost.com
·
13h
🤖
Agentic AI
Sutton
&
Barto
, Ch. 08: Planning & Learning with Tabular Methods (Personal Notes)
chizkidd.github.io
·
2d
·
Discuss:
Hacker News
🤖
Game AI
Schelling
Goodness
, and Shared Morality as a Goal
lesswrong.com
·
11h
♟️
Game Theory
Approximation
Game
lcamtuf.substack.com
·
13h
·
Discuss:
Substack
♾️
Set Theory
New AI
Steering
Method Exposes
Flaws
and Potential Improvements
nationaltoday.com
·
10h
🎯
RLHF
Driving the Edge:
YOLOv11
Autonomous Mastery with
MentorPi
hackster.io
·
8h
🧠
Deep Learning
A
Generalizable
MARL-LP
Approach for Scheduling in Logistics
towardsdatascience.com
·
2d
📈
Algorithmic Trading
Show HN:
EK-1
– A local-first,
sovereign
AI agent built in Go and Rust
egokernel.com
·
1d
·
Discuss:
Hacker News
🤖
Game AI
RKGF
Loop
mdh.bearblog.dev
·
13h
🤖
Automated Reasoning
How to Create an AI Agent with the Claude Agent
SDK
shinzo.ai
·
15h
🎯
AI Agents
Learning about
automated
prompts
marcabraham.com
·
8h
✍️
Prompt Engineering
An
Introduction
to
Lean
4
uv.es
·
4h
🤖
Automated Reasoning
Autonomous
AI agents that make money for their
keeper
quoroom.ai
·
2d
·
Discuss:
Hacker News
🤖
Game AI
The Humanoid Robot
Generalization
Problem Has a New
Blueprint
hackernoon.com
·
2d
📈
Optimization
Microsoft Open Sources
Evals
for Agent
Interop
Starter Kit to Benchmark Enterprise AI Agents
infoq.com
·
1d
🎯
AI Agents
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help