🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Multi-Armed Bandits, Deep RL
Scoured 81038 posts in 712.0 ms
Reinforcement Learning from Human Feedback
arxiv.org · 1d
♟️ Game Theory
🥇Top AI Papers of the Week
nlp.elvissaravia.com · 18h
🧮 Algorithms
Hybrid neural–cognitive models reveal how memory shapes human reward learning
nature.com · 1d
🤖 Machine Learning
Multi-Agent Reinforcement Learning (MARL): Practical Guide to Cooperative and Competitive Learning
dev.to · 3d · Discuss: DEV
♟️ Game Theory
Adaptive Exploration for Latent-State Bandits
arxiv.org · 3d
🗣️ LLMs
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time Obstacle Prediction ...
freederia.com · 3d
🔥 PyTorch
Main Content || Math ∩ Programming
jeremykun.com · 10h
🧮 Algorithms
i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy
github.com · 2d · Discuss: Hacker News
🤖 AI
Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines
dev.to · 23h · Discuss: DEV
♟️ Game Theory
6 AI Agents, One Company
voxyz.space · 13m
🤖 AI
AI Agents as Accountability Partners: Configurable Nudging for Your Goals
blog.turtleand.com · 13h · Discuss: DEV
🤖 AI
Quantization-Aware Distillation
ternarysearch.blogspot.com · 1d · Discuss: Hacker News
🤖 Machine Learning
Choice as an emergent feature
oop.bearblog.dev · 14h
🎮 Game Design
Why reinforcement learning breaks at scale, and how a new method fixes it
techxplore.com · 4d
🗣️ LLMs
Performance Tip of the Week #94: Decision making in a data-imperfect world
abseil.io · 1d
🤖 Machine Learning
On Economics of A(S)I Agents
lesswrong.com · 1d
♟️ Game Theory
Scientists reveal the alien logic of AI: hyper-rational but stumped by simple concepts
psypost.org · 1d
♟️ Game Theory
Cooperative Autonomous Navigation of Legged Robots in Unstructured Terrains Using Hierarchical Reinforcement Learning
Abstract: Legged robotic plat ...
freederia.com · 2d
🤖 AI
I Let AI Agents Train Their Own Models. Here's What Actually Happened.
hamzamostafa.com · 4h · Discuss: Hacker News
🤖 AI
Label-Consistent Backdoor Attacks
paperium.net · 16h · Discuss: DEV
🤖 Machine Learning