Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-Learning, Policy Gradients, Game Theory, Decision Making
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
128313
posts in
2.20
s
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
1d
📊
Quantitative Finance
Playing
20 Question Game with Policy-Based
Reinforcement
Learning
arxiv.org
·
23h
🤖
AI Research
check out this article on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical
Implementation
dev.to
·
16h
·
Discuss:
DEV
💬
NLP
Decision-Based Artificial Intelligence and the Strategic
Reordering
of Military Power
inss.ndu.edu
·
12h
🤖
AI Research
Recursive
self-improvement
from AI models
marginalrevolution.com
·
9h
·
Discuss:
Hacker News
🤖
AI Research
For real
game-theoretic
reasoning, we need best response in
imperfect
information games
weyxie.bearblog.dev
·
1d
·
Discuss:
Hacker News
🤖
AI Research
Observe
emergent
behavior in autonomous multi-agent LLM networks
agents.glide2.app
·
11h
·
Discuss:
Hacker News
🤖
AI Research
ashworks1706/rlhf-from-scratch
: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
github.com
·
16h
·
Discuss:
Hacker News
💬
NLP
Entropic
Balance with Feedback Control: Information
Equalities
and Tight Inequalities
link.aps.org
·
15h
📊
Quantitative Finance
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
1d
🤖
AI Research
New Generative
Paradigm
:
Drifting
Model
mail.bycloud.ai
·
9h
🤖
AI Research
Risk-preference-aware
optimal scheduling and profit allocation of load
aggregators
and charging operators
sciencedirect.com
·
7h
📊
Quantitative Finance
JupyterPS/VBAF
: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation
github.com
·
15h
·
Discuss:
Hacker News
💬
NLP
Teaching
Reasoning
with Games
danonymous.bearblog.dev
·
1h
📊
Quantitative Finance
Augmentation of
frontoparietal
gamma-band phase coupling enhances human
altruistic
behavior
journals.plos.org
·
14h
💬
NLP
Advancing
AI
benchmarking
with Game Arena
dev.to
·
9h
·
Discuss:
DEV
🤖
AI Research
— ### Abstract We propose a reinforcement‑learning based framework for automatic coordination of multiple autonomous mobile robots (
AMRs
) performing
sl
...
freederia.com
·
4d
🤖
AI Research
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
3d
🤖
AI Research
Learning Optimization Tools
trendhunter.com
·
1d
👁️
Computer Vision
I’m building a "
Darwinian
" software lab. AI agents generate apps, users kill the bad ones, and the survivors
evolve
.
freehuman.club
·
10h
·
Discuss:
r/SideProject
🤖
AI Research
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help