Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-Learning, Policy Gradients, Game Theory, Decision Making
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123809
posts in
972.8
ms
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
2d
📊
Quantitative Finance
Playing
20 Question Game with Policy-Based
Reinforcement
Learning
arxiv.org
·
1d
🤖
AI Research
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
·
3h
🤖
AI Research
check out this article on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical
Implementation
dev.to
·
1d
·
Discuss:
DEV
💬
NLP
Decision-Based Artificial Intelligence and the Strategic
Reordering
of Military Power
inss.ndu.edu
·
1d
🤖
AI Research
Show HN: A
minimal
online decision maker
decisionmaker.online
·
4h
·
Discuss:
Hacker News
📊
Quantitative Finance
Recursive
self-improvement
from AI models
marginalrevolution.com
·
23h
·
Discuss:
Hacker News
🤖
AI Research
Architectural and Mathematical
Foundations
of Machine Learning: A
Rigorous
Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io
·
4h
·
Discuss:
Hacker News
👁️
Computer Vision
Instability of cooperation based on
fictitious
belief: an experiment with artificial
supernatural
punishment
nature.com
·
17h
🤖
AI Research
For real
game-theoretic
reasoning, we need best response in
imperfect
information games
weyxie.bearblog.dev
·
2d
·
Discuss:
Hacker News
🤖
AI Research
Observe
emergent
behavior in autonomous multi-agent LLM networks
agents.glide2.app
·
1d
·
Discuss:
Hacker News
🤖
AI Research
The Machine Learning
Practitioner
’s Guide to
Speculative
Decoding
machinelearningmastery.com
·
6h
💬
NLP
ashworks1706/rlhf-from-scratch
: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
github.com
·
1d
·
Discuss:
Hacker News
💬
NLP
Part 2 - AI Chat Evaluation of the Formal Language in He
Xin
's
PEPC
System
news.ycombinator.com
·
1h
·
Discuss:
Hacker News
💬
NLP
Entropic
Balance with Feedback Control: Information
Equalities
and Tight Inequalities
link.aps.org
·
1d
📊
Quantitative Finance
Gradient-based identification of
hydraulic
resistance for optimal pump control in
meshed
district heating network
sciencedirect.com
·
3h
📊
Quantitative Finance
The Generative AI
Oligopoly
: How Big Tech is Building “Old
Moats
” for the New Era (2024–2026)
pub.towardsai.net
·
2h
🤖
AI Research
Organizational
Strategies from the Collective
Wisdom
of Nature
oreilly.com
·
5h
🌐
Distributed Systems
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
1d
🤖
AI Research
New Generative
Paradigm
:
Drifting
Model
mail.bycloud.ai
·
22h
🤖
AI Research
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help