Scour
🔄 Reinforcement Learning
Scoured 115864 posts in 2.67 s
Playing 20 Question Game with Policy-Based Reinforcement Learning
arxiv.org · 1d
🤝 International Relations
Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
arxiv.org · 2d
🚣 Rowing
StellarSk8board/bardacle: A metacognitive layer for AI agents - short-term memory that survives context loss
github.com · 19h · Discuss: Hacker News
🚣 Rowing
Traction Heroes Ep. 29: Delusion
jarango.com · 2d
🚣 Rowing
Backtracking Algorithms
algos.khourani.com · 18h
🚣 Rowing
Frequency-domain approach to automated and efficient multivariate kernel density estimation for probabilistic modeling
sciencedirect.com · 18h
🚣 Rowing
The Behavioral Shift Matrix: 4 Forces Reshaping Customer Retention
cmswire.com · 22h
🚣 Rowing
New Research Shows AI Agents Learn Altruism From Human Behavior
pymnts.com · 1d
🤝 International Relations
Show HN: The Control and Memory Layer for AI Agents
news.ycombinator.com · 20h · Discuss: Hacker News
🤝 International Relations
Mindreading, Driving, and Limitations for Self-Driving Cars
psychologytoday.com · 9h
🤝 International Relations
20 Agent-focused Experiments
fitziswriting.substack.com · 1d · Discuss: Substack
🌍 World Politics and Events
Tuning to Experiential Learning
sounding.com · 1d · Discuss: Hacker News
🚣 Rowing
Advancing AI benchmarking with Game Arena
dev.to · 15h · Discuss: DEV
🚣 Rowing
Slides from my AI presentation I gave to seniors, feel free to share
aititus.com · 14h · Discuss: Hacker News
🚣 Rowing
Choice as an emergent feature
oop.bearblog.dev · 2d
🚣 Rowing
Microsoft researchers crack AI guardrails with a single prompt
techradar.com · 15h
🤝 International Relations
Risk-preference-aware optimal scheduling and profit allocation of load aggregators and charging operators
sciencedirect.com · 13h
🤝 International Relations
I’m building a "Darwinian" software lab. AI agents generate apps, users kill the bad ones, and the survivors evolve.
freehuman.club · 16h · Discuss: r/SideProject
🤝 International Relations
AI Disruption
ma.tt · 9h
🚣 Rowing
Logic That Patterns Find
udara.io · 4h · Discuss: Hacker News
🚣 Rowing