Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
144859
posts in
26.2
ms
Sutton
&
Barto
, Ch. 08: Planning & Learning with Tabular Methods (Personal Notes)
chizkidd.github.io
·
3d
·
Discuss:
Hacker News
📊
Dynamic Programming
QSIM
: Mitigating
Overestimation
in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning
arxiv.org
·
1d
⚓
Anchors
Shared effects of one’s own and others’
experiences
during reinforcement learning on
episodic
memory
nature.com
·
21h
🎴
Anki
The Decision
Ladder
Model for
Skilled
Performance
dev.to
·
4h
·
Discuss:
DEV
📊
Dynamic Programming
Aligning
Few-Step Diffusion Models with
Dense
Reward Difference Learning
arxiv.org
·
1d
📊
Optimization
Reinforcement
Learning for LLMs
mesuvash.github.io
·
2d
·
Discuss:
Hacker News
📊
Dynamic Programming
Spatio-temporal
dynamic graph neural network-based missing
measurement
recovery method for power system state estimation
sciencedirect.com
·
1h
🕸️
Graph Theory
The Power of the
Intentional
Pause
psychologytoday.com
·
3h
🎴
Anki
The Lie algebra of XY-mixer
topologies
and warm starting
QAOA
for constrained optimization
nature.com
·
1d
⚛️
Quantum Computing
New AI
Steering
Method Exposes
Flaws
and Potential Improvements
nationaltoday.com
·
15h
💬
Prompt Engineering
Read, Learn,
Improve
sagetheanalyst.com
·
18h
🎴
Anki
I Taught My Mom
Bayes
’
Theorem
In 10 Minutes. She Taught Me Patience.
pub.towardsai.net
·
16h
⏰
Lamport Clocks
anadim/AdderBoard
: Smallest transformer that can add two 10-digit numbers
github.com
·
7h
🌳
Pratt Parsing
AI
Mindset
Guides
trendhunter.com
·
1d
💬
Prompt Engineering
Probabilistic Graph Neural Inference for bio-inspired soft robotics maintenance with ethical
auditability
baked
in
dev.to
·
11h
·
Discuss:
DEV
📱
Edge AI
Reliability modeling and
multi-threshold
maintenance optimization for
competing-failure
systems with discrete-state functional modules
sciencedirect.com
·
7h
💓
PHI Accrual
Brain
learns
faster from rare rewards than from
repetition
spacewar.com
·
1d
🎴
Anki
Approximate
Normalizations
for Approximate Density
Functionals
link.aps.org
·
1d
🔢
Homomorphic Encryption
Schelling
Goodness
, and Shared Morality as a Goal
lesswrong.com
·
16h
🌊
CALM Theorem
How the Brain and AI Reuse Old Knowledge in New
Situations
,
Kempner
Institute
kempnerinstitute.harvard.edu
·
2d
🧮
Embeddings
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help