Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
122282
posts in
713.6
ms
Control Reinforcement Learning: Token-Level
Mechanistic
Analysis via Learned
SAE
Feature Steering
arxiv.org
·
10h
🤖
AI
check out this
article
on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical Implementation
dev.to
·
2d
·
Discuss:
DEV
🤖
AI
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
3d
⚡
Query Optimization
Show HN:
Fighting
the War Against
Expensive
Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
8h
·
Discuss:
Hacker News
🤖
AI
A training
principle
for
drifting
models
breno.bearblog.dev
·
4h
🔀
Transformers
FinovateEurope
2026: From AI
Hype
To Bank‑Ready Execution
forrester.com
·
5h
🏗️
Data Engineering
Feedback
Control for Computer Systems
janert.org
·
7h
🌐
Distributed Systems
Recursive
self-improvement
from AI models
marginalrevolution.com
·
1d
·
Discuss:
Hacker News
🤖
AI
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
2d
🔀
Transformers
Researchers propose a self-distillation fix for ‘
catastrophic
forgetting
’ in LLMs
infoworld.com
·
5h
🌐
Distributed Systems
A
masterclass
in AI security
operations
redcanary.com
·
1h
🤖
AI
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
5d
🔀
Transformers
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
·
1d
🤖
AI
The 4 Mixture of Experts Architectures: How to Train
100B
Models at
10B
Cost
pub.towardsai.net
·
2h
🔀
Transformers
Generalized
Lanczos
method for systematic optimization of neural-network quantum states
link.aps.org
·
5h
🔀
Transformers
Show HN: A
minimal
online decision maker
decisionmaker.online
·
1d
·
Discuss:
Hacker News
🤖
AI
Digitizing
the "
Shokunin
": How we encoded a Master's hammer strike into AI
yusukekaizen.substack.com
·
8h
·
Discuss:
Substack
🤖
AI
Training A Small Language Model To
Outperform
Frontier Models On
CRM-Arena
neurometric.substack.com
·
3h
·
Discuss:
Substack
⚡
Query Optimization
Unlock
Growth With AI And Machine Learning
elearninginfographics.com
·
4h
🤖
AI
Training Data from Real-World Sources
lightningrod.ai
·
17h
🧭
Vector Databases
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help