Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Multi-Armed Bandits, Deep RL
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112399
posts in
812.4
ms
Control Reinforcement Learning: Token-Level
Mechanistic
Analysis via Learned
SAE
Feature Steering
arxiv.org
·
1d
🤖
LLMs
Rising Multi-Armed
Bandits
with Known
Horizons
arxiv.org
·
1d
🤖
Machine Learning
check out this
article
on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical Implementation
dev.to
·
2d
·
Discuss:
DEV
🤖
LLMs
Show HN:
Fighting
the War Against
Expensive
Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
23h
·
Discuss:
Hacker News
🤖
LLMs
A
Conceptual
Framework for Exploration
Hacking
lesswrong.com
·
14h
🤖
LLMs
Gibbs Measures from Deep Shaped
Multilayer
Perceptrons
link.aps.org
·
18h
🔥
PyTorch
Optimizing post-disaster road
restoration
with reinforcement learning: A
traveler-behavior-aware
approach
sciencedirect.com
·
15h
🔌
Embedded Systems
A training
principle
for
drifting
models
breno.bearblog.dev
·
20h
🔥
PyTorch
AI Beyond The
Chatbot
: The New Value
Chain
seekingalpha.com
·
18h
🤖
Machine Learning
BetaZero
V2: A Diffusion Model for Setting
Boulder
Problems
evmojo37.substack.com
·
8h
·
Discuss:
Substack
🔥
PyTorch
Owning
the AI
Pareto
Frontier
latent.space
·
9h
🏗️
System Design
Worlds
: A Simulation Engine for Agentic
Pentesting
dreadnode.io
·
8h
·
Discuss:
Hacker News
🏗️
System Design
Multi AI Agent Systems with
crewAI
deeplearning.ai
·
19h
🤖
AI
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
·
1d
🤖
AI
The
Classifier
Layer: Spam, Safety, Intent, Trust Stand Between You And The Answer via @sejournal, @
DuaneForrester
searchenginejournal.com
·
16h
🤖
Machine Learning
Optimal
timing
for
superintelligence
marginalrevolution.com
·
6h
🏗️
System Design
A “
Toolbox
”
Pipeline
for Robots That See, Read, and Act
hackernoon.com
·
7h
👁️
Computer Vision
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
11h
🤖
LLMs
My Honest And
Candid
Review of
Abacus
AI Deep Agent
kdnuggets.com
·
13h
🤖
Machine Learning
Custom
AI
Platforms
trendhunter.com
·
7h
🤖
AI
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help