Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
83088
posts in
1.51
s
GAS: Enhancing
Reward-Cost
Balance of Generative Model-assisted Offline Safe
RL
arxiv.org
·
1d
🤖
AI
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time
Obstacle
Prediction **
Abstra
...
freederia.com
·
1d
🤖
AI
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
·
11h
·
Discuss:
Hacker News
🤖
AI
Part 5: Reward Engineering: How to Shape
Behaviors
in
Financial/Robotic
Tasks
dev.to
·
1d
·
Discuss:
DEV
🔧
Feature Engineering
Beyond
Rewards
in Reinforcement Learning for Cyber
Defence
arxiv.org
·
2d
🌐
Distributed Systems
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
·
2d
🌐
Distributed Systems
Continual
learning and the post
monolith
AI era
baseten.co
·
9h
·
Discuss:
Hacker News
🔀
Transformers
Deep reinforcement learning-based energy scheduling for green buildings with
stationary
and EV batteries of heterogeneous
characteristics
sciencedirect.com
·
12h
⚡
Query Optimization
**Abstract:** This paper introduces Automated Pedagogical Content Adaptation through Granular Knowledge Graph & Reinforcement Learning (
GPKG-RL
), a
syst
...
freederia.com
·
13h
🔀
Transformers
Rethinking
imitation
learning with Predictive
Inverse
Dynamics Models
microsoft.com
·
1d
🤖
AI
Distributed
Reinforcement Learning for
Scalable
High-Performance Policy Optimization
towardsdatascience.com
·
5d
🌐
Distributed Systems
Multi-Agent Reinforcement Learning (
MARL
): Practical Guide to
Cooperative
and Competitive Learning
dev.to
·
1d
·
Discuss:
DEV
🌐
Distributed Systems
In (highly
contingent
!) defense of
interpretability-in-the-loop
ML training
lesswrong.com
·
15h
🔀
Transformers
Humane
, adaptive AI
bootstrapping
natemeyvis.com
·
17h
🤖
AI
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
1d
·
Discuss:
Hacker News
🧭
Vector Databases
Exploiting
large language model with reinforcement learning for generative job
recommendations
eurekalert.org
·
1d
🔀
Transformers
Finding all the roots of a
polynomial
using the
QR
algorithm
johndcook.com
·
8h
🤖
AI
Exit
Strategy
joelchrono.xyz
·
4h
⚡
Query Optimization
AI ‘thinking Budget’ Revealed In
Landmark
Study Of
Self-Reflecting
Machines
quantumzeitgeist.com
·
15h
🔀
Transformers
Agent
Evaluation
: How to Test and
Measure
Agentic AI Performance
machinelearningmastery.com
·
1d
⚡
Query Optimization
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help