🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Reinforcement Learning from Human Feedback
arxiv.org · 1d
🤖 AI
Hybrid neural–cognitive models reveal how memory shapes human reward learning
nature.com · 2d
🔀 Transformers
Difficulty-Estimated Policy Optimization
arxiv.org · 7h
🤖 AI
🥇 Top AI Papers of the Week
nlp.elvissaravia.com · 21h
🤖 AI
i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy
github.com · 2d · Discuss: Hacker News
🤖 AI
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time Obstacle Prediction ...
freederia.com · 3d
🤖 AI
Adaptive Neuro-Symbolic Planning for smart agriculture microgrid orchestration in hybrid quantum-classical pipelines
dev.to · 1d · Discuss: DEV
🌐 Distributed Systems
Main Content || Math ∩ Programming
jeremykun.com · 13h
🧭 Vector Databases
(8) AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents
arxiviq.substack.com · 1h · Discuss: Substack
🤖 AI
Why reinforcement learning breaks at scale, and how a new method fixes it
techxplore.com · 4d
🌐 Distributed Systems
Part 5: Reward Engineering: How to Shape Behaviors in Financial/Robotic Tasks
dev.to · 3d · Discuss: DEV
🔧 Feature Engineering
From Prediction to Compilation: A Manifesto for Intrinsically Reliable AI
news.ycombinator.com · 23h · Discuss: Hacker News
🤖 AI
Building LLMs in Resource-Constrained Environments: A Hands-On Perspective
infoq.com · 41m
🔧 Feature Engineering
AI Agents as Accountability Partners: Configurable Nudging for Your Goals
blog.turtleand.com · 16h · Discuss: DEV
🤖 AI
On Recursive Self-Improvement (Part I)
hyperdimensional.co · 1h
🤖 AI
25W06. Learning a language with the machine
z1nz0l1n.com · 1d
🔀 Transformers
Choice as an emergent feature
oop.bearblog.dev · 17h
🤖 AI
Drifting models
breno.bearblog.dev · 1h
🤖 AI
Abstract: This paper introduces Automated Pedagogical Content Adaptation through Granular Knowledge Graph & Reinforcement Learning (GPKG-RL), a syst ...
freederia.com · 2d
🔀 Transformers
Deep reinforcement learning-based energy scheduling for green buildings with stationary and EV batteries of heterogeneous characteristics
sciencedirect.com · 2d
⚡ Query Optimization