Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
124154
posts in
1.57
s
check out this article on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical
Implementation
dev.to
·
1d
·
Discuss:
DEV
🤖
AI
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
2d
⚡
Query Optimization
On-Policy Policy
Gradient
Reinforcement Learning Without On-Policy
Sampling
arxiv.org
·
17h
🤖
AI
Recursive
self-improvement
from AI models
marginalrevolution.com
·
1d
·
Discuss:
Hacker News
🤖
AI
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
1d
🔀
Transformers
Architectural and Mathematical
Foundations
of Machine Learning: A
Rigorous
Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io
·
8h
·
Discuss:
Hacker News
🤖
ML
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
·
8h
🤖
AI
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
4d
🔀
Transformers
New Generative
Paradigm
:
Drifting
Model
mail.bycloud.ai
·
1d
🔀
Transformers
Embedded
Agency
(full-text version)
lesswrong.com
·
8h
🔀
Transformers
Show HN: A
minimal
online decision maker
decisionmaker.online
·
9h
·
Discuss:
Hacker News
🤖
AI
Memory and Learning
layer
be built in-house or bought
externally
?
medium.com
·
1d
·
Discuss:
Hacker News
🌐
Distributed Systems
ashworks1706/rlhf-from-scratch
: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
github.com
·
1d
·
Discuss:
Hacker News
🔀
Transformers
Training Data from Real-World Sources
lightningrod.ai
·
19m
🧭
Vector Databases
1.8x Increase in Training Speed, 78% Reduction in Inference
Overhead
: Accurate Question Selection
Efficiently
Accelerates RL Training
eu.36kr.com
·
2d
🤖
AI
Palantir: N Of 1,
Industrializing
Autonomy Via
Zero-Marginal-Cost
AI Integration
seekingalpha.com
·
6h
🤖
AI
YORU
: Animal behavior detection with object-based approach for real-time
closed-loop
feedback
science.org
·
8h
🔀
Transformers
Magic
Tricks
,
Moats
, and the Three-Body Problem of AI Networks
caseyaccidental.com
·
6h
🌐
Distributed Systems
Robotics
Motion Learning: Training Linked Robot Arms with
Kuramoto
Models
hackernoon.com
·
6h
🤖
AI
— ### Abstract We propose a reinforcement‑learning based framework for automatic coordination of multiple autonomous mobile robots (
AMRs
) performing
sl
...
freederia.com
·
5d
🌐
Distributed Systems
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help