🔄 Reinforcement Learning
check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation
dev.to · 21h · Discuss: DEV
🚣 Rowing
Efficient Planning in Reinforcement Learning via Model Introspection
arxiv.org · 1d
🚣 Rowing
Reinforcement Learning with Backtracking Feedback
arxiv.org · 1d
🚣 Rowing
Recursive self-improvement from AI models
marginalrevolution.com · 14h · Discuss: Hacker News
🚣 Rowing
The Rather-efficient Replacement to RL-specialization for AI agents
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 1h · Discuss: Hacker News
🚣 Rowing
Teaching Reasoning with Games
danonymous.bearblog.dev · 6h
International Relations
JupyterPS/VBAF: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation
github.com · 20h · Discuss: Hacker News
🚣 Rowing
Observe emergent behavior in autonomous multi-agent LLM networks
agents.glide2.app · 16h · Discuss: Hacker News
International Relations
Variable Rewards Produce Dopamine
artlu.bearblog.dev · 1d
International Relations
An automated geometric space curve approach for designing dynamically corrected gates
nature.com · 15h
International Relations
Order parameters and phase transitions of continual learning in deep neural networks
pnas.org · 13m
🚣 Rowing
#2 - Going to second base: know your boundaries
dev.to · 15h · Discuss: DEV
🚣 Rowing
1.8x Increase in Training Speed, 78% Reduction in Inference Overhead: Accurate Question Selection Efficiently Accelerates RL Training
eu.36kr.com · 1d
🚣 Rowing
ashworks1706/rlhf-from-scratch: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models from scratch.
github.com · 21h · Discuss: Hacker News
International Relations
The Rational Use of Cognitive Resources
press.princeton.edu · 1d
International Relations
Experience AI
experience-ai.org · 18h
🚣 Rowing
Decision-Based Artificial Intelligence and the Strategic Reordering of Military Power
inss.ndu.edu · 16h
International Relations
New Generative Paradigm: Drifting Model
mail.bycloud.ai · 13h
🚣 Rowing
Entropic Balance with Feedback Control: Information Equalities and Tight Inequalities
link.aps.org · 20h
International Relations
Introducing Lab: A full-stack platform for training your own agentic models
threadreaderapp.com · 7h
International Relations