Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔄 Reinforcement Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123796
posts in
924.1
ms
RL-Only
Neural Network Training
yager.io
·
4d
🚣
Rowing
Opus 4.6 Reasoning
Distill
3k
prompts
huggingface.co
·
1d
·
Discuss:
r/LocalLLaMA
🚣
Rowing
World Models and the Data Problem in
Robotics
joeljang.github.io
·
1d
·
Discuss:
Hacker News
🤝
International Relations
[Productivity Game] SUMMARY: The
Almanack
of Naval
Ravikant
kill-the-newsletter.com
·
1d
🚣
Rowing
Just-in-Time
Ontological
Reframing
: Teaching Gemini to Route Around Its Own Safety Infrastructure
recursion.wtf
·
1d
🚣
Rowing
Ten-dimensional Neural Network
Emulator
for the
Nonlinear
Matter Power Spectrum
link.aps.org
·
1d
🤝
International Relations
AI
Follows
the 80/20
Rule
buchanan.one
·
2d
·
Discuss:
Hacker News
🤝
International Relations
Show HN: Find automation ideas and
creators
by
sharing
your business problem
humation.ai
·
1d
·
Discuss:
Hacker News
🌍
World Politics and Events
Benefit
of AI?
lemmy.ml
·
1d
🤝
International Relations
Your AI Agents Are Running
Naked
expanso.io
·
23h
·
Discuss:
Hacker News
🚣
Rowing
Gated
Attention &
DeltaNets
: The Missing Link for Long-Context AI
pub.towardsai.net
·
1d
🤝
International Relations
Want AI to
browse
the internet for you?
fry-ai.com
·
1d
🚣
Rowing
Bretton
AI Secures $75 Million to
Deploy
AI Agents Against Financial Crime
pymnts.com
·
21h
🤝
International Relations
**Abstract:** This paper introduces a novel approach to temporal credit
assignment
within distributed actor-critic reinforcement learning (
DRL
) frameworks ap...
freederia.com
·
6d
🚣
Rowing
Your Agent Is
Slow
Because of
Inference
futureagi.com
·
5d
·
Discuss:
DEV
🚣
Rowing
Preference
Conditioned
Multi-Objective Reinforcement Learning:
Decomposed
, Diversity-Driven Policy Optimization
arxiv.org
·
1d
🚣
Rowing
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
·
4d
·
Discuss:
Hacker News
🚣
Rowing
Frugal
AI
ainowinstitute.org
·
1d
🤝
International Relations
Active learning enables generation of
molecules
that advance the known
Pareto
front
nature.com
·
23h
🚣
Rowing
On
Recursive
Self-Improvement
(Part I)
hyperdimensional.co
·
2d
🤝
International Relations
Loading...
Loading more...
« Page 7
•
Page 9 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help