Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃幃 Reinforcement Learning
RL Algorithms, Agent Training, Policy Gradient, Post Traning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
121116
posts in
1.06
s
Show HN:
Fighting
the War Against
Expensive
Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
路
7h
路
Discuss:
Hacker News
馃挰
LLMs
Rollout-Training
Co-Design for Efficient LLM-Based Multi-Agent
Reinforcement
Learning
arxiv.org
路
1d
馃挰
LLMs
Found-RL
: foundation model-enhanced reinforcement learning for
autonomous
driving
arxiv.org
路
9h
馃攧
Transformers
A multi-agent reinforcement learning approach to autonomous aircraft
taxiing
with
taxiing
time, fuel consumption, and
emission
optimization
sciencedirect.com
路
1d
馃
AI
check out this
article
on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical Implementation
dev.to
路
2d
路
Discuss:
DEV
馃攧
Transformers
Multi AI Agent Systems with
crewAI
deeplearning.ai
路
3h
馃
AI
A training
principle
for
drifting
models
breno.bearblog.dev
路
3h
馃
Machine Learning
A
masterclass
in AI security
operations
redcanary.com
路
1h
馃
AI
Your AI Strategy Has a
Human-Shaped
Hole
superiortech.io
路
48m
路
Discuss:
Hacker News
馃
AI
Feedback
Control for Computer Systems
janert.org
路
7h
馃
AI
Observe
emergent
behavior in autonomous multi-agent LLM networks
agents.glide2.app
路
1d
路
Discuss:
Hacker News
馃挰
LLMs
AI Agents Explained in 3
Levels
of
Difficulty
kdnuggets.com
路
1d
路
Discuss:
Hacker News
馃
AI
Why the future of AI
belongs
to models that
simulate
reality
sifted.eu
路
4h
馃
AI
Robotics
Motion Learning: Training Linked Robot Arms with
Kuramoto
Models
hackernoon.com
路
23h
馃
AI
GLM-5
: From
Vibe
Coding to Agentic Engineering
simonwillison.net
路
20h
路
Discuss:
Hacker News
馃挰
LLMs
JupyterPS/VBAF
: Visual Business Automation Framework - PowerShell-based reinforcement learning for education and business automation
github.com
路
2d
路
Discuss:
Hacker News
馃
AI
Recursive
self-improvement
from AI models
marginalrevolution.com
路
1d
路
Discuss:
Hacker News
馃
AI
I
Pitted
3 AI Agents Against Each Other. The Result Was
Scary
.
pub.towardsai.net
路
1d
馃
AI
I
benchmarked
4 CLI coding agents on an
NP-hard
optimization problem I solved by hand 8 years ago. One of them beat me.
charlesazam.com
路
14m
路
Discuss:
Hacker News
馃
AI
Task-Completion
Time
Horizons
of Frontier AI Models
metr.org
路
22h
路
Discuss:
Hacker News
馃
AI
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help