Scour
🎮 Reinforcement Learning
RLHF, Policy Gradient, Reward Models, Agent Training
Scoured 17741 posts in 84.0 ms
Agent Factory Recap: Reinforcement Learning and Fine-Tuning on TPUs
🏛 Sovereign AI Infrastructure · dev.to · 2d · DEV
From ClawHavoc to Trust Shield: How a Security Incident Inspired Trust Infrastructure for AI Agents
🎼 Agent Orchestration · rotifer.dev · 3d · DEV
Predicting When RL Training Breaks Chain-of-Thought Monitorability
🎯 RLHF · lesswrong.com · 1d
Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models
🧮 Stochastic Calculus · dani2442.github.io · 6d · Hacker News
The Architecture of an Agent That Runs Itself
🏛 Sovereign AI Infrastructure · github.com · 2d · DEV
Agent-to-Agent Pair Programming
🎼 Agent Orchestration · axeldelafosse.com · 6d · Hacker News
The Anatomy of an AI Agent: Memory, Tools, Planning, and Execution Explained
📋 AGENTS.md · dev.to · 3h · DEV
The Agent Data Layer: A Missing Layer in AI Architecture
🕵️ AI Agents · dev.to · 2h · DEV
(Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL
🎭 Adversary Emulation · lesswrong.com · 3d
The Improver: How I Built an AI Agent That Upgrades Other AI Agents
🧠 Context Engineering · dev.to · 2d · DEV
A Taxonomy of Agents: Intro & Request for feedback
🎼 Agent Orchestration · lesswrong.com · 6d
Your Knowledge, Your Model — Part 2: Agents, Iatrogenics
🧠 Context Engineering · dev.to · 2d · DEV
Why AI agent teams are just hoping their agents behave
🕵️ AI Agents · dev.to · 2d · DEV
Building Self-Improving AI Agent Hierarchies with Paperclip Plugins
🕵️ AI Agents · dev.to · 1d · DEV
Scalable Design of Agent
🧠 Context Engineering · dev.to · 5d · DEV
How to Build a Self-Healing AI Agent: A Practical Framework
🎯 AI Reliability · dev.to · 3d · DEV
Building AI Agents: The Fundamentals
🧠 Context Engineering · dev.to · 4d · DEV
Here’s how I would learn AI Agents as a total beginner
🕵️ AI Agents · dev.to · 3d · DEV
JSON Strategy Templates vs Executable WASM Genes: Two Paths for AI Agent Evolution
🧠 Context Engineering · dev.to · 3d · DEV
Beyond the Hype: Building Practical AI Agents with Memory and Reasoning
💾 Agent Memory · dev.to · 5d · DEV