Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎮 Reinforcement Learning
Specific
RL, reward functions, policy gradient, RLHF
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
187579
posts in
19.0
ms
Why banks
preach
compliance but
reward
risk‑taking
🤖
Machine Learning
trustsignal.beehiiv.com
·
8h
·
r/Economics
Two agents, one
prompt
🕵️
AI Agents
danielvanstrien.xyz
·
2h
Meta acquires
Assured
Robot Intelligence to accelerate
humanoid
robot push
🧠
AGI
the-decoder.com
·
2h
Is your AI strategy missing a "Safety Net"?🛡️
🕵️
AI Agents
turingpost.com
·
1d
The Trust Problem With AI Agents in Production
Pipelines
🕵️
AI Agents
devops.com
·
17h
Learning diverse natural behaviors for enhancing the
agility
of
quadrupedal
robots
🧠
AGI
nature.com
·
3d
Getting Up to Speed on Multi-Agent Systems, Part 8: Open Questions
🕵️
AI Agents
christophermeiklejohn.com
·
22h
Constraints
That Compute: A Unified Framework for Efficient Intelligence from Prime
Harmonics
to Latent Reasoning
🤖
AI
zenodo.org
·
1d
·
Hacker News
Making sense of
feasibility
constraints. An
agent-centered
account
🕵️
AI Agents
tandfonline.com
·
20h
On-Policy vs Off-Policy RL:
PPO
vs SAC on 5
Gymnasium
Tasks
🕵️
AI Agents
tildalice.io
·
5d
Deep Learning Weekly: Issue 453
🧠
LLMs
deeplearningweekly.com
·
1d
Exploration Hacking: Can LLMs Learn to
Resist
RL
Training?
✍️
Prompt Engineering
lesswrong.com
·
13h
rachel-profitt/RPCustomAgents4VSCode-by-RagnarPitla
: Custom VS Code Copilot Agents for specialized workflows
⚙️
Automation
github.com
·
5h
Preference-aligned
value
stacking
for household battery storage via a selective policy sharing multi-agent reinforcement learning algorithm
🕵️
AI Agents
sciencedirect.com
·
15h
Alibaba's
Metis
agent cuts
redundant
AI tool calls from 98% to 2% — and gets more accurate doing it
🕵️
AI Agents
venturebeat.com
·
1d
Artificial Intelligence:
Foundations
of
Computational
Agents
🕵️
AI Agents
artint.info
·
4d
·
Hacker News
Bayesian policy
gradient
and
actor-critic
algorithms
🕵️
AI Agents
arxiv.org
·
1d
ko-br
| Robotics, Computer Vision,
Imitation
Learning, Reinforcement Learning En...
🧠
AGI
news.ycombinator.com
·
17h
·
Hacker News
Flow
generation through natural language: An agentic
modeling
approach (11 minute read)
⚙️
Automation
shopify.engineering
·
2d
DEEP
Robotics
🤖
AI
youtube.com
·
4d
·
r/singularity
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help