Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
ddboline's Feed
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
8586
posts in
88.4
ms
Loading...
Subscribe
Mode-Dependent
Rectification
for Stable
PPO
Training
arxiv.org
·
2d
🤖
reinforcement learning
Learning the Value Systems of Agents with
Preference-based
and
Inverse
Reinforcement Learning
arxiv.org
·
3d
🤖
reinforcement learning
Why AI Agents Make
Different
Decisions
When They Think It's Real
dev.to
·
20h
·
Discuss:
DEV
🤖
reinforcement learning
A Simple
Method
for
Commonsense
Reasoning
dev.to
·
9h
·
Discuss:
DEV
🤖
reinforcement learning
a
proposal
for AI that's on your side
r.github.io
·
2d
·
Discuss:
Hacker News
🤖
reinforcement learning
Against the
Orthogonality
Thesis
jonasmoman.substack.com
·
3d
·
Discuss:
Substack
🤖
reinforcement learning
26x
technicalchops.com
·
3d
·
Discuss:
Hacker News
🦀
Rust
Prompt Fidelity: Measuring How Much of Your
Intent
an AI Agent Actually
Executes
towardsdatascience.com
·
2d
🤖
reinforcement learning
I spent 2 weeks playing god. My
learnings
from 597 genetic algorithm
lineages
blog.silennai.com
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Style tips for less
experienced
developers
coding with AI
honnibal.dev
·
2d
·
Discuss:
Hacker News
🦀
Rust
Bridging
AI and
Skills
bridge.surf
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Beyond
Roleplay
:
Jailbreaking
Gemini with drugs and ritual
tidepool.leaflet.pub
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
The Top 10 Best
Practices
for AI/BI
Dashboards
Performance Optimization (Part 2)
databricks.com
·
3d
📊
linear programming
The Game That
Ate
Itself
seeingthesystem.com
·
4d
·
Discuss:
Hacker News
🤖
reinforcement learning
Feedback
Loopable
ampcode.com
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
The
Agentic
Trust Framework: Zero Trust
Governance
for AI Agents
cloudsecurityalliance.org
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Claude Code is the
Inflection
Point
newsletter.semianalysis.com
·
3d
·
Discuss:
Hacker News
,
Hacker News
🧩
operations research
Sign up or login to customize your feed and get personalized topic recommendations
Sign Up
Login
As
Rocks
May Think
evjang.com
·
4d
·
Discuss:
Hacker News
,
r/programming
🤖
reinforcement learning
Agentic
Proof-Oriented
Programming
risemsr.github.io
·
3d
·
Discuss:
Lobsters
,
Hacker News
🧩
operations research
How close is AI to taking my job?
epoch.ai
·
2d
·
Discuss:
Hacker News
🤖
reinforcement learning
Loading...
Loading more...
« Page 6
•
Page 8 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help