Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
ddboline's Feed
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
8784
posts in
90.7
ms
Loading...
Subscribe
Mode-Dependent
Rectification
for Stable
PPO
Training
arxiv.org
·
1d
🤖
reinforcement learning
Learning the Value Systems of Agents with
Preference-based
and
Inverse
Reinforcement Learning
arxiv.org
·
2d
🤖
reinforcement learning
Why Most AI Content Systems Don’t Learn
dev.to
·
16h
·
Discuss:
DEV
🤖
reinforcement learning
**Title**
dev.to
·
8h
·
Discuss:
DEV
🤖
reinforcement learning
OpenIndiana
Is Porting
Solaris
' IPS Package Management To Rust
phoronix.com
·
4d
·
Discuss:
r/linux
🦀
Rust
Making on a Manager's
Schedule
zsuss.substack.com
·
3d
·
Discuss:
Substack
🧩
operations research
The
Gumbel-Max
Trick
blog.quipu-strands.com
·
4d
·
Discuss:
Hacker News
🤖
reinforcement learning
A Modern Python Stack for Data Projects (uv +
ruff
+ ty +
Marimo
+ Polars)
mameli.dev
·
3d
·
Discuss:
r/programming
🦀
Rust
How I Program with LLMs
blog.wesleyabbey.io
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Training language models on
TPUs
shouldn't be
scary
dogac.dev
·
2d
·
Discuss:
Hacker News
🤖
reinforcement learning
Common Sense
Refactoring
of a
Messy
React Component
alexkondov.com
·
3d
·
Discuss:
Hacker News
🦀
Rust
Mem0
stores memories, but doesn't learn user
patterns
news.ycombinator.com
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Dark
Alley
Mathematics
blog.szczepan.org
·
5d
·
Discuss:
Hacker News
📊
linear programming
Breaking the Stack: How Adversarial Attacks
Bypass
LLM
Safeguards
pub.towardsai.net
·
3d
🤖
reinforcement learning
Convert
&
Compress
frontendmasters.com
·
4d
📊
linear programming
Please stop using OpenClaw, formerly known as
Moltbot
, formerly known as
Clawdbot
xda-developers.com
·
3d
·
Discuss:
Hacker News
🦀
Rust
Expensively
Quadratic
: the LLM Agent Cost Curve
blog.exe.dev
·
5d
·
Discuss:
Lobsters
,
Hacker News
🤖
reinforcement learning
Sign up or login to customize your feed and get personalized topic recommendations
Sign Up
Login
Taming the Flat AST:
Ergonomics
in the Age of Zero
Allocations
modern-c.blogspot.com
·
4d
·
Discuss:
Lobsters
,
Hacker News
,
r/golang
🧩
operations research
Selection
Rather
Than Prediction
voratiq.com
·
5d
·
Discuss:
Hacker News
🤖
reinforcement learning
Building a privacy-first,
EU-hosted
AI chat in Rust (
Leptos
)
limbochat.com
·
3d
·
Discuss:
Hacker News
🤖
reinforcement learning
Loading...
Loading more...
« Page 6
•
Page 8 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help