Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🤖 reinforcement learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
6809
posts in
324.6
ms
GAS: Enhancing
Reward-Cost
Balance of Generative Model-assisted Offline Safe
RL
arxiv.org
·
5d
📊
linear programming
Back to Basics:
Revisiting
Exploration in Reinforcement Learning for LLM Reasoning via Generative
Probabilities
arxiv.org
·
5d
📊
linear programming
Stochastic Gradient Descent
Optimizes
Over-parameterized Deep
ReLU
Networks
dev.to
·
3d
·
Discuss:
DEV
📊
linear programming
Meta-Optimized Continual Adaptation for deep-sea exploration
habitat
design with
embodied
agent feedback loops
dev.to
·
3d
·
Discuss:
DEV
🧩
operations research
Mathematical Resolution of P vs NP through
Informational
Noise
Subtraction
and Linear O(n) Mapping
zenodo.org
·
3d
·
Discuss:
Hacker News
📊
linear programming
A
one-prompt
attack that breaks LLM safety
alignment
microsoft.com
·
1d
·
Discuss:
Hacker News
📊
linear programming
Heuristics
for lab
robotics
, and where its future may go
owlposting.com
·
1d
·
Discuss:
Hacker News
🧩
operations research
Show HN: Model Training Memory
Simulator
czheo.github.io
·
2d
·
Discuss:
Hacker News
🦀
Rust
Manufacturing
QMS
Software
samrian.com
·
1d
·
Discuss:
Hacker News
🧩
operations research
Human-like Search for Modern
Applications
anvitra.ai
·
3d
·
Discuss:
Hacker News
📊
linear programming
Show HN:
Routed
Attention – 75-99% savings by routing between O(N) and O(
N²
)
zenodo.org
·
3d
·
Discuss:
Hacker News
🧩
operations research
The AI Training
Asymmetry
tostracker.app
·
3d
·
Discuss:
Hacker News
📊
linear programming
Oatmeal
-
Constraint
propagation for fun
eli.li
·
3d
·
Discuss:
Lobsters
,
Hacker News
📊
linear programming
Show HN:
A2A
Protocol
– Infrastructure for an Agent-to-Agent Economy
news.ycombinator.com
·
3d
·
Discuss:
Hacker News
📊
linear programming
Show HN: First AI Employee –
Treat
AI as a hire, not a
chatbot
site-beige-ten.vercel.app
·
2d
·
Discuss:
Hacker News
📊
linear programming
An
attempt
at a
First-Proof
AI challenge
abhvio.us
·
2d
·
Discuss:
Hacker News
📊
linear programming
Why Files Are Not
Enough
as Memory for AI Agents
medium.com
·
2d
·
Discuss:
Hacker News
📊
linear programming
Building the Future with AI That
Acts
devxt.com
·
3d
·
Discuss:
Hacker News
📊
linear programming
Autonomous
PRD
Agent
minicodemonkey.github.io
·
2d
·
Discuss:
Hacker News
🧩
operations research
Rule
#1 for coding with AI agents
zknill.io
·
1d
·
Discuss:
Hacker News
🧩
operations research
Loading...
Loading more...
« Page 1
•
Page 3 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help