Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔄 Reinforcement Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
115011
posts in
686.9
ms
Adventure
Mode now available
greggjewell.itch.io
·
4d
🚣
Rowing
Listen
to
Yourself
thestoicmanual.com
·
3d
🚣
Rowing
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
4d
·
Discuss:
Hacker News
🚣
Rowing
The
Portfolio
Challenge by Google AI
gbemisolaportfolio-627390562920.us-west1.run.app
·
3d
·
Discuss:
DEV
🚣
Rowing
Saturday links:
deliberate
efforts
abnormalreturns.com
·
3d
🌍
World Politics and Events
Together AI Research Explores
Default
Behaviors
and Risks in Large Language Models
tipranks.com
·
4d
🤝
International Relations
Your AI
Companion
pocketmindai.com
·
4d
·
Discuss:
r/InternetIsBeautiful
🚣
Rowing
Build an Agent with
Nanobot
, Lighter Replacement for
OpenClaw
analyticsvidhya.com
·
3d
🤝
International Relations
A (
collective
)
genius
hits a target no one else can see
metafilter.com
·
4d
🤝
International Relations
From Human
Thought
to Machine
Coordination
psychologytoday.com
·
4d
·
Discuss:
Hacker News
🤝
International Relations
Escape
The
Algorithm
kill-the-newsletter.com
·
4d
🚣
Rowing
AI Role-Playing Characters Gain
Consistency
With
Automatically
Built ‘state Of Mind’ Models
quantumzeitgeist.com
·
4d
🤝
International Relations
The Intelligence
Ratchet
: A Theoretical Framework for
Self-Stabilizing
Artificial Superintelligence
zenodo.org
·
6d
·
Discuss:
Hacker News
🤝
International Relations
AI ‘thinking Budget’ Revealed In
Landmark
Study Of
Self-Reflecting
Machines
quantumzeitgeist.com
·
4d
🤝
International Relations
The Machine
Learned
Our Language
medium.com
·
3d
·
Discuss:
r/programming
🚣
Rowing
Stochastic Gradient Descent
Optimizes
Over-parameterized Deep
ReLU
Networks
dev.to
·
3d
·
Discuss:
DEV
🚣
Rowing
How
separating
logic and search boosts AI agent
scalability
artificialintelligence-news.com
·
4d
🤝
International Relations
Multi-Agent Reinforcement Learning (
MARL
): Practical Guide to
Cooperative
and Competitive Learning
dev.to
·
5d
·
Discuss:
DEV
🚣
Rowing
Show HN:
Axiom
– Open-source AI research agent that runs locally (C#,
Ollama
)
github.com
·
1d
·
Discuss:
Hacker News
🚣
Rowing
Constrained
Sampling to Guide Universal Manipulation
RL
arxiv.org
·
1d
🚣
Rowing
Loading...
Loading more...
« Page 16
•
Page 18 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help