Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🤖 reinforcement learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
8797
posts in
141.2
ms
On
Computation
and
Reinforcement
Learning
arxiv.org
·
1d
🧩
operations research
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
·
1d
·
Discuss:
Hacker News
🧩
operations research
Distributional
Reinforcement Learning with Diffusion Bridge
Critics
arxiv.org
·
1d
📊
linear programming
Learning Models with Uniform Performance via
Distributionally
RobustOptimization
dev.to
·
8h
·
Discuss:
DEV
📊
linear programming
On
Economics
of A(S)I Agents
lesswrong.com
·
2h
🧩
operations research
Barn
Owls
Know When to Wait (
iuSTDP
part 2)
blog.typeobject.com
·
31m
·
Discuss:
Hacker News
📊
linear programming
Distributed
Reinforcement Learning for
Scalable
High-Performance Policy Optimization
towardsdatascience.com
·
6d
📊
linear programming
Continual
learning and the post
monolith
AI era
baseten.co
·
23h
·
Discuss:
Hacker News
📊
linear programming
Multi-Agent Reinforcement Learning (
MARL
): Practical Guide to
Cooperative
and Competitive Learning
dev.to
·
1d
·
Discuss:
DEV
📊
linear programming
The control
layer
for AI
blog.dottxt.ai
·
21h
·
Discuss:
Hacker News
🦀
Rust
Your Best Thinking Is
Wasted
on the Wrong
Decisions
iankduncan.com
·
39m
·
Discuss:
Lobsters
,
Hacker News
🧩
operations research
The AI CEO
Experiment
yukicapital.com
·
3h
·
Discuss:
Hacker News
🧩
operations research
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
21h
·
Discuss:
Hacker News
🧩
operations research
Is Your Machine Learning
Pipeline
as Efficient as it Could Be?
kdnuggets.com
·
1d
📊
linear programming
UltiGameMate
silvestar.codes
·
1d
·
Discuss:
Hacker News
📊
linear programming
Mechanistic
Interpretability:
Peeking
Inside an LLM
towardsdatascience.com
·
2d
📊
linear programming
Agentic
Coding and the Problem of
Oracles
epkconsulting.substack.com
·
3h
·
Discuss:
Substack
,
r/programming
🧩
operations research
A Neuro Symbolic Architecture For Induced
Epistemic
Agency and System 2 Reasoning in
Quantized
Large Language Models
papers.ssrn.com
·
2d
·
Discuss:
Hacker News
📊
linear programming
Mappa
– Fine-tune ANY multi-agent LLM systems end-to-end with AI
coaches
news.ycombinator.com
·
3d
·
Discuss:
Hacker News
📊
linear programming
From Human
Thought
to Machine
Coordination
psychologytoday.com
·
1d
·
Discuss:
Hacker News
🧩
operations research
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help