Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-Learning, Policy Gradients, Game Theory, Decision Making
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112693
posts in
336.6
ms
Blockwise
Advantage Estimation for Multi-Objective RL with
Verifiable
Rewards
arxiv.org
·
1d
📊
Quantitative Finance
Can We Really Learn One Representation to
Optimize
All
Rewards
?
arxiv.org
·
21h
🤖
AI Research
Microsoft Tests AI
Marketplace
Simulation
i-programmer.info
·
7h
🤖
AI Research
MiniMaxAI/MiniMax-M2.5
huggingface.co
·
12h
·
Discuss:
Hacker News
,
r/LocalLLaMA
🤖
AI Research
Architectural and Mathematical
Foundations
of Machine Learning: A
Rigorous
Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io
·
2d
·
Discuss:
Hacker News
👁️
Computer Vision
For real
game-theoretic
reasoning, we need best response in
imperfect
information games
weyxie.bearblog.dev
·
4d
·
Discuss:
Hacker News
🤖
AI Research
Are AI agents
cognitive
Ozempic
?
jesseduffield.com
·
13h
🤖
AI Research
Feedback
Control for Computer Systems
janert.org
·
1d
🌐
Distributed Systems
Multi AI Agent Systems with
crewAI
deeplearning.ai
·
1d
🤖
AI Research
Artificial Intelligence and the
Passivity
Problem
psychologytoday.com
·
1d
🤖
AI Research
You are
probably
overpaying
for intelligence
residuals.bearblog.dev
·
5h
📊
Quantitative Finance
Category
Theory, AI and Jobs
deadneurons.substack.com
·
12h
·
Discuss:
Substack
🤖
AI Research
BetaZero
V2: A Diffusion Model for Setting
Boulder
Problems
evmojo37.substack.com
·
1d
·
Discuss:
Substack
📊
Quantitative Finance
Two AI
Economies
, Two
Outcomes
elmerdata.bearblog.dev
·
1d
🤖
AI Research
v6 (Code 2 here) — Most complete architecture. This version is faster than my old v5,
statistically
correct, has all the advanced psychology/network features, and produces stunning
visualizations
gist.github.com
·
1d
·
Discuss:
r/C_Programming
📊
Quantitative Finance
Recursive
Language Models: Stop
Stuffing
the Context Window
nlp.elvissaravia.com
·
1d
💬
NLP
Addendum
: Data splitting against information leakage with
DataSAIL
nature.com
·
13h
💬
NLP
Antigravity
: Beyond the
Basics
of AI Coding
dev.to
·
18h
·
Discuss:
DEV
💬
NLP
In defense of
wasting
time
fastcompany.com
·
1d
🤖
AI Research
We Die Because it's a
Computational
Necessity
lesswrong.com
·
13h
🌐
Distributed Systems
Sign up or log in to see more results
Sign Up
Login
« Page 2
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help