Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Reinforcement Learning In Finance
📈 Reinforcement Learning In Finance
Specific
RLHF, PPO, Q-Learning, Policy Gradient, AlphaGo
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
377
posts in
18.8
ms
TT-DAC-PS: Twin-Target Deterministic
Actor-Critic
with
Policy
Smoothing for Optimal Trade Execution
Â
📊
Quantitative Finance For Portfolio Management
Â
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution
Q-Learning
(
Reinforcement
learning
): Bellman Equation, Markov Decision Processes, Q-Values, and…
Â
📊
Quantitative Finance For Portfolio Management
Â
Content type:
Blog
medium.com
·
3d
3 days ago
Actions for Q-Learning (Reinforcement learning): Bellman Equation, Markov Decision Processes, Q-Values, and…
How to Implement a Model-Free
RL
Algorithm
: A Step-by-Step Guide
Â
🎲
Ergodicity Economics
Â
Content type:
Blog
ujangriswanto08.medium.com
·
20h
20 hours ago
Actions for How to Implement a Model-Free RL Algorithm: A Step-by-Step Guide
Researchers develop AI-powered railway control system for efficient urban train operation
Â
🕸
Complexity Economics
techxplore.com
·
1d
1 day ago
Actions for Researchers develop AI-powered railway control system for efficient urban train operation
Reasoning
RL
in 2026: GRPO, DPO, RLVR, Agentic
PO
& Beyond
Â
🤖
LLMs
turingpost.com
·
4d
4 days ago
Actions for Reasoning RL in 2026: GRPO, DPO, RLVR, Agentic PO & Beyond
Less-relevant results
GermRL: Alleviating The Germline Bias In Autoregressive Antibody Language Models Through
Reinforcement
Learning
Â
🎲
Ergodicity Economics
Â
Content type:
Academic
biorxiv.org
·
7h
7 hours ago
Actions for GermRL: Alleviating The Germline Bias In Autoregressive Antibody Language Models Through Reinforcement Learning
Agents Need Work Data: A Primer on RLWD, or
Reinforcement
Learning
on Work Data
Â
🕸
Complexity Economics
anjalishriva.com
·
2d
2 days ago
·
Hacker News
Actions for Agents Need Work Data: A Primer on RLWD, or Reinforcement Learning on Work Data
Reinforcement
Learning
and
Optimal
Control Book (RIP Dimitri Bertsekas)
Â
🎲
Ergodicity Economics
Â
Content type:
Academic
web.mit.edu
·
6d
6 days ago
·
Hacker News
Actions for Reinforcement Learning and Optimal Control Book (RIP Dimitri Bertsekas)
Reinforcement-learning
signals support dynamic adaptive control during language switching
Â
🕸
Complexity Economics
Â
Content type:
Academic
nature.com
·
2d
2 days ago
Actions for Reinforcement-learning signals support dynamic adaptive control during language switching
Time-slip in AI sepsis models may inflate results, risking under- or overtreatment
Â
🎲
Ergodicity Economics
medicalxpress.com
·
6d
6 days ago
Actions for Time-slip in AI sepsis models may inflate results, risking under- or overtreatment
Some Interesting Papers on RLVR
Â
📊
Quantitative Finance For Portfolio Management
lesswrong.com
·
2d
2 days ago
Actions for Some Interesting Papers on RLVR
Scale Robot
Reinforcement
Learning
with NVIDIA Isaac Lab on Amazon SageMaker AI
Â
🕸
Complexity Economics
Â
Content type:
Blog
aws.amazon.com
·
2d
2 days ago
Actions for Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI
[NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!
Â
📊
Quantitative Finance For Portfolio Management
huggingface.co
·
13h
13 hours ago
·
r/LocalLLaMA
Actions for [NEW MODEL] SupraLabs just released Supra1.5-50M Base (Experimental)!
SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
Â
🕸
Complexity Economics
Â
Content type:
Code
github.com
·
4d
4 days ago
·
r/opensource
Actions for SimarcLabs/pybullet-swarm-sim: Python framework for simulating drone swarms with PyBullet in seconds.
Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase
Ordering
Framework for Graph-based IR-Level
Optimization
Â
🎲
Ergodicity Economics
compilers.iecc.com
·
6d
6 days ago
Actions for Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization
Dynamic
Execution
Horizon Prediction for Chunk-based Robot
Policies
Â
🎲
Ergodicity Economics
Â
Content type:
Academic
arxiv.org
·
21h
21 hours ago
Actions for Dynamic Execution Horizon Prediction for Chunk-based Robot Policies
Weekly Research Recap
Â
📈
Quantitative Strategies
Â
Content type:
News
quantseeker.com
·
2d
2 days ago
Actions for Weekly Research Recap
Beyond Dexterity: Why Contact May Define the Next Era of Robotics
Â
📊
Quantitative Finance For Portfolio Management
Â
Content type:
Video
Â
Content type:
News
spectrum.ieee.org
·
2d
2 days ago
·
Hacker News
Actions for Beyond Dexterity: Why Contact May Define the Next Era of Robotics
LogicWealth | SMT
Portfolio
Construction Terminal
Â
📊
Quantitative Finance For Portfolio Management
pralfredo.github.io
·
4d
4 days ago
·
r/SideProject
Actions for LogicWealth | SMT Portfolio Construction Terminal
You'
re
doing it wrong
Â
🕸
Complexity Economics
Â
Content type:
News
understandably.com
·
2d
2 days ago
Actions for You're doing it wrong
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help