Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123554
posts in
2.04
s
JRFM
, Vol. 19,
Pages
132: A Hybrid Framework for Multi-Stock Trading: Deep Q-Networks with Portfolio...
mdpi.com
·
2d
📈
Time Series
AI Dispatch, Fraud Prevention, and Building “The
Trucker
’s
TMS
”
finance.yahoo.com
·
1d
🌐
Distributed Systems
25W06
. Learning a language with the machine
z1nz0l1n.com
·
3d
🔀
Transformers
Measure
Twice
, Prompt Once
ignasibosch.com
·
17h
·
Discuss:
DEV
🌐
Distributed Systems
Agent Bricks
Supervisor
Agent is Now GA:
Orchestrate
Enterprise Agents
databricks.com
·
22h
🏗️
Data Engineering
Adversarial
Reasoning:
Multiagent
World Models for closing the Simulation Gap
latent.space
·
3d
·
Discuss:
Hacker News
,
Hacker News
🤖
AI
Slides
from my AI presentation I gave to
seniors
, feel free to share
aititus.com
·
19h
·
Discuss:
Hacker News
🤖
AI
An approach to
reducing
prompt size,
drift
, and governance risk in LLM-based systems
qu3ry.net
·
1d
·
Discuss:
r/LLM
🌐
Distributed Systems
We
chose
a pipeline over speech-to-speech for
evaluative
voice AI
productfit.substack.com
·
1d
·
Discuss:
Substack
🔀
Transformers
Augmentation of
frontoparietal
gamma-band phase coupling enhances human
altruistic
behavior
journals.plos.org
·
1d
🔀
Transformers
The
Scientist
and the
Simulator
latent.space
·
23h
·
Discuss:
Hacker News
🤖
AI
CCUS
technology diffusion and multi-agent investment behavior under policy
incentives
sciencedirect.com
·
1h
🌐
Distributed Systems
Dynamic Reinforcement‑Learning Allocation of Daily Physical‑Activity Intensity for Post‑Myocardial‑
Infarction
Cardiac‑Rehab: A Closed‑Loop, Model‑Based
Appro
...
freederia.com
·
5d
🤖
AI
Agentic Banking: How AI Systems and
Tokenized
Compliance Are
Restructuring
Investment and…
medium.com
·
2d
📊
OLAP
Continuous-time reinforcement learning:
ellipticity
enables model-free value function
approximation
arxiv.org
·
2d
🤖
AI
Instability of cooperation based on
fictitious
belief: an experiment with artificial
supernatural
punishment
nature.com
·
15h
🌐
Distributed Systems
Environmental semantic clustering-guided multimodal fusion for enhanced
interpretability
in methane
concentration
prediction
sciencedirect.com
·
1h
🔧
Feature Engineering
Here's an AI
assignment
I'm going to
try
groups.google.com
·
1h
🤖
AI
The Role of Signal To
Noise
in Loss
Convergence
pub.towardsai.net
·
16h
🔀
Transformers
**Abstract:** Safe exploration in reinforcement learning (
RL
) environments,
particularly
robotic manipulation, remains a critical challenge. Current approach...
freederia.com
·
6d
🤖
AI
Loading...
Loading more...
« Page 8
•
Page 10 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help