Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123949
posts in
841.7
ms
JRFM
, Vol. 19,
Pages
132: A Hybrid Framework for Multi-Stock Trading: Deep Q-Networks with Portfolio...
mdpi.com
·
2d
📈
Time Series
AI Dispatch, Fraud Prevention, and Building “The
Trucker
’s
TMS
”
finance.yahoo.com
·
1d
🌐
Distributed Systems
The Skills
Decay
Curve
blog.gorewood.games
·
2d
🤖
AI
Slides
from my AI presentation I gave to
seniors
, feel free to share
aititus.com
·
1d
·
Discuss:
Hacker News
🤖
AI
What Would Good Agent
Productivity
Metrics
Look Like?
m16g.com
·
1d
·
Discuss:
Hacker News
🤖
AI
We
chose
a pipeline over speech-to-speech for
evaluative
voice AI
productfit.substack.com
·
1d
·
Discuss:
Substack
🔀
Transformers
Continual
learning and the post
monolith
AI era
baseten.co
·
5d
·
Discuss:
Hacker News
🔀
Transformers
'
Observational
memory' cuts AI agent costs 10x and
outscores
RAG on long-context benchmarks
venturebeat.com
·
1d
🔀
Transformers
The
Scientist
and the
Simulator
latent.space
·
1d
·
Discuss:
Hacker News
🤖
AI
Pedestrian
Trajectory Dataset of Public European
Squares
nature.com
·
1d
🧭
Vector Databases
Instability of cooperation based on
fictitious
belief: an experiment with artificial
supernatural
punishment
nature.com
·
23h
🌐
Distributed Systems
Part 5: Reward Engineering: How to Shape
Behaviors
in
Financial/Robotic
Tasks
dev.to
·
5d
·
Discuss:
DEV
🔧
Feature Engineering
Hybrid meta-optimized
GNN
network to optimize pitch angle and active power of wind
turbines
for reducing fatigue load
sciencedirect.com
·
9h
🔀
Transformers
epfml/halluhard
: A Hard Multi-Turn Hallucination Benchmark
github.com
·
1d
🤖
AI
— ### Abstract The integration of reinforcement learning (RL) with joint
torque
and vision feedback represents a
decisive
step toward fully autonomous ...
freederia.com
·
6d
🔧
Feature Engineering
Building stateful AI Agents with Google
ADK
’s
InMemorySessionService
pub.towardsai.net
·
1d
🤖
AI
Efficient Planning in
Reinforcement
Learning via Model
Introspection
arxiv.org
·
1d
🤖
AI
Why Lose Context in Claude
Sessions
? A
Claude-Mem
Solution
dev.to
·
2h
·
Discuss:
DEV
🔀
Transformers
Monday AI
Radar
#12
lesswrong.com
·
1d
🤖
AI
Empowerment of accurate modeling of
anaerobic
membrane
bioreactors
by automated machine learning
sciencedirect.com
·
9h
🔧
Feature Engineering
Loading...
Loading more...
« Page 13
•
Page 15 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help