Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
122984
posts in
1.95
s
The
Galactico
strategy
alearningaday.blog
·
1d
💬
Prompt Engineering
Playing
20 Question Game with Policy-Based
Reinforcement
Learning
arxiv.org
·
1d
💬
Prompt Engineering
Technology is a tool, not a
replacement
for experience
healio.com
·
1d
📵
Digital Minimalism
Demand‑Controlled
Ventilation
in Multi‑
Occupancy
Offices: A Reinforcement‑Learning Approach to Adaptive CO₂ Threshold Optimization and Energy‑Efficiency Analysis
freederia.com
·
5d
💬
Prompt Engineering
Dopaminergic
mechanisms supporting hippocampal
postencoding
dynamics in humans
pnas.org
·
15h
🧠
Cognitive Science
Your
ML
Model Is Training on the Future
dev.to
·
11h
·
Discuss:
DEV
🧠
Machine Learning
Determining
the Chemical Potential via Universal
Density
Functional Learning
link.aps.org
·
15h
🤖
AI
Continuous-time reinforcement learning:
ellipticity
enables model-free value function
approximation
arxiv.org
·
2d
🤖
AI
Introspective
Interpretability
: a Definition, Motivation, and Open Problems
lesswrong.com
·
2d
🗣️
LLMs
Agentic
Interactions
linkedin.com
·
15h
💬
Prompt Engineering
Slides
from my AI presentation I gave to
seniors
, feel free to share
aititus.com
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
Gated
Attention &
DeltaNets
: The Missing Link for Long-Context AI
pub.towardsai.net
·
1d
🗣️
LLMs
Backtracking
Algorithms
algos.khourani.com
·
1d
💬
Prompt Engineering
Boosting
metacognition
in
entangled
human-AI interaction to navigate cognitive-behavioral drift
pure.mpg.de
·
2d
💬
Prompt Engineering
Hands-Free
Claude Code with the Agent
SDK
yberreby.com
·
1d
·
Discuss:
Hacker News
🤖
AI
JRFM
, Vol. 19,
Pages
132: A Hybrid Framework for Multi-Stock Trading: Deep Q-Networks with Portfolio...
mdpi.com
·
2d
📊
Data Science
Tutorial – What is a
variational
autoencoder
?
jaan.io
·
2d
·
Discuss:
Hacker News
🤖
AI
— ### Abstract We propose a reinforcement‑learning based framework for automatic coordination of multiple autonomous mobile robots (
AMRs
) performing
sl
...
freederia.com
·
5d
💬
Prompt Engineering
Becoming
More
blog.startifact.com
·
1d
💬
Prompt Engineering
Observe
emergent
behavior in autonomous multi-agent LLM networks
agents.glide2.app
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
Loading...
Loading more...
« Page 4
•
Page 6 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help