Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123712
posts in
2.44
s
Playing
20 Question Game with Policy-Based
Reinforcement
Learning
arxiv.org
·
1d
💬
Prompt Engineering
Boltzmann
Reinforcement Learning for Noise resilience in Analog
Ising
Machines
arxiv.org
·
9h
🤖
AI
Main
Content ||
Math
∩ Programming
jeremykun.com
·
2d
📊
Data Science
Technology is a tool, not a
replacement
for experience
healio.com
·
18h
📵
Digital Minimalism
The
Galactico
strategy
alearningaday.blog
·
1d
💬
Prompt Engineering
The Machine Learning
Practitioner
’s Guide to
Speculative
Decoding
machinelearningmastery.com
·
3h
🗣️
LLMs
Quantization-Aware
Distillation
ternarysearch.blogspot.com
·
3d
·
Discuss:
Hacker News
🗣️
LLMs
JRFM
, Vol. 19,
Pages
132: A Hybrid Framework for Multi-Stock Trading: Deep Q-Networks with Portfolio...
mdpi.com
·
2d
📊
Data Science
Machine learning reveals hidden
landscape
of
robust
information storage
phys.org
·
23h
💬
Prompt Engineering
Beyond the
Hype
: Why Machine Learning is the Strategic
Backbone
of Modern AI
pub.towardsai.net
·
18h
💬
Prompt Engineering
Genuine learning biases persist after accounting for
temporally
decreasing
learning rates: Insight from fitting six datasets
pnas.org
·
1h
🧠
Machine Learning
Schedules
of Reinforcement in
Psychology
(Examples)
simplypsychology.org
·
19h
·
Discuss:
Hacker News
🧠
Cognitive Science
Heuristics
for lab
robotics
, and where its future may go
lesswrong.com
·
21h
💬
Prompt Engineering
Unlock
Customer Insights with
Theta
Intelligence
medium.com
·
17h
💬
Prompt Engineering
Ai’s ‘
steering
’ Made Far More
Precise
With New Fine-Tuning Technique
quantumzeitgeist.com
·
1d
💬
Prompt Engineering
New Research Shows AI Agents Learn
Altruism
From Human
Behavior
pymnts.com
·
1d
🤖
AI
Hands-Free
Claude Code with the Agent
SDK
yberreby.com
·
17h
·
Discuss:
Hacker News
🤖
AI
Demand‑Controlled
Ventilation
in Multi‑
Occupancy
Offices: A Reinforcement‑Learning Approach to Adaptive CO₂ Threshold Optimization and Energy‑Efficiency Analysis
freederia.com
·
4d
💬
Prompt Engineering
Dopaminergic
mechanisms supporting hippocampal
postencoding
dynamics in humans
pnas.org
·
1h
🧠
Cognitive Science
Slides
from my AI presentation I gave to
seniors
, feel free to share
aititus.com
·
19h
·
Discuss:
Hacker News
💬
Prompt Engineering
Loading...
Loading more...
« Page 2
•
Page 4 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help