🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation
dev.to · 1d · Discuss: DEV
💬 Prompt Engineering
Optimistic Training and Convergence of Q-Learning -- Extended Version
arxiv.org · 2d
💬 Prompt Engineering
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
arxiv.org · 20h
💬 Prompt Engineering
A multi-agent reinforcement learning approach to autonomous aircraft taxiing with taxiing time, fuel consumption, and emission optimization
sciencedirect.com · 11h
💬 Prompt Engineering
Architectural and Mathematical Foundations of Machine Learning: A Rigorous Synthesis of Theory, Geometry, and Implementation
chizkidd.github.io · 11h · Discuss: Hacker News
🧠 Machine Learning
Memory and Learning layer be built in-house or bought externally?
medium.com · 1d · Discuss: Hacker News
💬 Prompt Engineering
Show HN: A minimal online decision maker
decisionmaker.online · 11h · Discuss: Hacker News
🧠 Cognitive Science
Order parameters and phase transitions of continual learning in deep neural networks
pnas.org · 12h
🤖 AI
Learning Optimization Tools
trendhunter.com · 1d
💬 Prompt Engineering
Hybrid neural–cognitive models reveal how memory shapes human reward learning
nature.com · 4d
🧠 Cognitive Science
ashworks1706/rlhf-from-scratch: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and its applications in Large Language Models from scratch.
github.com · 1d · Discuss: Hacker News
💬 Prompt Engineering
Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training
chipublib.idm.oclc.org · 9h
🗣️ LLMs
Behavioral economics-oriented energy storage investment analysis: A holistic decision support model with advanced fuzzy techniques
sciencedirect.com · 8h
🧠 Cognitive Science
Recursive self-improvement from AI models
marginalrevolution.com · 1d · Discuss: Hacker News
💬 Prompt Engineering
Embedded Agency (full-text version)
lesswrong.com · 11h
💬 Prompt Engineering
An assistive robot learns to set and clear the table by observing humans
techxplore.com · 2h
🤖 AI
New Generative Paradigm: Drifting Model
mail.bycloud.ai · 1d
💬 Prompt Engineering
Entropic Balance with Feedback Control: Information Equalities and Tight Inequalities
link.aps.org · 1d
🤖 AI
The Ultimate Guide to Spaced Repetition: How to Remember Anything Forever
brainrash.com · 11h · Discuss: DEV
🧠 Cognitive Science
EyesOff: Why Some Models Quantize Better Than Others
ym2132.github.io · 2h · Discuss: Hacker News
🤖 AI