Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
115848
posts in
1.80
s
check out this article on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical
Implementation
dev.to
·
21h
·
Discuss:
DEV
🔄
Meta-Learning
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
2d
🔄
Meta-Learning
Reinforcement
Learning with
Backtracking
Feedback
arxiv.org
·
1d
🔄
Meta-Learning
Insights
on Machine Learning
Fundamentals
dev.to
·
2h
·
Discuss:
DEV
🤖
Machine Learning
Memory and Learning
layer
be built in-house or bought
externally
?
medium.com
·
16h
·
Discuss:
Hacker News
🧠
Neuromorphic Hardware
Order
parameters
and phase transitions of
continual
learning in deep neural networks
pnas.org
·
40m
🌳
recursive neural networks
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
4d
🎯
Predictive Coding
ashworks1706/rlhf-from-scratch
: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.
github.com
·
21h
·
Discuss:
Hacker News
🎯
Predictive Coding
Recursive
self-improvement
from AI models
marginalrevolution.com
·
14h
·
Discuss:
Hacker News
🌳
recursive neural networks
Risk-preference-aware
optimal scheduling and profit allocation of load
aggregators
and charging operators
sciencedirect.com
·
13h
🎯
Predictive Coding
The Rather-efficient Replacement to
RL-specialization
for AI agents
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
2h
·
Discuss:
Hacker News
🔄
Meta-Learning
New Generative
Paradigm
:
Drifting
Model
mail.bycloud.ai
·
14h
🔄
Meta-Learning
Entropic
Balance with Feedback Control: Information
Equalities
and Tight Inequalities
link.aps.org
·
20h
🧠
Neuromorphic Hardware
Home
freedomtrainers.net
·
4h
🔬
Science
A data-efficient foundation model for
porous
materials based on expert-guided
supervised
learning
nature.com
·
1h
🖨️
3D Printing
The
Rational
Use of
Cognitive
Resources
press.princeton.edu
·
1d
🎯
Predictive Coding
Show HN:
ContinualCode
– a coding agent that updates its
weights
from feedback
sdan.github.io
·
1d
·
Discuss:
Hacker News
🔄
Meta-Learning
Frequency-domain approach to automated and efficient
multivariate
kernel density estimation for
probabilistic
modeling
sciencedirect.com
·
17h
📡
Signal Processing
1.8x Increase in Training Speed, 78% Reduction in Inference
Overhead
: Accurate Question Selection
Efficiently
Accelerates RL Training
eu.36kr.com
·
1d
🔄
Meta-Learning
AI Dispatch, Fraud Prevention, and Building “The
Trucker
’s
TMS
”
finance.yahoo.com
·
18h
🦾
Robotics
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help