Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
150933
posts in
16.6
ms
Provable
Multi-Task Reinforcement Learning: A Representation Learning Framework with Low
Rank
Rewards
🔄
Meta-Learning
arxiv.org
·
3d
Neural
circuits
encode
prior knowledge of temporal statistics
🎯
Predictive Coding
nature.com
·
2d
Trustworthy
agents in
practice
🔄
Meta-Learning
anthropic.com
·
22h
Compression
technique
makes AI models
leaner
and faster while they're still learning
🎯
Predictive Coding
techxplore.com
·
20h
Hyperparameter
optimization impact and tuning guidelines for decentralized multi-agent reinforcement learning in multi-energy
neighborhoods
🧠
Neuromorphic Hardware
sciencedirect.com
·
2d
Three Ways
Machines
Learn
🎯
Predictive Coding
medium.com
·
3d
Making AI smarter and
greener
:
Reducing
the energy cost of large language models
🧠
Neuromorphic Hardware
digitaljournal.com
·
20h
Formalizing
the "generative crash" via
inverse
reinforcement learning
🎯
Predictive Coding
news.ycombinator.com
·
2d
·
Hacker News
Tensors
— The Native Data
Format
of Deep Learning
🤖
Machine Learning
grahamjroy.medium.com
·
19h
Rethinking
Robotics Reinforcement Learning: A Practical
Humanoid
Training Workflow
🦾
Robotics
semiengineering.com
·
1d
Markov
Decision
Processes
: The Language of Reinforcement Learning
🎯
Predictive Coding
medium.com
·
5d
How HN: We were wrong about AI
capability
floors
(and why smart triggers matter)
🧠
Neuromorphic Hardware
zenodo.org
·
14h
·
Hacker News
The Dark Factory
Harness
: Turning Autonomous
Hill-Climbing
into Autonomous Research
🎯
Predictive Coding
sotaverified.org
·
2d
·
Hacker News
Continual
learning for AI agents
🔄
Meta-Learning
blog.langchain.com
·
4d
·
Hacker News
How Does an Agent with Multiple
Goals
Choose
a Target?
🧭
Axon Guidance
lesswrong.com
·
2d
Autonomous
Rocket
Landing
with Reinforcement Learning (YouTube)
🦾
Robotics
youtube.com
·
1d
·
Hacker News
Reinforcement Learning with
Reward
Machines
for Sleep Control in Mobile Networks
🧠
Neuromorphic Hardware
arxiv.org
·
11h
Reinforcement
Learning From Human Feedback (
RLHF
) in Large Language Models(LLMs)
🎯
Predictive Coding
pub.towardsai.net
·
6d
My AI Learning
Journey
– Part 4
🧠
Neuromorphic Hardware
blog.wirelessmoves.com
·
2d
The Complete Guide to Multi-Agent AI Systems and
Reinforcement
Learning
🧠
Neuromorphic Hardware
medium.com
·
3d
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help