Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80632
posts in
322.7
ms
Weak-Driven
Learning: How
Weak
Agents make Strong Agents
Stronger
arxiv.org
·
12h
🔄
Meta-Learning
Who
Deserves
the Reward? SHARP:
Shapley
Credit-based Optimization for Multi-Agent System
arxiv.org
·
12h
🎯
Predictive Coding
Meta-Optimized Continual Adaptation for deep-sea exploration
habitat
design with
embodied
agent feedback loops
dev.to
·
2d
·
Discuss:
DEV
🌱
Neuroplasticity
Skills:
teaching
AI agents to act
consistently
dev.to
·
22h
·
Discuss:
DEV
🎯
Predictive Coding
## Hyper-Accurate
Cerebellar
Microcircuit
Modeling via Dynamic Stochastic Differential Equation Projection and Reinforcement Learning Optimization for Enhanced Motor Skill Acquisition
freederia.com
·
5d
🎯
Predictive Coding
Human-like Search for Modern
Applications
anvitra.ai
·
2d
·
Discuss:
Hacker News
🎯
Predictive Coding
Exploiting
large language model with reinforcement learning for generative job
recommendations
eurekalert.org
·
5d
🎯
Predictive Coding
*Robust Hierarchical Reinforcement Learning for
Bipedal
Robots Performing Dynamic Balance on
Sloped
Terrains under Partial Sensor Failure*
freederia.com
·
4d
🦾
Robotics
AI and the future of work:
Measuring
AI-driven productivity gains for
workplace
tasks
aisi.gov.uk
·
2d
🤖
Machine Learning
AI
Workflows
chatprd.ai
·
2d
🧮
Algorithms
Agentic Banking: How AI Systems and
Tokenized
Compliance Are
Restructuring
Investment and…
medium.com
·
1d
🧠
Neuromorphic Hardware
How To Go
Slow
artima.com
·
2d
💾
Microcontrollers
Finding all the roots of a
polynomial
using the
QR
algorithm
johndcook.com
·
3d
🧮
Algorithms
A
GTM
guide to AI models
revengine.substack.com
·
2d
·
Discuss:
Substack
🔄
Meta-Learning
On
Economics
of A(S)I Agents
lesswrong.com
·
2d
🧠
Neuromorphic Hardware
Faster
AI Training
Unlocked
With New System For Massive Language Models
quantumzeitgeist.com
·
1d
🌳
recursive neural networks
My
Workflow
for
Agentic
Coding
szymonkrajewski.pl
·
1d
🎯
Predictive Coding
From years to days: How AI agents are
helping
predict
battery life in just days
indianexpress.com
·
3d
🧠
Neuromorphic Hardware
The Art of Action
jarango.com
·
1d
🌱
Neuroplasticity
Predicting
operators
reliability
for control room alarm management using knowledge-based Bayesian networks
sciencedirect.com
·
4d
🎯
Predictive Coding
Loading...
Loading more...
« Page 6
•
Page 8 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help