🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Can We Really Learn One Representation to Optimize All Rewards?
arxiv.org · 23h
🎯 Predictive Coding
Check out this article on Reinforcement Learning with R: Origins, Real-Life Applications, and Practical Implementation
dev.to · 3d · Discuss: DEV
🔄 Meta-Learning
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
arxiv.org · 23h · Discuss: Hacker News
🔄 Meta-Learning
Multi-armed bandit
en.wikipedia.org · 11h
🧮 Algorithms
Optimizing post-disaster road restoration with reinforcement learning: A traveler-behavior-aware approach
sciencedirect.com · 1d
🧠 Neuromorphic Hardware
The implementation for the drifting model
breno.bearblog.dev · 17h
🎯 Predictive Coding
Show HN: Fighting the War Against Expensive Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app · 1d · Discuss: Hacker News
🔄 Meta-Learning
Optimization of interpretable hydropower reservoir operation rules by denoising diffusion probabilistic model, parallel chaotic cooperation search algorithm and...
sciencedirect.com · 10h
🎯 Predictive Coding
Tiny Recursion Models (TRM): How Tiny Networks With Recursion Beat Large Models on Hard Puzzles
pub.towardsai.net · 1h
🌳 recursive neural networks
Forge: Scalable Agent RL Framework and Algorithm
minimax.io · 19h · Discuss: Hacker News
🔄 Meta-Learning
Read, Learn, Improve
sagetheanalyst.com · 45m
🧭 Axon Guidance
AI captures particle accelerator behavior to optimize machine performance
phys.org · 14h
🧠 Neuromorphic Hardware
A Conceptual Framework for Exploration Hacking
lesswrong.com · 1d
⚡ Mechatronics
Why Modern Analytics Tools Create More Data but Less Clarity
gobbledata.com · 59m · Discuss: DEV
📡 Signal Processing
We Are the Average of Our Models
mercurialsolo.github.io · 8h
🎯 Predictive Coding
At-home movement state classification using totally implantable cortical-basal ganglia neural interface
science.org · 14h
🔌 Neural Interfaces
BetaZero V2: A Diffusion Model for Setting Boulder Problems
evmojo37.substack.com · 1d · Discuss: Substack
🎯 Predictive Coding
Show HN: Darius – An AI router that selects the best model for each prompt
withdarius.com · 6h · Discuss: Hacker News
🧠 Neuromorphic Hardware
Deciphering hippocampal place codes in weak theta rhythms
nature.com · 10h
🌊 Neural Oscillations
Feedback Control for Computer Systems
janert.org · 1d
💾 Microcontrollers