Skip to main content
Scour
Browse
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃幆 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80762
posts in
1.09
s
Reinforcement
Learning via
Self-Distillation
arxiv.org
路
20h
馃攧
Meta-Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
**Abstract:** This paper introduces a novel framework, Hyperdimensional Semantic Representation and Reinforcement Learning (
HDS-RL
), for AI-driven
personaliz
...
freederia.com
路
5h
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Learning to
Execute
dev.to
路
11h
路
Discuss:
DEV
馃尦
recursive neural networks
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Vector-Valued
Distributional
Reinforcement Learning Policy Evaluation: A
Hilbert
Space Embedding Approach
arxiv.org
路
1d
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Specification-Guided
Reinforcement Learning
cacm.acm.org
路
1d
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Reinforcement
Learning with
GRPO
blog.nilenso.com
路
2d
馃攧
Meta-Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
How Reinforcement Learning and Stable Diffusion Are Being
Combined
to
Simulate
Game Worlds
hackernoon.com
路
1d
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
**Abstract:** This research explores a novel reinforcement learning (RL) framework for dynamic parameter optimization in real-time image
enhancement
algorith
...
freederia.com
路
1h
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Fitness-Seekers
:
Generalizing
the Reward-Seeking Threat Model
lesswrong.com
路
5h
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
andrewthecodertx/go-neural-network
: Feed forward neural network with back
propagation
and activation functions built from scratch (no libraries)
github.com
路
17m
路
Discuss:
r/golang
馃尦
recursive neural networks
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Attention
Optimization
aussieai.com
路
3h
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Hippocampus
Predicts Rewards by
Reorganizing
Memories
neurosciencenews.com
路
2h
馃
Neuroscience
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
AI search framework that
teaches
AI models to think like experts
blogs.cisco.com
路
12h
馃攧
Meta-Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
WTF
is
Explainable
Reinforcement Learning?
dev.to
路
16h
路
Discuss:
DEV
馃攧
Meta-Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
We Built an Optimization
Engine
- and
Realized
Optimization Was the Wrong Problem
kiploks.com
路
9h
路
Discuss:
DEV
馃М
Algorithms
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
A Complete Guide to Neural Network
Optimizers
chizkidd.github.io
路
3d
路
Discuss:
Hacker News
馃
Machine Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Are We in a
Continual
Learning
Overhang
?
lesswrong.com
路
8h
馃
Neuromorphic Computing
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Learning to
Execute
paperium.net
路
11h
路
Discuss:
DEV
馃攧
Meta-Learning
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Effects of human performance on ship collision risk in
restricted
waters
: A Bayesian network driven by real navigation data
sciencedirect.com
路
12h
馃幆
Predictive Coding
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Home
Masonry
|
Tracking
the latest trends in software development and technology
matrixtrak.com
路
1d
路
Discuss:
r/programming
馃捑
Microcontrollers
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help