🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Difficulty-Estimated Policy Optimization
arxiv.org · 1d
🔄 Meta-Learning
Fairness Aware Reward Optimization
arxiv.org · 10h
🔄 Meta-Learning
Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
dev.to · 2d · Discuss: DEV
🌳 recursive neural networks
Dynamic Metabolic Flux Optimization by Reinforcement‑Learning‑Guided Feed Control for *E. coli* Bioprocesses
**Abstract** We present a scalable framework tha ...
freederia.com · 3d
🧬 Bioengineering
Scaling AI Agents: Mastering Elasticity, State, and Throughput with C#
dev.to · 19h · Discuss: DEV
🧠 Neuromorphic Hardware
Dynamic Pedestrian Flow Optimization in Smart Tunnels Using Multi‑Agent Reinforcement Learning
**Abstract** Rapid urbanization has produced urban tunnels tha ...
freederia.com · 4d
🧠 Neuromorphic Hardware
Jokes on You AI: Turning the Tables
dev-log.me · 2d · Discuss: Hacker News
🔄 Meta-Learning
Scientists reveal the alien logic of AI: hyper-rational but stumped by simple concepts
psypost.org · 2d
🎯 Predictive Coding
Teaching AI to talk to itself could make machines learn faster
earth.com · 4d
🔄 Meta-Learning
AIMATDESIGN: knowledge-augmented reinforcement learning for inverse materials design under data scarcity
nature.com · 5d
🔬 Materials Science
Self-Learning AI Agents: A High-Level Overview
digitalocean.com · 6d
🔄 Meta-Learning
AI Agents 2.0: AI Agents that can Learn (6 learning types that make memory persistent)
pub.towardsai.net · 4d
🧠 Neuromorphic Hardware
Why do tree-based models still outperform deep learning on tabular data?
paperium.net · 2d · Discuss: DEV
🌳 recursive neural networks
Loss Distribution Collapse: A Structural Theory of Dataset Degradation
zenodo.org · 4d · Discuss: Hacker News
🔄 Meta-Learning
Is Your Machine Learning Pipeline as Efficient as it Could Be?
kdnuggets.com · 4d
🤖 Machine Learning
Humane, adaptive AI bootstrapping
natemeyvis.com · 4d
🔄 Meta-Learning
Listen to Yourself
thestoicmanual.com · 3d
🔗 Synaptic Plasticity
Building the Future with AI That Acts
devxt.com · 2d · Discuss: Hacker News
🧠 Neuromorphic Hardware
In (highly contingent!) defense of interpretability-in-the-loop ML training
lesswrong.com · 3d
🎯 Predictive Coding
Writing an LLM from scratch, part 32d -- Interventions: adding attention bias
gilesthomas.com · 3d · Discuss: Hacker News
🔄 Meta-Learning