🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Dynamic Regret via Discounted-to-Dynamic Reduction with Applications to Curved Losses and Adam Optimizer
arxiv.org · 13h
🌳 recursive neural networks
Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models
arxiv.org · 13h
🔄 Meta-Learning
Skills: teaching AI agents to act consistently
dev.to · 23h · Discuss: DEV
🎯 Predictive Coding
Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
dev.to · 2d · Discuss: DEV
🌳 recursive neural networks
Dynamic Pedestrian Flow Optimization in Smart Tunnels Using Multi‑Agent Reinforcement Learning
Abstract: Rapid urbanization has produced urban tunnels tha ...
freederia.com · 4d
🧠 Neuromorphic Hardware
Hierarchical Reinforcement Learning for Multi‑Arm Collaborative Assembly of Aerospace Composite Panels: Joint Kinematic Constraint‑Aware Policy with Curriculum‑Based Reward Shaping
freederia.com · 4d
🎯 Predictive Coding
Nonlinear random walks on hypergraphs characterized by higher-order interactions
sciencedirect.com · 3d
🧠 Neuromorphic Hardware
AI Agents 2.0: AI Agents that can Learn (6 learning types that make memory persistent)
pub.towardsai.net · 4d
🧠 Neuromorphic Hardware
Loss Distribution Collapse: A Structural Theory of Dataset Degradation
zenodo.org · 4d · Discuss: Hacker News
🔄 Meta-Learning
Why do tree-based models still outperform deep learning on tabular data?
paperium.net · 2d · Discuss: DEV
🌳 recursive neural networks
Humane, adaptive AI bootstrapping
natemeyvis.com · 4d
🔄 Meta-Learning
Listen to Yourself
thestoicmanual.com · 3d
🔗 Synaptic Plasticity
Building the Future with AI That Acts
devxt.com · 2d · Discuss: Hacker News
🧠 Neuromorphic Hardware
Self-Learning AI Agents: A High-Level Overview
digitalocean.com · 6d
🔄 Meta-Learning
In (highly contingent!) defense of interpretability-in-the-loop ML training
lesswrong.com · 4d
🎯 Predictive Coding
Writing an LLM from scratch, part 32d -- Interventions: adding attention bias
gilesthomas.com · 3d · Discuss: Hacker News
🔄 Meta-Learning
Neural population geometry and optimal coding of tasks with shared latent structure
nature.com · 4d
🎯 Predictive Coding
Text classification with Python 3.14's zstd module • Max Halford
maxhalford.github.io · 4d · Discuss: Lobsters, Hacker News
🌳 recursive neural networks
Your AI Companion
pocketmindai.com · 4d · Discuss: r/InternetIsBeautiful
🧠 Neuromorphic Hardware
Understanding LLM Inference Engines: Inside Nano-vLLM (Part 2)
neutree.ai · 4d · Discuss: Hacker News
🧠 Neuromorphic Hardware