Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80518
posts in
329.6
ms
The Optimal Token
Baseline
:
Variance
Reduction for Long-Horizon LLM-RL
arxiv.org
·
11h
🔄
Meta-Learning
Reinforcement
Learning from Human
Feedback
arxiv.org
·
3d
🎯
Predictive Coding
Scaling AI Agents: Mastering
Elasticity
, State, and
Throughput
with C#
dev.to
·
20h
·
Discuss:
DEV
🧠
Neuromorphic Hardware
Linear
Regression
: An
Overview
dev.to
·
4d
·
Discuss:
DEV
🤖
Machine Learning
Hierarchical Reinforcement Learning for Multi‑Arm Collaborative Assembly of Aerospace Composite Panels: Joint
Kinematic
Constraint
‑Aware Policy with Curriculum‑Based Reward Shaping
freederia.com
·
4d
🎯
Predictive Coding
**Abstract:** Current brain-computer interfaces (BCIs) for environmental control in
quadriplegia
often suffer from limited adaptability to fluctuating
cognit
...
freederia.com
·
4d
🔌
Neural Interfaces
Is Your Machine Learning
Pipeline
as Efficient as it Could Be?
kdnuggets.com
·
4d
🤖
Machine Learning
Humane
, adaptive AI
bootstrapping
natemeyvis.com
·
4d
🔄
Meta-Learning
Listen
to
Yourself
thestoicmanual.com
·
3d
🔗
Synaptic Plasticity
Building the Future with AI That
Acts
devxt.com
·
2d
·
Discuss:
Hacker News
🧠
Neuromorphic Hardware
In (highly
contingent
!) defense of
interpretability-in-the-loop
ML training
lesswrong.com
·
3d
🎯
Predictive Coding
Writing an LLM from scratch, part
32d
--
Interventions
: adding attention bias
gilesthomas.com
·
3d
·
Discuss:
Hacker News
🔄
Meta-Learning
Neural population
geometry
and optimal coding of tasks with shared
latent
structure
nature.com
·
4d
🎯
Predictive Coding
Your AI
Companion
pocketmindai.com
·
4d
·
Discuss:
r/InternetIsBeautiful
🧠
Neuromorphic Hardware
Text classification with Python 3.14's
zstd
module • Max
Halford
maxhalford.github.io
·
4d
·
Discuss:
Lobsters
,
Hacker News
🌳
recursive neural networks
Understanding LLM Inference
Engines
: Inside
Nano-vLLM
(Part 2)
neutree.ai
·
4d
·
Discuss:
Hacker News
🧠
Neuromorphic Hardware
Growth through Games
pctmagazine.org
·
4d
🧭
Axon Guidance
Adaptive marketing:
Proven
strategies
for growing companies
blog.hubspot.com
·
3d
🔄
Meta-Learning
Unsupervised
Learning NO. 515
newsletter.danielmiessler.com
·
3d
🎯
Predictive Coding
Boundary
Engineering
cabreza.substack.com
·
3d
·
Discuss:
Substack
⚙
Engineering
Loading...
Loading more...
« Page 8
•
Page 10 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help