Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃幆 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112193
posts in
339.4
ms
Think Longer to Explore Deeper: Learn to Explore In-Context via
Length-Incentivized
Reinforcement Learning
arxiv.org
路
10h
馃攧
Meta-Learning
Goal-Conditioned
Reinforcement Learning from Sub-Optimal Data on
Metric
Spaces
arxiv.org
路
1d
馃尦
recursive neural networks
The
democratization
of AI data
poisoning
and how to protect your organization
csoonline.com
路
4h
馃敀
Cybersecurity
A training
principle
for
drifting
models
breno.bearblog.dev
路
1d
馃攧
Meta-Learning
Generalized
Lanczos
method for systematic optimization of neural-network quantum states
link.aps.org
路
1d
馃
Neuromorphic Computing
Product
Forecasting
through Time Series Analysis (
Modelling
)
pub.towardsai.net
路
15h
馃幆
Predictive Coding
Determining
the Chemical Potential via Universal
Density
Functional Learning
journals.aps.org
路
1d
馃幆
Predictive Coding
How to ground AI agents in
accurate
,
context-rich
data
thenewstack.io
路
2h
馃
Machine Learning
Ai鈥檚
Inner
Workings
Revealed By Model Trained On One Billion Data Points
quantumzeitgeist.com
路
23h
馃幆
Predictive Coding
AI
Inference
Needs A
Mix-And-Match
Memory Strategy
semiengineering.com
路
1d
馃
Neuromorphic Hardware
ashworks1706/rlhf-from-scratch
: A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it鈥檚 applications in Large Language Models from scratch.
github.com
路
3d
路
Discuss:
Hacker News
馃幆
Predictive Coding
Diffusion Models for
ARC-AGI
: A
Retrospective
christopherhwood.com
路
1d
路
Discuss:
Hacker News
馃幆
Predictive Coding
Antigravity
: Beyond the
Basics
of AI Coding
dev.to
路
7h
路
Discuss:
DEV
馃幆
Predictive Coding
The Ultimate Guide to
Spaced
Repetition
: How to Remember Anything Forever
brainrash.com
路
2d
路
Discuss:
DEV
馃敆
Synaptic Plasticity
The
Perceptron
blog.engora.com
路
1d
路
Discuss:
Hacker News
馃幆
Predictive Coding
The
Sour
Lesson: A Guide to Building
AGI-Pilled
Products
chrislovejoy.me
路
1d
路
Discuss:
Hacker News
馃攧
Meta-Learning
Owning
the AI
Pareto
Frontier
latent.space
路
17h
馃
Neuromorphic Hardware
Artificial Intelligence and the
Passivity
Problem
psychologytoday.com
路
21h
馃尡
Neuroplasticity
Training A Small Language Model To
Outperform
Frontier Models On
CRM-Arena
neurometric.substack.com
路
1d
路
Discuss:
Substack
馃攧
Meta-Learning
Custom
AI
Platforms
trendhunter.com
路
16h
馃捑
Microcontrollers
Sign up or log in to see more results
Sign Up
Login
« Page 2
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help