Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112430
posts in
413.7
ms
Optimistic
Training and
Convergence
of Q-Learning -- Extended Version
arxiv.org
·
4d
🔄
Meta-Learning
Think Longer to Explore Deeper: Learn to Explore In-Context via
Length-Incentivized
Reinforcement Learning
arxiv.org
·
16h
🔄
Meta-Learning
MiniMaxAI/MiniMax-M2.5
huggingface.co
·
6h
·
Discuss:
Hacker News
,
r/LocalLLaMA
🎯
Predictive Coding
A “
Toolbox
”
Pipeline
for Robots That See, Read, and Act
hackernoon.com
·
20h
🔌
Neural Interfaces
Scaling
LLM Post-Training at Netflix
netflixtechblog.com
·
12h
🔄
Meta-Learning
Multi objective optimization of a discrete fracture
geothermal
reservoir using
Bi-LSTM
network
sciencedirect.com
·
2h
🌳
recursive neural networks
Shel-y/q-drift
: Quantum-inspired CLI to analyze structural
fragility
and decision drift in distributed systems using Shannon Entropy and Signal Decay models.
github.com
·
12h
·
Discuss:
DEV
📡
Signal Processing
Olmix
: A framework for data mixing throughout
LM
development
allenai.org
·
4h
🎯
Predictive Coding
GLM-5
: Targeting complex systems engineering and
long-horizon
agentic tasks
news.ycombinator.com
·
1h
·
Discuss:
Hacker News
🔄
Meta-Learning
A training
principle
for
drifting
models
breno.bearblog.dev
·
1d
🔄
Meta-Learning
Generalized
Lanczos
method for systematic optimization of neural-network quantum states
link.aps.org
·
1d
🧠
Neuromorphic Computing
The
democratization
of AI data
poisoning
and how to protect your organization
csoonline.com
·
10h
🔒
Cybersecurity
Product
Forecasting
through Time Series Analysis (
Modelling
)
pub.towardsai.net
·
21h
🎯
Predictive Coding
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
6d
🎯
Predictive Coding
Recursive
self-improvement
from AI models
marginalrevolution.com
·
3d
·
Discuss:
Hacker News
🌳
recursive neural networks
Human-like
metacognitive
skills will reduce LLM
slop
and aid alignment and capabilities
lesswrong.com
·
1d
🔄
Meta-Learning
How to ground AI agents in
accurate
,
context-rich
data
thenewstack.io
·
8h
🤖
Machine Learning
Ai’s
Inner
Workings
Revealed By Model Trained On One Billion Data Points
quantumzeitgeist.com
·
1d
🎯
Predictive Coding
AI
Inference
Needs A
Mix-And-Match
Memory Strategy
semiengineering.com
·
1d
🧠
Neuromorphic Hardware
The
Perceptron
blog.engora.com
·
2d
·
Discuss:
Hacker News
🎯
Predictive Coding
Sign up or log in to see more results
Sign Up
Login
« Page 2
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help