Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
112612
posts in
504.7
ms
Can We Really Learn One Representation to
Optimize
All
Rewards
?
arxiv.org
·
1d
💬
Prompt Engineering
check out this
article
on Reinforcement Learning with R:
Origins
, Real-Life Applications, and Practical Implementation
dev.to
·
3d
·
Discuss:
DEV
💬
Prompt Engineering
Learning beyond Teacher:
Generalized
On-Policy Distillation with Reward
Extrapolation
arxiv.org
·
1d
·
Discuss:
Hacker News
🗣️
LLMs
Multi-armed
bandit
en.wikipedia.org
·
13h
💬
Prompt Engineering
Optimizing post-disaster road
restoration
with reinforcement learning: A
traveler-behavior-aware
approach
sciencedirect.com
·
1d
💬
Prompt Engineering
The
implementation
for the
drifting
model
breno.bearblog.dev
·
19h
💬
Prompt Engineering
Show HN:
Fighting
the War Against
Expensive
Reinforcement Learning
cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app
·
1d
·
Discuss:
Hacker News
💬
Prompt Engineering
Optimization of interpretable
hydropower
reservoir operation rules by
denoising
diffusion probabilistic model, parallel chaotic cooperation search algorithm and...
sciencedirect.com
·
12h
🤖
AI
Tiny
Recursion
Models (
TRM
): How Tiny Networks With
Recursion
Beat Large Models on Hard Puzzles
pub.towardsai.net
·
3h
💬
Prompt Engineering
Forge
: Scalable Agent
RL
Framework and Algorithm
minimax.io
·
21h
·
Discuss:
Hacker News
💬
Prompt Engineering
Read, Learn,
Improve
sagetheanalyst.com
·
2h
🌿
Digital Gardens
AI captures
particle
accelerator
behavior to optimize machine performance
phys.org
·
15h
💬
Prompt Engineering
A
Conceptual
Framework for Exploration
Hacking
lesswrong.com
·
1d
💬
Prompt Engineering
Why Modern
Analytics
Tools Create More Data but Less
Clarity
gobbledata.com
·
2h
·
Discuss:
DEV
📊
Data Science
We Are the
Average
of Our Models
mercurialsolo.github.io
·
10h
🗣️
LLMs
At-home movement state classification using totally
implantable
cortical-basal
ganglia
neural interface
science.org
·
16h
🧠
Cognitive Science
BetaZero
V2: A Diffusion Model for Setting
Boulder
Problems
evmojo37.substack.com
·
1d
·
Discuss:
Substack
🗣️
LLMs
Show HN:
Darius
– An AI router that
selects
the best model for each prompt
withdarius.com
·
8h
·
Discuss:
Hacker News
💬
Prompt Engineering
Deciphering
hippocampal
place codes in weak
theta
rhythms
nature.com
·
12h
🧠
Cognitive Science
Feedback
Control for Computer Systems
janert.org
·
1d
🐚
Shell Scripting
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help