🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Scoured 121442 posts in 1.68 s

learning by reverse engineering
clymup.com · 4d
💬 Prompt Engineering

The AI Training Asymmetry
tostracker.app · 4d · Discuss: Hacker News
🤖 AI

Tips
lonestation.itch.io · 4d
🌊 Stress Management

Exploiting large language model with reinforcement learning for generative job recommendations
eurekalert.org · 6d
🗣️ LLMs

userface.ai
userface.ai · 3d
🤖 AI

Human-like Search for Modern Applications
anvitra.ai · 4d · Discuss: Hacker News
💬 Prompt Engineering

AI and the future of work: Measuring AI-driven productivity gains for workplace tasks
aisi.gov.uk · 3d
💬 Prompt Engineering

Adversarial Reasoning: Multiagent World Models for closing the Simulation Gap
latent.space · 4d · Discuss: Hacker News, Hacker News
💬 Prompt Engineering

AI Workflows
chatprd.ai · 3d
💬 Prompt Engineering

AI-powered Customer Research
strella.io · 3d
💬 Prompt Engineering

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks
sonomarpa.sonoma.lib.ca.us · 5d
💬 Prompt Engineering

A GTM guide to AI models
revengine.substack.com · 4d · Discuss: Substack
💬 Prompt Engineering

Home
physicsgraph.com · 4d
🌿 Digital Gardens

30 Agentic AI Interview Questions and Answers: From Beginner to Advanced
analyticsvidhya.com · 4d
💬 Prompt Engineering

Adaptive Intelligence 2026: The Rise of Continual Learning & The End of Frozen AI Models?
mail.bycloud.ai · 5d
💬 Prompt Engineering

Jokes on You AI: Turning the Tables
dev-log.me · 3d · Discuss: Hacker News
💬 Prompt Engineering

Proposal: A Framework for Discovering Alien Physics via Optimal Compression
lesswrong.com · 5d
💬 Prompt Engineering

**Abstract:** This paper introduces a novel approach to automated credit risk assessment and early warning systems leveraging a hierarchical Bayesian network...
freederia.com · 5d
🧠 Machine Learning

Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models
arxiv.org · 2d
💬 Prompt Engineering

Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
dev.to · 4d · Discuss: DEV
🗣️ LLMs