🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Scoured 124016 posts in 2.05 s
Building a “Second Brain” – A Functional Knowledge Stack with Obsidian
blopig.com · 1d
🌐 Distributed Systems
Opus 4.6 Reasoning Distill 3k prompts
huggingface.co · 1d · Discuss: r/LocalLLaMA
⚡ Query Optimization
Show HN: Find automation ideas and creators by sharing your business problem
humation.ai · 1d · Discuss: Hacker News
🤖 AI
Why securing AI model weights isn’t enough
the-substrate.net · 1d · Discuss: Hacker News
🤖 AI
A GTM guide to AI models
revengine.substack.com · 4d · Discuss: Substack
🔀 Transformers
How We Give AI Agents Long-Term Memory Without Blowing the Budget
metaduck.com · 2d · Discuss: DEV, Hacker News
🏗️ Data Engineering
When AI goes haywire: The case of the skyscraper and the slide trombone
techxplore.com · 2d
🤖 AI
[Productivity Game] SUMMARY: The Almanack of Naval Ravikant
kill-the-newsletter.com · 1d
🤖 AI
On Economics of A(S)I Agents
lesswrong.com · 4d
🤖 AI
Show HN: I built a library of Claude skills for growth marketers
github.com · 1d · Discuss: Hacker News
🔧 Feature Engineering
It Is Reasonable To Research How To Use Model Internals In Training
lesswrong.com · 3d
🔀 Transformers
Deep Reinforcement Learning for Intuitive Human-Robot Collaboration: Shared Cognitive Mapping via Dynamic Bayesian Fusion of Affordance Prediction and Goal Inference
freederia.com · 5d
🔀 Transformers
Cursor Rules: Pay More Upfront, Iterate Less Later
dev.to · 1d · Discuss: DEV
⚡ Query Optimization
Projected Gradient Ascent for Efficient Reward-Guided Updates with One-Step Generative Models
arxiv.org · 1d
🔀 Transformers
PRoFL-IoV: A privacy-preserving and robust federated learning framework for short-term load forecasting in the internet of vehicles
sciencedirect.com · 23h
📈 Time Series
Abstract: This paper introduces a novel approach to automated credit risk assessment and early warning systems leveraging a hierarchical Bayesian network...
freederia.com · 4d
🤖 AI
rawwerks/rlm-cli: CLI for Recursive Language Models
github.com · 1d
🔍 Query Languages & APIs
Why No Single AI Should Ever Decide Alone
dev.to · 2d · Discuss: DEV
🌐 Distributed Systems
Energy-efficient robust control of vehicle platoons under cut-in disturbances: Integrating temporal-aware policy and barrier-constrained search
sciencedirect.com · 6h
🌐 Distributed Systems
Homing through Reinforcement Learning
arxiv.org · 1d
🤖 AI