Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123858
posts in
384.6
ms
Kiro
: DeepSeek,
MiniMax
, and Qwen now available as open weight model options
kiro.dev
·
41m
·
Discuss:
Hacker News
🧭
Vector Databases
Robots
That Can See Around
Corners
Using Radio Signals and AI
seas.upenn.edu
·
2h
·
Discuss:
Hacker News
🤖
AI
Grounding
LTL
Tasks in Sub-Symbolic RL Environments for Zero-Shot Generalization
arxiv.org
·
11h
🔀
Transformers
Technology is a tool, not a
replacement
for experience
healio.com
·
20h
🔧
Feature Engineering
The
Behavioral
Shift Matrix: 4 Forces Reshaping Customer
Retention
cmswire.com
·
1d
🔧
Feature Engineering
A data-efficient foundation model for
porous
materials based on expert-guided
supervised
learning
nature.com
·
3h
🧭
Vector Databases
The
Feynman
Technique 2026: A Cognitive Algorithm to Kill the 'Illusion of
Competence
'
dev.to
·
1h
·
Discuss:
DEV
🤖
AI
New Research: How AI
Transforms
$400 Billion Of Corporate Learning JOSH
BERSIN
joshbersin.com
·
5h
🔀
Transformers
Enterprise AI Agent
Stack
: Agentic AI Architecture Where Context
Beats
Models
philippdubach.com
·
1d
·
Discuss:
Hacker News
🌐
Distributed Systems
Building Production-Ready AI
Chatbots
: Lessons from 6 Months of
Failure
lojiq.ai
·
22h
·
Discuss:
DEV
🔀
Transformers
What Are LLM Parameters? A Simple Explanation of
Weights
,
Biases
, and Scale
pub.towardsai.net
·
12h
🤖
ML
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
·
6d
🌐
Distributed Systems
Agentic
Interactions
linkedin.com
·
3h
🔀
Transformers
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time
Obstacle
Prediction **
Abstra
...
freederia.com
·
5d
🤖
AI
Training a
drifting
model
breno.bearblog.dev
·
1d
🤖
AI
Preference
Conditioned
Multi-Objective Reinforcement Learning:
Decomposed
, Diversity-Driven Policy Optimization
arxiv.org
·
1d
🤖
AI
The AI Bond Tsunami:
Hyperscalers
Rewrite The Credit Playbook (
NDX
)
seekingalpha.com
·
4h
🤖
AI
From Automation To
Autonomy
: AI For The
CFO
And Supply Chain Finance
forbes.com
·
23h
🤖
AI
Frequency-domain approach to automated and efficient
multivariate
kernel density estimation for
probabilistic
modeling
sciencedirect.com
·
1d
🔧
Feature Engineering
Introspective
RSI vs
Extrospective
RSI
lesswrong.com
·
4h
🔧
Feature Engineering
Loading...
Loading more...
« Page 4
•
Page 6 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help