Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎮 Reinforcement Learning
Q-Learning, Policy Gradients, Environments, Rewards
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
123642
posts in
933.9
ms
ADORA
: Training Reasoning Models with Dynamic
Advantage
Estimation on Reinforcement Learning
arxiv.org
·
9h
🤖
AI
Squeezing
More from the Stream : Learning
Representation
Online for Streaming Reinforcement Learning
arxiv.org
·
9h
🧭
Vector Databases
Technology is a tool, not a
replacement
for experience
healio.com
·
18h
🔧
Feature Engineering
Article
: From Prompts to Production: A
Playbook
for Agentic Development
infoq.com
·
5h
🌐
Distributed Systems
Advancing
AI
benchmarking
with Game Arena
dev.to
·
19h
·
Discuss:
DEV
🤖
AI
The Potential of
RLMs
dbreunig.com
·
1d
·
Discuss:
Hacker News
🌐
Distributed Systems
AI to
ROI
: Case Study
ai2roi.substack.com
·
1d
·
Discuss:
Substack
🔧
Feature Engineering
StellarSk8board/bardacle
: A metacognitive layer for AI agents - short-term memory that survives context loss
github.com
·
23h
·
Discuss:
Hacker News
🤖
AI
(8) AI Meets Brain: Memory Systems from
Cognitive
Neuroscience
to Autonomous Agents
arxiviq.substack.com
·
2d
·
Discuss:
Substack
🤖
AI
Risk-preference-aware
optimal scheduling and profit allocation of load
aggregators
and charging operators
sciencedirect.com
·
17h
⚡
Query Optimization
Handing
Power to Machines: The
Unresolved
Dilemma of AI Agents
smarterarticles.co.uk
·
12h
🤖
AI
AI tools that are actually
useful
fastcompany.com
·
1h
🤖
AI
Large Language Models for
Mortals
book
andrewpwheeler.com
·
4h
🔀
Transformers
Multi-Dimensional
Computational
Library for Physics-Aware AI
splitfxm.com
·
1d
·
Discuss:
Hacker News
🧭
Vector Databases
20
Agent-focused
Experiments
fitziswriting.substack.com
·
1d
·
Discuss:
Substack
🤖
AI
The
Behavioral
Shift Matrix: 4 Forces Reshaping Customer
Retention
cmswire.com
·
1d
🔧
Feature Engineering
New Research: How AI
Transforms
$400 Billion Of Corporate Learning JOSH
BERSIN
joshbersin.com
·
3h
🔀
Transformers
Enterprise AI Agent
Stack
: Agentic AI Architecture Where Context
Beats
Models
philippdubach.com
·
1d
🌐
Distributed Systems
Building Production-Ready AI
Chatbots
: Lessons from 6 Months of
Failure
lojiq.ai
·
20h
·
Discuss:
DEV
🔀
Transformers
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
·
6d
🌐
Distributed Systems
Loading...
Loading more...
« Page 3
•
Page 5 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help