🎯 reinforcement learning
artificial intelligence, deep learning
Scoured 185434 posts in 25.9 ms
Policy Improvement Reinforcement Learning
🏋️ Isaac Gym · arxiv.org · 2d
How does Reinforcement Learning Affect Models
🤖 llm · lesswrong.com · 4d
Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents
🤖 llm · machinelearning.apple.com · 19h
[R] Dense process rewards from LLM feedback for multi-agent credit assignment
🤖 llm · reddit.com · 58m · r/reinforcementlearning
There Will Be a Scientific Theory of Deep Learning
📱 Edge AI · mail.bycloud.ai · 2d
Why agentic AI governance is falling short – and what we can do about it
📱 Edge AI · siliconangle.com · 15m
Is your AI strategy missing a "Safety Net"?🛡️
📱 Edge AI · turingpost.com · 22h
DEEP Robotics
⛏️ Autonomous Mining · youtube.com · 4d · r/singularity
The Data Layer Tax for Robot Learning
📱 Edge AI · rerun.io · 1d · Hacker News
Extrapolating optimal selective maintenance strategy in new environments: A meta-reinforcement learning approach
🔮 Predictive Maintenance · sciencedirect.com · 5h
A game-theoretic framework for multimodal information utilization under heterogeneous processing environments in neuroscience and perception science
👁️🗨️ Multimodal Sensing · frontiersin.org · 14h
Alibaba's Metis agent cuts redundant AI tool calls from 98% to 2% — and gets more accurate doing it
📱 Edge AI · venturebeat.com · 22h
Deep Learning Weekly: Issue 453
📱 Edge AI · deeplearningweekly.com · 1d
Artificial Intelligence: Foundations of Computational Agents
🤖 llm · artint.info · 4d · Hacker News
Reinforcement fine-tuning with LLM-as-a-judge
🤖 llm · aws.amazon.com · 23h
Synthesized Command & Control: A new way human choices can guide AI warfighting
🎛️ Control theory · breakingdefense.com · 4h
The Next 5 Years of AI: Tools, Agents, and Automation
📱 Edge AI · medium.com · 2d
Every Model Learned by Gradient Descent Is Approximately a Kernel Machine
📱 Edge AI · news.ycombinator.com · 19h · Hacker News
https://research.perplexity.ai/articles/designing-refining-and-maintaining-agent-skills-at-perplexity
🤖 llm · research.perplexity.ai · 1h
Complementary Intelligence
📱 Edge AI · togelius.blogspot.com · 6d