Feeds to Scour
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
arxiv.org·1d
💬LLM
**Abstract:** This paper presents a novel approach to dynamic task allocation in multi-robot systems leveraging multi-objective reinforcement learning (MORL)...
freederia.com·20h
🤖AI
Distributed Reinforcement Learning for Scalable High-Performance Policy Optimization
towardsdatascience.com·2d
🔥PyTorch
Adaptive Rollout Allocation for Online Reinforcement Learning with Verifiable Rewards
arxiv.org·1d
💬LLM
Agentic AI - Building Intelligent Agents
dev.to·11h·
Discuss: DEV
🤖AI
Massively Parallel Methods for Deep Reinforcement Learning
dev.to·2d·
Discuss: DEV
🔥PyTorch
Self-Optimizing Football Chatbot Guided by Domain Experts on Databricks
databricks.com·11h
💬LLM
**Abstract:** This paper proposes a novel approach for optimizing compiler performance in resource-constrained embedded systems using Reinforcement Learning ...
freederia.com·1d
🤖Machine Learning
Thoughts on Toby Ord's AI Scaling Series
lesswrong.com·4h
🤖Machine Learning
Swe-Replay Achieves 17.4% Performance Gain With Efficient Test-Time Scaling For Agents
quantumzeitgeist.com·17h
💬LLM
SDPO: Reinforcement Learning via Self-Distillation
self-distillation.github.io·2d·
Discuss: r/LocalLLaMA
🤖Machine Learning
Axiomeer – An open marketplace for AI agents
news.ycombinator.com·1d·
Discuss: Hacker News
🤖AI
Specification-Guided Reinforcement Learning
cacm.acm.org·6d
🤖AI
Context Engineering & Agent Memory Platform for AI Agents
getzep.com·5h
🤖AI
The Dual Pillars of Embodied Autonomy: A Technical Deep Dive into Language-Action Models and…
pub.towardsai.net·1h
💬LLM
The 3Cs: A Framework for AI Agent Security
docker.com·3h
🤖AI
The Gumbel-Max Trick
blog.quipu-strands.com·11h·
Discuss: Hacker News
🤖AI
Selection Rather Than Prediction
voratiq.com·1d·
Discuss: Hacker News
🤖AI
Routing in a Sparse Graph: a Distributed Q-Learning Approach
towardsdatascience.com·12h
🤖AI
Stop building systems for agents, build systems for human
blog.xiangpeng.systems·1d·
Discuss: Hacker News
💬LLM
