World Models

Feeds to Scour
SubscribedAll
Scoured 380 posts in 6.2 ms

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

PLUME: Probabilistic Latent Unified World Modeling and Parameter Estimation for Multi-Finger Manipulation

 🦿Robot Learning  Content type: Academic
arxiv.org·

HARBOR: A Harness Framework for Agentic Robot Reinforcement Learning

 🦿Robot Learning  Content type: Academic
arxiv.org·

Learning Object Manipulation from Scratch via Contrastive Interaction

 🦾Robotics  Content type: Academic
arxiv.org·

Blind Dexterous Grasping via Real2Sim2Real Tactile Policy Learning

 🦿Robot Learning  Content type: Academic
arxiv.org·

PLAN-S: Bridging Planning with Latent Style Dynamics for Autonomous Driving World Models

 👁️VLA Models  Content type: Academic
arxiv.org·

Multi-agent rendezvous in fluid flows via reinforcement learning

 ♟️Game Theory  Content type: Academic
arxiv.org·

Geometry-Aware Reinforcement Learning for 2D Irregular Nesting

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective

 🦿Robot Learning  Content type: Academic
arxiv.org·

UniIntervene: Agentic Intervention for Efficient Real-World Reinforcement Learning

 🦾Robotics  Content type: Academic
arxiv.org·

Deep reinforcement learning for process design: Review and perspective

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

One Lens, Many Worlds : A Capability-Typed Interface for World-Model Interpretability

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

Discrete-WAM: Unified Discrete Vision-Action Token Editing for World-Policy Learning

 👁️VLA Models  Content type: Academic
arxiv.org·

SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning

 🎯Reinforcement Learning  Content type: Academic
arxiv.org·

ReflectiChain: Epistemic Grounding in LLM-Driven World Models for Supply Chain Resilience

 📄AI Research  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help