World Models

Comp.compilers: Paper: MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

 🎯Post-training
compilers.iecc.com·

Major Types of Machine Learning

 🏋️Pretraining  Content type: Blog
medium.com·

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

 🎮RL  Content type: Academic
arxiv.org·

What is MBPO? A Beginner’s Guide to Efficient Reinforcement Learning

 🎮RL  Content type: Blog

Social intelligence Arises Between Minds

 🤖AI Agents
psychologytoday.com·

Multi-agent rendezvous in fluid flows via reinforcement learning

 🎮RL  Content type: Academic
arxiv.org·

Model predictive task sampling for efficient and robust adaptation

 🎮RL  Content type: Academic
nature.com·

UniIntervene: Agentic Intervention for Efficient Real-World Reinforcement Learning

 🎮RL  Content type: Academic
arxiv.org·

Deep Reinforcement Learning for Adaptive Power Allocation in ISAC Systems with Mobile Target

 🎮RL  Content type: Academic
arxiv.org·

Reinforcement learning in linear embedding space unlocks generalizable control across soft robot configurations

 🎮RL  Content type: Academic
nature.com·

ReflectiChain: Epistemic Grounding in LLM-Driven World Models for Supply Chain Resilience

 💬LLMs  Content type: Academic
arxiv.org·

Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning

 🎮RL  Content type: Academic
arxiv.org·

Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization

 🎮RL  Content type: Academic
arxiv.org·

Geometry-Aware Reinforcement Learning for 2D Irregular Nesting

 🎮RL  Content type: Academic
arxiv.org·

SVoT: State-aware Visualization-of-Thought for Spatial Reasoning via Reinforcement Learning

 🎮RL  Content type: Academic
arxiv.org·

Deep reinforcement learning for process design: Review and perspective

 🎮RL  Content type: Academic
arxiv.org·

KinematicRL: A Sim-to-Real Reinforcement Learning Framework For Social Navigation With Kinodynamic Feasibility

 🎮RL  Content type: Academic
arxiv.org·

Towards End to End Motion Planning and Execution for Autonomous Underwater Vehicles Using Reinforcement Learning

 🎮RL  Content type: Academic
arxiv.org·

RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation

 🎮RL  Content type: Academic
arxiv.org·

UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning

 🎮RL  Content type: Academic
arxiv.org·
