HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining

Covered by topicqueue.substack.com

Embodied foundation models are expected to benefit from data scaling like large language models, but face a much tighter data bottleneck. Teleoperated real-robot trajectories remain the dominant pretraining source due to their precise action supervision and embodiment alignment, yet their scalability is limited by high collection cost, acquisition difficulty, and low behavioral and environmental diversity. These limitations have sparked interest...


Covered in 1 article: Hours of Humanoid Teleop, Recorded in Real Homes (topicqueue.substack.com)