Garbage In, Garbage Out: The Case for Better Robot Data Understanding
huggingface.co

Low-quality robot data → poor robot performance.

Robot data collection is expensive: it requires hundreds of hours of teleoperation by human experts.

At the same time, collecting high-quality robot data is difficult, even for a highly skilled teleoperator. For example, idle stretches can appear in a trajectory when the teleoperator pauses, and poor lighting can reduce visual clarity.

Precisely defining a high-quality training example is complicated (e.g. does a dark video increase policy resilience or reduce performance?), but a few quality indicators can still provide an insightful snapshot of your dataset. Data understanding is the first step to data improvement.
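To make the idea of a quality indicator concrete, here is a minimal sketch of two checks suggested by the examples above: the share of idle timesteps in a trajectory and the average brightness of its video frames. It is not the toolkit's implementation. The function names, the input layout (lists of joint positions and grayscale frames with values in 0 to 255), and the thresholds `eps`, `max_idle`, and `min_brightness` are all assumptions chosen for illustration.

```python
from typing import Dict, Sequence, Union

Frame = Sequence[Sequence[float]]  # assumed: one grayscale image, values in [0, 255]


def idle_fraction(states: Sequence[Sequence[float]], eps: float = 1e-3) -> float:
    """Fraction of consecutive timestep pairs in which no joint moves more than eps."""
    if len(states) < 2:
        return 0.0
    idle = 0
    for prev, curr in zip(states, states[1:]):
        # A step counts as idle when the largest per-joint change is within eps.
        if max(abs(c - p) for p, c in zip(prev, curr)) <= eps:
            idle += 1
    return idle / (len(states) - 1)


def mean_brightness(frame: Frame) -> float:
    """Mean pixel intensity of a single grayscale frame."""
    total = sum(sum(row) for row in frame)
    count = sum(len(row) for row in frame)
    return total / count if count else 0.0


def flag_episode(
    states: Sequence[Sequence[float]],
    frames: Sequence[Frame],
    max_idle: float = 0.5,         # hypothetical threshold, tune per dataset
    min_brightness: float = 40.0,  # hypothetical threshold, tune per camera
) -> Dict[str, Union[float, bool]]:
    """Compute both indicators for one episode and flag it against the thresholds."""
    idle = idle_fraction(states)
    # Average the per-frame brightness over the whole episode.
    brightness = (
        sum(mean_brightness(f) for f in frames) / len(frames) if frames else 0.0
    )
    return {
        "idle_fraction": idle,
        "mean_brightness": brightness,
        "too_idle": idle > max_idle,
        "too_dark": brightness < min_brightness,
    }


if __name__ == "__main__":
    # Toy episode: two joints that move only once, and one very dark 2x2 frame.
    states = [[0.0, 0.0], [0.0, 0.0], [0.1, 0.0], [0.1, 0.0], [0.1, 0.0]]
    frames = [[[10, 20], [30, 40]]]
    print(flag_episode(states, frames))
```

A flag here only marks an episode for review. Whether a dark or partly idle episode should be dropped, fixed, or kept remains the open question raised above.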

In this article, we introduce a lightweight open-source toolkit to find low-quality exampl…
