LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models
dev.to¡1d¡
Discuss: DEV
Flag this post

When Robots Stumble: The Hidden Flaws Behind Their Perfect Scores

Ever wondered why a robot that aces a test can still mess up in your living room? Researchers discovered that the latest vision‑language‑action (VLA) models, which seem to master object‑handling tasks, are actually walking on a tightrope. By shaking things up—changing camera angles, moving the robot’s starting pose, or dimming the lights—they found performance plummeting from near‑perfect to barely a third of the original score. Imagine a chef who can bake a flawless cake only when the kitchen lights are exactly right; the moment the bulb flickers, the recipe collapses. Surprisingly, the robots barely cared about the spoken instructions, acting almost as if they were deaf. This tells us that high benchmark num…

Similar Posts

Loading similar posts...