Detecting Non-Optimal Decisions of Embodied Agents via Diversity-Guided Metamorphic Testing
arxiv.org·3d
🔲Cellular Automata
Preview
Report Post

View PDF HTML (experimental)

Abstract:As embodied agents advance toward real-world deployment, ensuring optimal decisions becomes critical for resource-constrained applications. Current evaluation methods focus primarily on functional correctness, overlooking the non-functional optimality of generated plans. This gap can lead to significant performance degradation and resource waste. We identify and formalize the problem of Non-optimal Decisions (NoDs), where agents complete tasks successfully but inefficiently. We present NoD-DGMT, a systematic framework for detecting NoDs in embodied agent task planning via diversity-guided metamorphic testing. Our key insight is that optimal planners should exhibit in…

Similar Posts

Loading similar posts...