IPAB Workshop - 6/11/25
informatics.ed.ac.uk·4d
Flag this post

**Speaker: **Wanming Yu

Title: Preference-Based Long-Horizon Robotic Stacking with Multimodal Large Language Models

Abstract: Pretrained large language models (LLMs) can work as high-level robotic planners by reasoning over abstract task descriptions and natural language instructions, etc. However, they have shown a lack of knowledge and effectiveness in planning long-horizon robotic manipulation tasks where the physical properties of the objects are essential. An example is the stacking of containers with hidden objects inside, which involves reasoning over hidden physics properties such as weight and stability. To this end, this paper proposes to use multimodal LLMs as high-level planners for such long-horizon robotic stacking tasks. The LLM takes multimodal inputs for ea…

Similar Posts

Loading similar posts...