graphical interfaces, gesture interfaces, multimodal interactions, voice interactions
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding
arxiv.org·3d
Loading...Loading more...
graphical interfaces, gesture interfaces, multimodal interactions, voice interactions