Generating Dialogues from Egocentric Instructional Videos for Task Assistance: Dataset, Method and Benchmark
arxiv.org·3d
GhostObjects: Instructing Robots by Manipulating Spatially Aligned Virtual Twins in Augmented Reality
arxiv.org·3d
STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
arxiv.org·6d
Data-Driven Deepfake Image Detection Method -- The 2024 Global Deepfake Image Detection Challenge
arxiv.org·3d
GANDiff FR: Hybrid GAN Diffusion Synthesis for Causal Bias Attribution in Face Recognition
arxiv.org·3d
Deep Learning Enables Large-Scale Shape and Appearance Modeling in Total-Body DXA Imaging
arxiv.org·6d