Physics Question Scene Graph: Fine-grained Evaluation of Physical Plausibility in Text-to-Video Generation (opens in new tab)
Video generation models are increasingly capable of producing realistic videos, but they still struggle to generate videos that follow basic physical laws. Compounding this is a lack of reliable granular evaluation methods for localizing and specifying physical law violations in videos. We address this by introducing Physics Question Scene Graph (PQSG), a hierarchical question-based evaluation pipeline. PQSG evaluates generated videos by checkin...
Read the original article