WorldOlympiad: Can Your World Model Survive a Triathlon? (opens in new tab) 🍌Endurance Nutrition Content type: Academic
We introduce WorldOlympiad, a benchmark for diagnosing video-based world models across physical faithfulness, geometric consistency, and interaction fidelity. While existing benchmarks often focus on visual quality, semantic alignment, or short-term temporal coherence, they provide limited insight into whether generated videos obey physical rules, preserve coherent 3D structure, and sustain controllable interactions over long horizons. To addres...
Read the original article