SANA-WM in 5 quick facts (opens in new tab)
SANA-WM is worth watching for one reason: it combines longer video generation with explicit camera control. Five quick facts: It is an open-source 2.6B-parameter world model from NVIDIA Research. It targets minute-scale 720p video generation. It uses precise 6-DoF camera trajectories instead of only unconstrained motion. The paper reports single-GPU generation, with a distilled variant denoising a 60-second clip in about 34 seconds on one RTX 5090. Its benchmark reports roughly 36x higher thr...
Read the original article