Microsoft Research's Mirage gives video generation a persistent spatial memory that doesn't forget what's around the corner (opens in new tab)

Covers Latent Spatial Memory for Video World Models

Mirage, a video world model from Microsoft Research and several universities, stores scene information directly in latent space instead of pixel-based point clouds. That slashes compute time and graphics memory while keeping scenes spatially consistent through long camera moves. It still can't reliably track moving objects across segments. The article appeared first on .

Read the original article