ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints
paperium.net·11h·

Advancing Imaginative Video Generation with ImagerySearch

This article addresses a significant weakness in current video generation models: their performance degrades on imaginative scenarios that involve rarely co-occurring concepts and long-distance semantic relationships. To overcome this, the authors introduce ImagerySearch, a prompt-guided adaptive test-time search strategy. The approach dynamically adjusts both the inference search space and the reward function based on the semantic relationships within the prompt, enabling more coherent and visually plausible videos in complex settings. The research also presents LDT-Bench, the first dedicated benchmark for evaluating models on long-distance semantic prompts, alongside…
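To make the idea of adapting both the search space and the reward to the prompt more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a prompt has already been reduced to a single hypothetical `distance` score in [0, 1] (higher meaning the concepts co-occur more rarely), and every function name, weight, and budget value below is an illustrative assumption rather than something taken from the paper.

```python
import random
from typing import Any, Callable, Tuple

# One candidate: (candidate object, visual-quality score, prompt-alignment score).
Sample = Tuple[Any, float, float]


def _clamp01(x: float) -> float:
    """Clamp a value to the closed interval [0, 1]."""
    return min(max(x, 0.0), 1.0)


def candidate_budget(distance: float, base: int = 4, extra: int = 12) -> int:
    """Assumed rule: widen the search (more candidates) as the prompt's
    semantic distance grows. `base` and `extra` are made-up defaults."""
    return base + round(_clamp01(distance) * extra)


def adaptive_reward(visual: float, alignment: float, distance: float) -> float:
    """Assumed rule: shift reward weight toward prompt alignment for
    long-distance prompts. The 0.5 and 0.4 constants are illustrative."""
    w = 0.5 + 0.4 * _clamp01(distance)
    return (1.0 - w) * visual + w * alignment


def search(
    distance: float,
    sample: Callable[[random.Random], Sample],
    rng: random.Random,
) -> Tuple[Any, float, int]:
    """Best-of-N selection where both N and the reward depend on `distance`."""
    n = candidate_budget(distance)
    best, best_score = None, float("-inf")
    for _ in range(n):
        candidate, visual, alignment = sample(rng)
        score = adaptive_reward(visual, alignment, distance)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score, n


def toy_sampler(rng: random.Random) -> Sample:
    """Stand-in for a video generator plus scorers; returns random scores."""
    return f"video-{rng.randrange(10_000)}", rng.random(), rng.random()


if __name__ == "__main__":
    rng = random.Random(0)
    for d in (0.1, 0.9):
        best, score, n = search(d, toy_sampler, rng)
        print(f"distance={d}: tried {n} candidates, kept {best} ({score:.3f})")
```

In this sketch a low-distance prompt gets a small candidate pool and a balanced reward, while a high-distance prompt gets a larger pool and a reward that leans on alignment. A real system would replace `toy_sampler` with an actual generator and learned scorers, and would derive `distance` from the prompt itself.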
