SuperFlow: Training Flow Matching Models with RL on the Fly
arxiv.org·4d
🎨Graphics Programming
Preview
Report Post

Title:SuperFlow: Training Flow Matching Models with RL on the Fly

View PDF HTML (experimental)

Abstract:Recent progress in flow-based generative models and reinforcement learning (RL) has improved text-image alignment and visual quality. However, current RL training for flow models still has two main problems: (i) GRPO-style fixed per-prompt group sizes ignore variation in sampling importance across prompts, which leads to inefficient sampling and slower training; and (ii) trajectory-level advantages are reused as per-step estimates, which biases credit assignment along the flow. We propose SuperFlow, an RL training framework for flow-based models that adjusts group sizes with variance-aware samplin…

Similar Posts

Loading similar posts...