ProductConsistency: Improving Product Identity Preservation in Instruction-Based Image Editing via SFT and RL (opens in new tab)
Recent advances in instruction-based image editing have enabled models to perform complex visual edits from natural language instructions. However, in product-centric scenarios where preserving product features, branding, and textual elements are critical, current open and closed source models often struggle to maintain this fine-grained object identity. This issue is further compounded by the lack of datasets for instruction-based product image...
Read the original article