RefEdit, an instruction-based editing model trained on synthetic data, outperforms baselines in complex scene editing and referring expression tasks.
Despite recent advances in inversion and instruction-based image editing,
existing approaches primarily excel at editing single, prominent objects but
significantly struggle when applied to complex scenes containing multiple
entities. To quantify this gap, we first introduce RefEdit-Bench, a rigorous
real-world benchmark rooted in RefCOCO, where even baselines trained on
millions of samples perform poorly. To overcome this limitation, we introduce
RefEdit — an instruction-based editing model trained on data from our scalable
synthetic generation pipeline. Trained on only 20,000 editing triplets, RefEdit
outperforms Flux- and SD3-based baselines trained on millions of samples.
Extensive evaluations across various benchmarks demonstrate that our model not
only excels in referring expression tasks but also enhances performance on
traditional benchmarks, achieving state-of-the-art results comparable to
closed-source methods. We release data \& checkpoint for reproducibility.