We present Durian, the first method for generating portrait animation videos
with facial attribute transfer from a given reference image to a target
portrait in a zero-shot manner. To enable high-fidelity and spatially
consistent attribute transfer across frames, we introduce dual reference
networks that inject spatial features from both the portrait and attribute
images into the denoising process of a diffusion model. We train the model
using a self-reconstruction formulation, where two frames are sampled from the
same portrait video: one is treated as the attribute reference and the other as
the target portrait, and the remaining frames are reconstructed conditioned on
these inputs and their corresponding masks. To support the transfer of
attributes with varying spatial extent, we propose a training-time mask
expansion strategy based on keypoint-conditioned image generation. In
addition, we augment the attribute and portrait images with spatial and
appearance-level transformations to improve robustness to positional
misalignment between them. These strategies allow the model to effectively
generalize across diverse attributes and in-the-wild reference combinations,
despite being trained without explicit triplet supervision. Durian achieves
state-of-the-art performance on portrait animation with attribute transfer,
and, notably, its dual reference design enables multi-attribute composition in a
single generation pass without additional training.
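
To make the self-reconstruction formulation and reference augmentations concrete, the sketch below shows how such a training sample could be assembled. This is a hedged illustration only, not the authors' implementation: it assumes a portrait video and per-frame attribute masks are available as arrays, and all names (TrainingSample, build_sample, jitter) are hypothetical.

```python
# Minimal sketch (assumptions, not Durian's released code): two frames from
# the same portrait video serve as attribute and portrait references, and the
# remaining frames become the reconstruction target for the diffusion model.
import random
from dataclasses import dataclass

import numpy as np


@dataclass
class TrainingSample:
    portrait_ref: np.ndarray    # frame used as the target portrait reference
    attribute_ref: np.ndarray   # frame providing the attribute to transfer
    attribute_mask: np.ndarray  # mask covering the attribute region
    target_frames: np.ndarray   # remaining frames the model must reconstruct


def jitter(image: np.ndarray, max_shift: int = 16) -> np.ndarray:
    """Toy spatial augmentation: random translation to simulate
    positional misalignment between the two reference images."""
    dx, dy = (random.randint(-max_shift, max_shift) for _ in range(2))
    return np.roll(image, shift=(dy, dx), axis=(0, 1))


def build_sample(frames: np.ndarray, masks: np.ndarray) -> TrainingSample:
    """frames: (T, H, W, 3) portrait video; masks: (T, H, W) attribute masks."""
    t_attr, t_port = random.sample(range(len(frames)), 2)
    rest = [t for t in range(len(frames)) if t not in (t_attr, t_port)]
    return TrainingSample(
        portrait_ref=jitter(frames[t_port]),
        attribute_ref=jitter(frames[t_attr]),
        attribute_mask=masks[t_attr],
        target_frames=frames[rest],
    )
```

In the actual method, the two reference frames and their masks would condition the denoising process through the dual reference networks, while the reconstruction loss is computed on the remaining frames; the augmentation step above only gestures at the spatial transformations mentioned in the abstract.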