PickStyle: Video-to-Video Style Transfer With Context-Style Adapters - Takara TLDR

We address the task of video style transfer with diffusion models, where the
goal is to preserve the context of an input video while rendering it in a
target style specified by a text prompt. A major challenge is the lack of
paired video data for supervision. We propose PickStyle, a video-to-video style
transfer framework that augments pretrained video diffusion backbones with
style adapters and benefits from paired still image data with source-style
correspondences for training. PickStyle inserts low-rank adapters into the
self-attention layers of conditioning modules, enabling efficient
specialization for motion-style transfer while maintaining strong alignment
between video content and style. To bridge the gap between static image
supervision and dynamic video, we construct synthetic training clips from
paired images by applying shared augmentations that simulate camera motion,
ensuring temporal priors are preserved. In addition, we introduce Context-Style
Classifier-Free Guidance (CS-CFG), a novel factorization of classifier-free
guidance into independent text (style) and video (context) directions. CS-CFG
ensures that context is preserved in generated video while the style is
effectively transferred. Experiments across benchmarks show that our approach
achieves temporally coherent, style-faithful, and content-preserving video
translations, outperforming existing baselines both qualitatively and
quantitatively.

Source link

What's Hot

Tencent unveils new AI model ‘Hunyuan T1’ that rivals DeepSeek R1 in performance and price

Perplexity Comet Vs Google Chrome — Should You Switch To An AI Browser?

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR

PickStyle: Video-to-Video Style Transfer with Context-Style Adapters – Takara TLDR

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment – Takara TLDR

GCPO: When Contrast Fails, Go Gold – Takara TLDR

Smithsonian Closes Museums Amid Government Shutdown

The Rubin Names 2025 Art Prize, Research and Art Projects Grants

Kochi-Muziris Biennial Announces 66 Artists for December Exhibition

Instagram Launches ‘Rings’ Awards for Creators—With KAWS as a Judge

Tencent unveils new AI model ‘Hunyuan T1’ that rivals DeepSeek R1 in performance and price

Perplexity Comet Vs Google Chrome — Should You Switch To An AI Browser?

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs – Takara TLDR

What's Hot

PickStyle: Video-to-Video Style Transfer with Context-Style Adapters – Takara TLDR

Related Posts

Subscribe to Updates