Frame Guidance is a training-free method for controlling video generation with frame-level signals; it sharply reduces memory usage and produces globally coherent videos.
Advancements in diffusion models have significantly improved video quality,
directing attention to fine-grained controllability. However, many existing
methods depend on fine-tuning large-scale video models for specific tasks,
which becomes increasingly impractical as model sizes continue to grow. In this
work, we present Frame Guidance, a training-free guidance method for controllable
video generation based on frame-level signals, such as keyframes, style
reference images, sketches, or depth maps. For practical training-free
guidance, we propose a simple latent processing method that dramatically
reduces memory usage, and apply a novel latent optimization strategy designed
for globally coherent video generation. Frame Guidance enables effective
control across diverse tasks, including keyframe guidance, stylization, and
looping, without any training, and is compatible with any video model. Experimental
results show that Frame Guidance can produce high-quality controlled videos for
a wide range of tasks and input signals.
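To make the core idea concrete, the sketch below shows one plausible form of training-free, frame-level guidance: at a denoising step, decode the predicted clean latent for the guided frame, measure its distance to the frame-level signal, and take a gradient step on the video latents. This is a minimal illustration under stated assumptions; the loss, the decoder interface, and the update rule here are placeholders, not the paper's exact latent processing or optimization strategy.

```python
import torch

def frame_guided_step(latents, denoise_fn, decode_fn, target_frame,
                      frame_idx, guidance_scale=1.0):
    """One guided denoising step (illustrative, not the paper's algorithm).

    latents      : (T, C, H, W) video latents at the current noise level
    denoise_fn   : maps noisy latents -> predicted clean latents (x0 estimate)
    decode_fn    : maps one latent frame -> pixel space
    target_frame : frame-level control signal (e.g., a keyframe)
    frame_idx    : index of the frame the signal applies to
    """
    latents = latents.detach().requires_grad_(True)
    x0 = denoise_fn(latents)              # predicted clean latents
    frame = decode_fn(x0[frame_idx])      # decode only the guided frame
    loss = torch.nn.functional.mse_loss(frame, target_frame)
    (grad,) = torch.autograd.grad(loss, latents)
    # A gradient step on the latents steers the whole clip toward the signal.
    return (latents - guidance_scale * grad).detach()

# Toy usage with identity stand-ins for the denoiser and decoder.
T, C, H, W = 8, 4, 16, 16
z = torch.randn(T, C, H, W)
target = torch.zeros(C, H, W)
z = frame_guided_step(z, lambda x: x, lambda f: f, target, frame_idx=0)
```

Because the gradient is taken with respect to all video latents, guidance applied to a single frame propagates through the denoiser to the rest of the clip, which is what allows frame-level signals to shape the video globally.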