Recent advances in video generation have enabled high-fidelity video
synthesis from user-provided prompts. However, existing models and benchmarks
fail to capture the complexity and requirements of professional video
generation. To address this gap, we introduce Stable Cinemetrics (SCINE), a
structured evaluation framework that formalizes filmmaking controls into four
disentangled, hierarchical taxonomies: Setup, Event, Lighting, and Camera.
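For illustration only, such disentangled, hierarchical taxonomies can be thought of as trees of control nodes, one tree per dimension. The minimal Python sketch below makes this concrete; the node names are hypothetical placeholders and do not correspond to the controls actually defined by Stable Cinemetrics.

```python
from dataclasses import dataclass, field

@dataclass
class ControlNode:
    """One control in a taxonomy, optionally refined into finer sub-controls."""
    name: str
    children: list["ControlNode"] = field(default_factory=list)

def count_nodes(node: ControlNode) -> int:
    """Count all control nodes in a subtree (the full framework defines 76)."""
    return 1 + sum(count_nodes(child) for child in node.children)

# Placeholder hierarchy only: the real node names are grounded in industry practice.
camera = ControlNode("Camera", [
    ControlNode("Movement", [ControlNode("pan"), ControlNode("dolly")]),
    ControlNode("Framing", [ControlNode("close-up"), ControlNode("wide shot")]),
])
taxonomies = {t.name: t for t in (
    ControlNode("Setup"), ControlNode("Event"), ControlNode("Lighting"), camera,
)}
```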
Together, these taxonomies define 76 fine-grained control nodes grounded in
industry practices. Using these taxonomies, we construct a benchmark of prompts
aligned with professional use cases and develop an automated pipeline for
prompt categorization and question generation, enabling independent evaluation
of each control dimension. We conduct a large-scale human study spanning 10+
models and 20K videos, annotated by a pool of 80+ film professionals. Our
analysis, both coarse- and fine-grained, reveals that even the strongest current
models exhibit significant gaps, particularly in Events and Camera-related
controls. To enable scalable evaluation, we train an automatic evaluator, a
vision-language model aligned with expert annotations that outperforms existing
zero-shot baselines. SCINE is the first approach to situate professional video
generation within the landscape of video generative models, introducing
taxonomies centered around cinematic controls and supporting them with
structured evaluation pipelines and detailed analyses to guide future research.
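Purely as an illustrative sketch (not the released SCINE pipeline or evaluator), the independent per-dimension evaluation described above can be framed as question answering over a video: questions generated for each control dimension are answered by a vision-language model and aggregated into per-taxonomy scores. All names and interfaces below are hypothetical placeholders.

```python
from typing import Protocol

class VideoQAModel(Protocol):
    """Any model that can answer a yes/no question about a video (hypothetical interface)."""
    def answer(self, video_path: str, question: str) -> bool: ...

def score_video(model: VideoQAModel,
                video_path: str,
                questions_by_taxonomy: dict[str, list[str]]) -> dict[str, float]:
    """Aggregate yes/no answers into a score in [0, 1] per control dimension.

    `questions_by_taxonomy` maps a taxonomy name (Setup, Event, Lighting,
    Camera) to the prompt-specific questions generated for that dimension,
    so each dimension is scored independently of the others.
    """
    scores: dict[str, float] = {}
    for taxonomy, questions in questions_by_taxonomy.items():
        if not questions:
            continue
        passed = sum(model.answer(video_path, q) for q in questions)
        scores[taxonomy] = passed / len(questions)
    return scores
```

In the paper, the automatic evaluator itself is a vision-language model aligned with expert annotations; the interface above only indicates where such a model would plug in.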