We introduce StyleMM, a novel framework for constructing a stylized 3D
Morphable Model (3DMM) from user-defined text descriptions that specify a
target style. Building on a mesh deformation network and a texture generator
pre-trained for realistic human faces of the original 3DMM, our approach
fine-tunes both models on stylized facial images generated via text-guided
image-to-image (i2i) translation with a diffusion model; these images serve as
stylization targets for the rendered meshes.
To prevent undesired changes in identity, facial alignment, or expressions
during i2i translation, we introduce a stylization method that explicitly
preserves the facial attributes of the source image.
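
As a rough illustration of how such attribute-aware stylization targets could
be produced, the sketch below runs text-guided i2i translation with an
off-the-shelf diffusion pipeline from Hugging Face `diffusers`. The model ID,
prompt, input file, and the use of a moderate `strength` as a crude proxy for
explicit attribute preservation are illustrative assumptions, not the method
described in the paper.

```python
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

# Hypothetical setup: any diffusion model with an img2img mode could be
# substituted; the model ID, prompt, and file names are illustrative only.
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

source = Image.open("rendered_face.png").convert("RGB").resize((512, 512))
stylized = pipe(
    prompt="portrait in the style of a hand-painted fantasy character",
    image=source,
    strength=0.45,       # lower strength keeps the output closer to the source
    guidance_scale=7.5,  # face (a crude stand-in for attribute preservation)
).images[0]
stylized.save("stylization_target.png")
```
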
By maintaining these critical attributes during image stylization, the
proposed approach ensures consistent 3D style transfer across the 3DMM
parameter space through image-based training.
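
The image-based fine-tuning described above might look roughly like the
following sketch. The network architectures, parameter dimensions, renderer,
and loss are placeholders invented for illustration; in practice the mesh
would be rasterized with a differentiable renderer and each rendering paired
with its own i2i-stylized target.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Placeholder stand-ins for the pre-trained networks; the real architectures
# and dimensions are not specified in the abstract.
class MeshDeformNet(nn.Module):
    """Maps 3DMM shape/expression parameters to per-vertex offsets."""
    def __init__(self, n_params=300, n_verts=5023):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(n_params, 256), nn.ReLU(),
                                 nn.Linear(256, n_verts * 3))
    def forward(self, p):
        return self.mlp(p).view(p.shape[0], -1, 3)

class TextureGenerator(nn.Module):
    """Maps a texture code to a (tiny) UV texture map."""
    def __init__(self, n_params=100, res=64):
        super().__init__()
        self.res = res
        self.fc = nn.Linear(n_params, 3 * res * res)
    def forward(self, z):
        return torch.sigmoid(self.fc(z)).view(z.shape[0], 3, self.res, self.res)

def render(vertex_offsets, texture):
    # Stand-in for a differentiable renderer (e.g. PyTorch3D or nvdiffrast);
    # returning the texture keeps the example runnable end to end.
    return texture

deform_net, tex_gen = MeshDeformNet(), TextureGenerator()
optimizer = torch.optim.Adam(
    list(deform_net.parameters()) + list(tex_gen.parameters()), lr=1e-4)

for step in range(100):
    shape_params = torch.randn(4, 300)   # sampled shape/expression parameters
    tex_params = torch.randn(4, 100)     # sampled texture parameters
    rendering = render(deform_net(shape_params), tex_gen(tex_params))
    # In the described pipeline each target would be the i2i-stylized version
    # of the realistic rendering for the same parameters; random data keeps
    # this toy example self-contained.
    target = torch.rand(4, 3, 64, 64)
    loss = F.l1_loss(rendering, target)  # image-space stylization loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```
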
Once trained, StyleMM enables feed-forward generation of stylized face meshes
with explicit control over shape, expression, and texture parameters,
producing meshes with consistent vertex connectivity and animatability.
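
As a toy illustration of why shared vertex connectivity makes the generated
meshes animatable, the snippet below linearly blends two hypothetical
expression states of one stylized mesh and writes OBJ frames against a single
fixed face list; the vertex count, offsets, and faces are made up for
demonstration.

```python
import numpy as np

# Because every generated mesh shares the same connectivity, per-frame vertex
# positions can be blended and exported against one fixed face list.
n_verts = 5023                                   # assumed vertex count
verts_neutral = np.random.rand(n_verts, 3)       # stand-in for a neutral mesh
verts_smile = verts_neutral + 0.01 * np.random.randn(n_verts, 3)
faces = np.array([[0, 1, 2], [2, 3, 0]])         # shared (truncated) face list

def write_obj(path, verts, faces):
    with open(path, "w") as f:
        for v in verts:
            f.write(f"v {v[0]:.6f} {v[1]:.6f} {v[2]:.6f}\n")
        for tri in faces:
            f.write(f"f {tri[0] + 1} {tri[1] + 1} {tri[2] + 1}\n")  # 1-indexed

for i, t in enumerate(np.linspace(0.0, 1.0, 5)):
    frame = (1.0 - t) * verts_neutral + t * verts_smile  # linear blend
    write_obj(f"frame_{i:02d}.obj", frame, faces)
```
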
Quantitative and qualitative evaluations demonstrate that our approach
outperforms state-of-the-art methods in terms of identity-level facial
diversity and stylization capability. The
code and videos are available at
[kwanyun.github.io/stylemm_page](kwanyun.github.io/stylemm_page).