Generating realistic and controllable 3D human avatars is a long-standing
challenge, particularly when covering broad attribute ranges such as ethnicity,
age, clothing styles, and detailed body shapes. Capturing and annotating large-scale human datasets for training generative models is prohibitively expensive, and existing datasets remain limited in scale and diversity. The central question we address
in this paper is: Can existing foundation models be distilled to generate
theoretically unbounded, richly annotated 3D human data? We introduce
InfiniHuman, a framework that synergistically distills these models to produce
richly annotated human data at minimal cost and with theoretically unlimited
scalability. We propose InfiniHumanData, a fully automatic pipeline that
leverages vision-language and image generation models to create a large-scale
multi-modal dataset. A user study shows that our automatically generated identities are indistinguishable from scan renderings. InfiniHumanData contains 111K
identities spanning unprecedented diversity. Each identity is annotated with
multi-granularity text descriptions, multi-view RGB images, detailed clothing
images, and SMPL body-shape parameters. Building on this dataset, we propose
InfiniHumanGen, a diffusion-based generative pipeline conditioned on text, body
shape, and clothing assets. InfiniHumanGen enables fast, realistic, and
precisely controllable avatar generation. Extensive experiments demonstrate
significant improvements over state-of-the-art methods in visual quality,
generation speed, and controllability. Our approach offers a practical and affordable solution for high-quality avatar generation with fine-grained control at effectively unbounded scale. We will publicly release the automatic data
generation pipeline, the comprehensive InfiniHumanData dataset, and the
InfiniHumanGen models at https://yuxuan-xue.com/infini-human.