Paper page - From One to More: Contextual Part Latents for 3D Generation

A part-aware diffusion framework, CoPart, enhances 3D generation by decomposing objects into contextual parts, improving complexity handling, relationship modeling, and part-level conditioning.

Recent advances in 3D generation have transitioned from multi-view 2D
rendering approaches to 3D-native latent diffusion frameworks that exploit
geometric priors in ground truth data. Despite progress, three key limitations
persist: (1) Single-latent representations fail to capture complex multi-part
geometries, causing detail degradation; (2) Holistic latent coding neglects
part independence and interrelationships critical for compositional design; (3)
Global conditioning mechanisms lack fine-grained controllability. Inspired by
human 3D design workflows, we propose CoPart – a part-aware diffusion framework
that decomposes 3D objects into contextual part latents for coherent multi-part
generation. This paradigm offers three advantages: i) Reduces encoding
complexity through part decomposition; ii) Enables explicit part relationship
modeling; iii) Supports part-level conditioning. We further develop a mutual
guidance strategy to fine-tune pre-trained diffusion models for joint part
latent denoising, ensuring both geometric coherence and foundation model
priors. To enable large-scale training, we construct Partverse – a novel 3D
part dataset derived from Objaverse through automated mesh segmentation and
human-verified annotations. Extensive experiments demonstrate CoPart’s superior
capabilities in part-level editing, articulated object generation, and scene
composition with unprecedented controllability.

Source link

What's Hot

AI’s fourth wave is here — are enterprises ready for what’s next?

Malaysia will require trade permits for U.S. AI chips

NVIDIA’s New AI: Impossible Weather Graphics!

Paper page – From One to More: Contextual Part Latents for 3D Generation

One Token to Fool LLM-as-a-Judge

Paper page – BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity

Paper page – Robust Multimodal Large Language Models Against Modality Conflict

Murujuga Rock Art in Australia Receives UNESCO World Heritage Status

Homeland Security Targets Chicago’s National Museum of Puerto Rican Arts & Culture

1,600-Year-Old Tomb of Mayan City’s Founding King Discovered in Belize

Centre Pompidou Cancels Caribbean Art Show, Raising Controversy

AI’s fourth wave is here — are enterprises ready for what’s next?

Malaysia will require trade permits for U.S. AI chips

NVIDIA’s New AI: Impossible Weather Graphics!

What's Hot

Paper page – From One to More: Contextual Part Latents for 3D Generation

Related Posts

Subscribe to Updates