Local editing of specified regions in 3D models is crucial for the game
industry and robotic interaction. Recent methods typically edit rendered
multi-view images and then reconstruct the 3D model, but they struggle to
precisely preserve unedited regions and to maintain overall coherence.
Inspired by structured 3D generative
models, we propose VoxHammer, a novel training-free approach that performs
precise and coherent editing in 3D latent space. Given a 3D model, VoxHammer
first predicts its inversion trajectory and obtains its inverted latents and
key-value tokens at each timestep. Subsequently, in the denoising and editing
phase, we replace the denoising features of preserved regions with the
corresponding inverted latents and cached key-value tokens. Retaining these
contextual features ensures consistent reconstruction of
preserved areas and coherent integration of edited parts.
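To make the two-phase procedure concrete, below is a minimal PyTorch-style sketch of the latent-replacement idea. It is written under our own illustrative assumptions: `model.invert_step` and `model.denoise_step` are hypothetical stand-ins for the structured 3D generator's deterministic inversion and denoising steps, and the caching and injection of attention key-value tokens, which the method also performs, is noted in a comment but not implemented here.

```python
import torch

def voxhammer_edit_sketch(model, x0_latent, edit_mask, timesteps):
    """Two-phase edit: invert to noise, then denoise with masked replacement.

    model      -- a latent diffusion denoiser with a deterministic
                  (DDIM-style) inversion step; this interface is hypothetical
    x0_latent  -- latent encoding of the input 3D model, shape (tokens, dim)
    edit_mask  -- bool tensor of shape (tokens,), True inside the edit region
    timesteps  -- diffusion schedule ordered from clean to noisy
    """
    # Phase 1: inversion. Record the inverted latent at every timestep.
    # (The full method also caches attention key-value tokens here and
    # injects them during denoising; this sketch only replaces latents.)
    inverted = {}
    z = x0_latent
    for t in timesteps:                         # clean -> noisy
        z = model.invert_step(z, t)             # hypothetical inversion step
        inverted[t] = z

    # Phase 2: denoise from the noisiest inverted latent under the edited
    # condition. After each step, overwrite preserved-region tokens with the
    # cached inverted latent at the matching noise level, so unedited
    # regions retrace the inversion trajectory.
    ts = list(reversed(timesteps))              # noisy -> clean
    z = inverted[ts[0]]
    for t, t_next in zip(ts, ts[1:]):
        z = model.denoise_step(z, t)            # hypothetical denoising step
        z = torch.where(edit_mask.unsqueeze(-1), z, inverted[t_next])
    z = model.denoise_step(z, ts[-1])           # final step to the clean latent
    return torch.where(edit_mask.unsqueeze(-1), z, x0_latent)
```

Because both phases follow the same deterministic schedule, the replaced tokens exactly retrace the inversion trajectory, which is why the preserved regions reconstruct consistently while the edited region is free to change.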
To evaluate the consistency of preserved regions, we construct Edit3D-Bench, a
human-annotated dataset comprising hundreds of samples, each with carefully
labeled 3D editing regions. Experiments demonstrate that VoxHammer
significantly outperforms existing methods in terms of both 3D consistency of
preserved regions and overall quality. Our method holds promise for
synthesizing high-quality paired editing data, thereby laying a data
foundation for in-context 3D generation. See our project page at
https://huanngzh.github.io/VoxHammer-Page/.