Snap-Snap: Taking Two Images To Reconstruct 3D Human Gaussians In Milliseconds - Takara TLDR

Reconstructing 3D human bodies from sparse views is an appealing topic that is
crucial for broadening related applications. In this paper, we propose
a quite challenging but valuable task to reconstruct the human body from only
two images, i.e., the front and back view, which can largely lower the barrier
for users to create their own 3D digital humans. The main challenges lie in the
difficulty of building 3D consistency and recovering missing information from
the highly sparse input. We redesign a geometry reconstruction model based on
foundation reconstruction models so that, after training on extensive human
data, it predicts consistent point clouds even when the input images have
scarce overlap. Furthermore, an
enhancement algorithm is applied to supplement the missing color information,
and then the complete human point clouds with colors can be obtained, which are
directly transformed into 3D Gaussians for better rendering quality.
Experiments show that our method can reconstruct the entire human in 190 ms on
a single NVIDIA RTX 4090, with two images at a resolution of 1024×1024,
demonstrating state-of-the-art performance on the THuman2.0 and cross-domain
datasets. Additionally, our method can complete human reconstruction even with
images captured by low-cost mobile devices, reducing the requirements for data
collection. Demos and code are available at
https://hustvl.github.io/Snap-Snap/.
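
The last step described above, turning a colored point cloud into 3D Gaussians, can be pictured with a minimal sketch. This is not the authors' implementation: the function name `points_to_gaussians`, the isotropic scale taken from the nearest-neighbour distance, the fixed opacity, and the identity rotation are all assumptions chosen for illustration, and the code uses only the Python standard library.

```python
import math


def points_to_gaussians(points, colors, opacity=0.9, min_scale=1e-4):
    """Map each colored point to one isotropic Gaussian (illustrative only).

    Hypothetical helper: the parameter layout and the scale heuristic are
    assumptions, not taken from the Snap-Snap paper or code.
    """
    if len(points) != len(colors):
        raise ValueError("points and colors must have the same length")

    gaussians = []
    for i, (p, c) in enumerate(zip(points, colors)):
        # Brute-force nearest-neighbour distance, O(N^2); fine for a toy cloud.
        nearest = min(
            (math.dist(p, q) for j, q in enumerate(points) if j != i),
            default=min_scale,  # a single-point cloud has no neighbour
        )
        scale = max(nearest, min_scale)
        gaussians.append({
            "mean": tuple(p),                  # Gaussian centre = point position
            "scale": (scale, scale, scale),    # isotropic extent (assumption)
            "rotation": (1.0, 0.0, 0.0, 0.0),  # identity quaternion (w, x, y, z)
            "opacity": opacity,                # fixed opacity (assumption)
            "color": tuple(c),                 # RGB carried over from the point
        })
    return gaussians


# Toy input: three points with one RGB color each.
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]
colors = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
gaussians = points_to_gaussians(points, colors)
```

Each point yields exactly one Gaussian, so no per-scene optimization is involved in this sketch; a real pipeline would likely predict or refine scales, rotations, and opacities rather than fix them.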
