SPATIALGEN: Layout-guided 3D Indoor Scene Generation - Takara TLDR

Creating high-fidelity 3D models of indoor environments is essential for
applications in design, virtual reality, and robotics. However, manual 3D
modeling remains time-consuming and labor-intensive. While recent advances in
generative AI have enabled automated scene synthesis, existing methods often
face challenges in balancing visual quality, diversity, semantic consistency,
and user control. A major bottleneck is the lack of a large-scale, high-quality
dataset tailored to this task. To address this gap, we introduce a
comprehensive synthetic dataset, featuring 12,328 structured annotated scenes
with 57,440 rooms, and 4.7M photorealistic 2D renderings. Leveraging this
dataset, we present SpatialGen, a novel multi-view multi-modal diffusion model
that generates realistic and semantically consistent 3D indoor scenes. Given a
3D layout and a reference image (derived from a text prompt), our model
synthesizes appearance (color image), geometry (scene coordinate map), and
semantic (semantic segmentation map) from arbitrary viewpoints, while
preserving spatial consistency across modalities. SpatialGen consistently
generates superior results to previous methods in our experiments. We are
open-sourcing our data and models to empower the community and advance the
field of indoor scene understanding and generation.

Source link

What's Hot

Meta is making its Llama AI models available to more governments in Europe and Asia

Claude : The Ultimate AI Tool for Smarter Workflows & Automations

Nvidia Invests in OpenAI With $100 Billion to Build Out More AI Data Centers

SPATIALGEN: Layout-guided 3D Indoor Scene Generation – Takara TLDR

VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models – Takara TLDR

ByteWrist: A Parallel Robotic Wrist Enabling Flexible and Anthropomorphic Motion for Confined Spaces – Takara TLDR

MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interaction – Takara TLDR

Court Rules ‘Gender Ideology’ Ban on Art Endowments Unconstitutional

Rural Danish Art Museum Acquires Painting By Artemisia Gentileschi

Dan Nadel Is Expanding American Art History, One Outlier at a Time

St. Patrick’s Cathedral Unveils Monumental Mural by Adam Cvijanovic

Meta is making its Llama AI models available to more governments in Europe and Asia

Claude : The Ultimate AI Tool for Smarter Workflows & Automations

Nvidia Invests in OpenAI With $100 Billion to Build Out More AI Data Centers

What's Hot

SPATIALGEN: Layout-guided 3D Indoor Scene Generation – Takara TLDR

Related Posts

Subscribe to Updates