In this work, we propose DiT360, a DiT-based framework that performs hybrid
training on perspective and panoramic data for panoramic image generation. We
attribute the difficulty of maintaining geometric fidelity and photorealism in
generated panoramas primarily to the scarcity of large-scale, high-quality,
real-world panoramic data; this data-centric view distinguishes our approach
from prior methods that focus on model design. Concretely, DiT360 comprises
several key modules
for inter-domain transformation and intra-domain augmentation, applied at both
the pre-VAE image level and the post-VAE token level. At the image level, we
incorporate cross-domain knowledge through perspective image guidance and
panoramic refinement, which enhance perceptual quality while regularizing
diversity and photorealism. At the token level, hybrid supervision is applied
across multiple modules: circular padding for boundary continuity, yaw loss
for rotational robustness, and cube loss for distortion awareness (the first
two are sketched below).
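As a minimal sketch of the first two token-level operations, assuming the
post-VAE latents are (B, C, H, W) tensors whose width axis spans longitude
(helper names are hypothetical and this is not the released implementation):

```python
import torch
import torch.nn.functional as F

def circular_pad(z: torch.Tensor, pad: int) -> torch.Tensor:
    """Wrap the width (longitude) axis so the left/right seam of the
    equirectangular latent stays continuous across the boundary."""
    return F.pad(z, (pad, pad, 0, 0), mode="circular")

def yaw_roll(z: torch.Tensor, shift: int) -> torch.Tensor:
    """Roll the latent horizontally; for an equirectangular layout this
    corresponds to a yaw rotation of the panorama."""
    return torch.roll(z, shifts=shift, dims=-1)

# Assumed form of a yaw-consistency objective: predictions should be
# equivariant to a random yaw shift s, e.g.
#   loss_yaw = F.mse_loss(model(yaw_roll(z, s)), yaw_roll(model(z), s))
```

The cube loss would additionally require projecting the equirectangular
latent onto cubemap faces before computing supervision, which is omitted from
this sketch.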
Extensive experiments on text-to-panorama, inpainting, and outpainting tasks
demonstrate that our method achieves better boundary consistency and image
fidelity across eleven quantitative metrics. Our code is
available at https://github.com/Insta360-Research-Team/DiT360.