DreamCube: 3D Panorama Generation via Multi-plane Synchronization

Research Paper
Yukun Huang 1     Yanning Zhou 2     Jianan Wang 3     Kaiyi Huang 1     Xihui Liu 1
1 The University of Hong Kong     2 Tencent     3 Astribot    
Teaser image.

We introduce Multi-plane Synchronization to adapt 2D diffusion models for multi-plane panoramic representations, which can be applied to different tasks including RGB-D panorama generation, panorama depth estimation, and 3D scene generation.

Overview

We demonstrate a non-training strategy termed Multi-plane Synchronization, which enables pre-trained 2D diffusion models to generate seam-continuous multi-plane panoramic representations like cube maps and sky boxes. This strategy works by extending the model's 2D spatial operators to the omnidirectional image domain, making them omnidirectionally translation-equivalent.

Based on this strategy, we further present DreamCube, a diffusion model for RGB-D cube map generation from single-view input. Benefiting from cube map representations and synchronization strategies, DreamCube maximizes the reuse of 2D diffusion model weights, achieving high-quality appearance and accurate geometry generation. Extensive experiments the effectiveness of our approach in panoramic image generation, panoramic depth estimation, and 3D scene generation.

What's New

📢 2025-06: Paper and project page released!

Methodology

Multi-plane Synchronization

Synchronizing different spatial operators (attentions, 2d convs, group norms) of diffusion U-Net and VAE enables seam-continuous cube map generation without fine-tuning.

DreamCube framework.

DreamCube: An RGB-D Cubemap Generation Framework

Training and inference framework of DreamCube for RGB-D cube map generation:

DreamCube framework.

Results

Versatility of Multi-plane Synchronization

Results of Multi-plane Synchronization on existing pre-trained 2D diffusion models, including SD2, SDXL, and Marigold:

DreamCube framework.

RGB-D Cubemap Generation

Out-of-domain RGB-D panorama generation from single view inputs:

DreamCube framework.

Interactive Demo



Your browser does not support the HTML5 canvas element.

Your browser does not support the HTML5 canvas element.


Free angle observation. Try using the mouse to drag and rotate the view (Left: RGB, Right: Euclidean Depth).


Selected Image Selected Depth

Panoramic overview (Left: RGB, Right: Euclidean Depth).

BibTeX

@article{huang2025dreamcube,
  title={{DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization}},
  author={Huang, Yukun and Zhou, Yanning and Wang, Jianan and Huang, Kaiyi and Liu, Xihui},
  year={2025},
  eprint={},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
}