Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane
| dc.contributor.author | Yan, Han | en |
| dc.contributor.author | Li, Yang | en |
| dc.contributor.author | Wu, Zhennan | en |
| dc.contributor.author | Chen, Shenzhou | en |
| dc.contributor.author | Sun, Weixuan | en |
| dc.contributor.author | Shang, Taizhang | en |
| dc.contributor.author | Liu, Weizhe | en |
| dc.contributor.author | Chen, Tian | en |
| dc.contributor.author | Dai, Xiaqiang | en |
| dc.contributor.author | Ma, Chao | en |
| dc.contributor.author | Li, Hongdong | en |
| dc.contributor.author | Ji, Pan | en |
| dc.date.accessioned | 2025-05-23T02:22:12Z | |
| dc.date.available | 2025-05-23T02:22:12Z | |
| dc.date.issued | 2024-12-03 | en |
| dc.description.abstract | We present Frankenstein, a diffusion-based framework that can generate semantic-compositional 3D scenes in a single pass. Unlike existing methods that output a single, unified 3D shape, Frankenstein simultaneously generates multiple separated shapes, each corresponding to a semantically meaningful part. The 3D scene information is encoded in a single tri-plane tensor, from which multiple Signed Distance Function (SDF) fields can be decoded to represent the compositional shapes. During training, an auto-encoder compresses tri-planes into a latent space, and a denoising diffusion process is then employed to approximate the distribution of the compositional scenes. Frankenstein demonstrates promising results in generating room interiors as well as human avatars with automatically separated parts. The generated scenes facilitate many downstream applications, such as part-wise re-texturing, object rearrangement in a room, or avatar cloth re-targeting. | en |
| dc.description.sponsorship | This work was supported in part by NSFC (62322113, 62376156) and Shanghai Municipal Science and Technology Major Project (2021SHZDZX0102). | en |
| dc.description.status | Peer-reviewed | en |
| dc.identifier.isbn | 9798400711312 | en |
| dc.identifier.other | ORCID:/0000-0003-4125-1554/work/184100031 | en |
| dc.identifier.scopus | 85217101369 | en |
| dc.identifier.uri | http://www.scopus.com/inward/record.url?scp=85217101369&partnerID=8YFLogxK | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733750817 | |
| dc.language.iso | en | en |
| dc.publisher | Association for Computing Machinery (ACM) | en |
| dc.relation.ispartof | Proceedings - SIGGRAPH Asia 2024 Conference Papers, SA 2024 | en |
| dc.relation.ispartofseries | 2024 SIGGRAPH Asia 2024 Conference Papers, SA 2024 | en |
| dc.relation.ispartofseries | Proceedings - SIGGRAPH Asia 2024 Conference Papers, SA 2024 | en |
| dc.rights | Publisher Copyright: © 2024 Copyright held by the owner/author(s). | en |
| dc.subject | 3D Scene Generation | en |
| dc.subject | Diffusion Model | en |
| dc.subject | Semantic Composition | en |
| dc.title | Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane | en |
| dc.type | Conference paper | en |
| dspace.entity.type | Publication | en |
| local.contributor.affiliation | Yan, Han; Shanghai Jiao Tong University | en |
| local.contributor.affiliation | Li, Yang; Tencent | en |
| local.contributor.affiliation | Wu, Zhennan; The University of Tokyo | en |
| local.contributor.affiliation | Chen, Shenzhou; Tencent | en |
| local.contributor.affiliation | Sun, Weixuan; Tencent | en |
| local.contributor.affiliation | Shang, Taizhang; Tencent | en |
| local.contributor.affiliation | Liu, Weizhe; Tencent | en |
| local.contributor.affiliation | Chen, Tian; Tencent | en |
| local.contributor.affiliation | Dai, Xiaqiang; Tencent | en |
| local.contributor.affiliation | Ma, Chao; Shanghai Jiao Tong University | en |
| local.contributor.affiliation | Li, Hongdong; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.contributor.affiliation | Ji, Pan; Tencent | en |
| local.identifier.doi | 10.1145/3680528.3687672 | en |
| local.identifier.pure | 8bd2af28-fa9e-4e0c-a262-b1db4a3f35f4 | en |
| local.identifier.url | https://www.scopus.com/pages/publications/85217101369 | en |
| local.type.status | Published | en |
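
The abstract's central idea, one tri-plane tensor from which several SDF fields are decoded, can be illustrated with a minimal sketch. This is not the authors' implementation: the plane resolution, the channel count, the concatenation of the three plane features, and the linear decoder with random weights are all assumptions made for illustration, and every function name below is hypothetical. The only standard fact used is that the union of shapes given by SDFs is the pointwise minimum of those SDFs.

```python
import math
import random


def bilinear_sample(plane, u, v):
    """Bilinearly sample a feature vector from plane[row][col] at (u, v) in [-1, 1]."""
    res = len(plane)
    # Map [-1, 1] to continuous grid coordinates and clamp to the grid.
    x = min(max((u + 1.0) * 0.5 * (res - 1), 0.0), res - 1.0)
    y = min(max((v + 1.0) * 0.5 * (res - 1), 0.0), res - 1.0)
    x0 = min(int(math.floor(x)), res - 2)
    y0 = min(int(math.floor(y)), res - 2)
    tx, ty = x - x0, y - y0
    out = []
    for c in range(len(plane[0][0])):
        top = plane[y0][x0][c] * (1 - tx) + plane[y0][x0 + 1][c] * tx
        bot = plane[y0 + 1][x0][c] * (1 - tx) + plane[y0 + 1][x0 + 1][c] * tx
        out.append(top * (1 - ty) + bot * ty)
    return out


def query_part_sdfs(triplane, weights, biases, point):
    """Return one SDF value per semantic part at a 3D point.

    Assumption: features from the XY, XZ and YZ planes are concatenated and
    passed through a linear decoder with one output row per part.
    """
    x, y, z = point
    feats = (bilinear_sample(triplane["xy"], x, y)
             + bilinear_sample(triplane["xz"], x, z)
             + bilinear_sample(triplane["yz"], y, z))
    return [sum(w * f for w, f in zip(row, feats)) + b
            for row, b in zip(weights, biases)]


def union_sdf(part_sdfs):
    """SDF of the whole scene as the union of its parts (pointwise minimum)."""
    return min(part_sdfs)


def random_plane(res, channels, rng):
    """Hypothetical stand-in for a generated feature plane."""
    return [[[rng.uniform(-1, 1) for _ in range(channels)]
             for _ in range(res)] for _ in range(res)]


# Illustrative sizes only; they are not taken from the paper.
RES, CHANNELS, NUM_PARTS = 8, 4, 3
rng = random.Random(0)
triplane = {name: random_plane(RES, CHANNELS, rng) for name in ("xy", "xz", "yz")}
weights = [[rng.uniform(-1, 1) for _ in range(3 * CHANNELS)] for _ in range(NUM_PARTS)]
biases = [rng.uniform(-1, 1) for _ in range(NUM_PARTS)]

part_sdfs = query_part_sdfs(triplane, weights, biases, (0.1, -0.3, 0.7))
print("per-part SDFs:", part_sdfs)
print("scene SDF (union):", union_sdf(part_sdfs))
```

Because each part keeps its own SDF channel, a part can be meshed, re-textured, or moved independently, which is the property the abstract's downstream applications rely on.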