DGPose: Deep Generative Models for Human Body Analysis
| dc.contributor.author | de Bem, Rodrigo | |
| dc.contributor.author | Ghosh, Arnab | |
| dc.contributor.author | Ajanthan, Thalaiyasingam | |
| dc.contributor.author | Miksik, Ondrej | |
| dc.contributor.author | Boukhayma, Adnane | |
| dc.contributor.author | Siddharth, N | |
| dc.contributor.author | Torr, Philip | |
| dc.date.accessioned | 2024-01-09T23:00:46Z | |
| dc.date.available | 2024-01-09T23:00:46Z | |
| dc.date.issued | 2020 | |
| dc.date.updated | 2022-09-25T08:16:28Z | |
| dc.description.abstract | Deep generative modelling for human body analysis is an emerging problem with many interesting applications. However, the latent space learned by such approaches is typically not interpretable, resulting in less flexibility. In this work, we present deep generative models for human body analysis in which the body pose and the visual appearance are disentangled. Such a disentanglement allows independent manipulation of pose and appearance, and hence enables applications such as pose-transfer without specific training for such a task. Our proposed models, the Conditional-DGPose and the Semi-DGPose, have different characteristics. In the first, body pose labels are taken as conditioners, from a fully-supervised training set. In the second, our structured semi-supervised approach allows for pose estimation to be performed by the model itself and relaxes the need for labelled data. Therefore, the Semi-DGPose aims for the joint understanding and generation of people in images. It is not only capable of mapping images to interpretable latent representations but also able to map these representations back to the image space. We compare our models with relevant baselines, the ClothNet-Body and the Pose Guided Person Generation networks, demonstrating their merits on the Human3.6M, ChictopiaPlus and DeepFashion benchmarks | en_AU |
| dc.description.sponsorship | This work was supported by the ERC grant ERC-2012-AdG 321162-HELIOS, EPSRC grant Seebibyte EP/M013774/1 and EPSRC/MURI grant EP/N019474/1. We would also like to acknowledge the Royal Academy of Engineering and FiveAI. Rodrigo de Bem is a CAPES Foundation scholarship holder (Process no: 99999.013296/2013-02, Ministry of Education, Brazil). | en_AU |
| dc.format.mimetype | application/pdf | en_AU |
| dc.identifier.issn | 0920-5691 | en_AU |
| dc.identifier.uri | http://hdl.handle.net/1885/311292 | |
| dc.language.iso | en_AU | en_AU |
| dc.provenance | This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. | en_AU |
| dc.publisher | Springer | en_AU |
| dc.rights | © 2020 The authors | en_AU |
| dc.rights.license | Creative Commons Attribution licence | en_AU |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en_AU |
| dc.source | International Journal of Computer Vision | en_AU |
| dc.subject | Deep generative models | en_AU |
| dc.subject | Semi-supervised learning · | en_AU |
| dc.subject | Human pose estimation | en_AU |
| dc.subject | Variational autoencoders | en_AU |
| dc.subject | Variational autoencoders | en_AU |
| dc.title | DGPose: Deep Generative Models for Human Body Analysis | en_AU |
| dc.type | Journal article | en_AU |
| dcterms.accessRights | Open Access | en_AU |
| local.bibliographicCitation.lastpage | 1563 | en_AU |
| local.bibliographicCitation.startpage | 1537 | en_AU |
| local.contributor.affiliation | de Bem, Rodrigo, University of Oxford | en_AU |
| local.contributor.affiliation | Ghosh, Arnab, University of Oxford | en_AU |
| local.contributor.affiliation | Ajanthan, Thalaiyasingam, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Miksik, Ondrej, University of Oxford | en_AU |
| local.contributor.affiliation | Boukhayma, Adnane, University of Oxford | en_AU |
| local.contributor.affiliation | Siddharth, N, University of Oxford | en_AU |
| local.contributor.affiliation | Torr, Philip, University of Oxford | en_AU |
| local.contributor.authoruid | Ajanthan, Thalaiyasingam, u5478870 | en_AU |
| local.description.notes | Imported from ARIES | en_AU |
| local.identifier.absfor | 460304 - Computer vision | en_AU |
| local.identifier.ariespublication | a383154xPUB13246 | en_AU |
| local.identifier.citationvolume | 128 | en_AU |
| local.identifier.doi | 10.1007/s11263-020-01306-1 | en_AU |
| local.identifier.scopusID | 2-s2.0-85084132977 | |
| local.identifier.thomsonID | WOS:000528435100001 | |
| local.publisher.url | https://link.springer.com/ | en_AU |
| local.type.status | Published Version | en_AU |
Downloads
Original bundle
1 - 1 of 1
Loading...
- Name:
- s11263-020-01306-1.pdf
- Size:
- 15.88 MB
- Format:
- Adobe Portable Document Format
- Description: