DGPose: Deep Generative Models for Human Body Analysis

de Bem, Rodrigo; Ghosh, Arnab; Ajanthan, Thalaiyasingam; Miksik, Ondrej; Boukhayma, Adnane; Siddharth, N; Torr, Philip

DGPose: Deep Generative Models for Human Body Analysis

dc.contributor.author	de Bem, Rodrigo
dc.contributor.author	Ghosh, Arnab
dc.contributor.author	Ajanthan, Thalaiyasingam
dc.contributor.author	Miksik, Ondrej
dc.contributor.author	Boukhayma, Adnane
dc.contributor.author	Siddharth, N
dc.contributor.author	Torr, Philip
dc.date.accessioned	2024-01-09T23:00:46Z
dc.date.available	2024-01-09T23:00:46Z
dc.date.issued	2020
dc.date.updated	2022-09-25T08:16:28Z
dc.description.abstract	Deep generative modelling for human body analysis is an emerging problem with many interesting applications. However, the latent space learned by such approaches is typically not interpretable, resulting in less flexibility. In this work, we present deep generative models for human body analysis in which the body pose and the visual appearance are disentangled. Such a disentanglement allows independent manipulation of pose and appearance, and hence enables applications such as pose-transfer without specific training for such a task. Our proposed models, the Conditional-DGPose and the Semi-DGPose, have different characteristics. In the first, body pose labels are taken as conditioners, from a fully-supervised training set. In the second, our structured semi-supervised approach allows for pose estimation to be performed by the model itself and relaxes the need for labelled data. Therefore, the Semi-DGPose aims for the joint understanding and generation of people in images. It is not only capable of mapping images to interpretable latent representations but also able to map these representations back to the image space. We compare our models with relevant baselines, the ClothNet-Body and the Pose Guided Person Generation networks, demonstrating their merits on the Human3.6M, ChictopiaPlus and DeepFashion benchmarks	en_AU
dc.description.sponsorship	This work was supported by the ERC grant ERC-2012-AdG 321162-HELIOS, EPSRC grant Seebibyte EP/M013774/1 and EPSRC/MURI grant EP/N019474/1. We would also like to acknowledge the Royal Academy of Engineering and FiveAI. Rodrigo de Bem is a CAPES Foundation scholarship holder (Process no: 99999.013296/2013-02, Ministry of Education, Brazil).	en_AU
dc.format.mimetype	application/pdf	en_AU
dc.identifier.issn	0920-5691	en_AU
dc.identifier.uri	http://hdl.handle.net/1885/311292
dc.language.iso	en_AU	en_AU
dc.provenance	This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.	en_AU
dc.publisher	Springer	en_AU
dc.rights	© 2020 The authors	en_AU
dc.rights.license	Creative Commons Attribution licence	en_AU
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en_AU
dc.source	International Journal of Computer Vision	en_AU
dc.subject	Deep generative models	en_AU
dc.subject	Semi-supervised learning ·	en_AU
dc.subject	Human pose estimation	en_AU
dc.subject	Variational autoencoders	en_AU
dc.subject	Variational autoencoders	en_AU
dc.title	DGPose: Deep Generative Models for Human Body Analysis	en_AU
dc.type	Journal article	en_AU
dcterms.accessRights	Open Access	en_AU
local.bibliographicCitation.lastpage	1563	en_AU
local.bibliographicCitation.startpage	1537	en_AU
local.contributor.affiliation	de Bem, Rodrigo, University of Oxford	en_AU
local.contributor.affiliation	Ghosh, Arnab, University of Oxford	en_AU
local.contributor.affiliation	Ajanthan, Thalaiyasingam, College of Engineering and Computer Science, ANU	en_AU
local.contributor.affiliation	Miksik, Ondrej, University of Oxford	en_AU
local.contributor.affiliation	Boukhayma, Adnane, University of Oxford	en_AU
local.contributor.affiliation	Siddharth, N, University of Oxford	en_AU
local.contributor.affiliation	Torr, Philip, University of Oxford	en_AU
local.contributor.authoruid	Ajanthan, Thalaiyasingam, u5478870	en_AU
local.description.notes	Imported from ARIES	en_AU
local.identifier.absfor	460304 - Computer vision	en_AU
local.identifier.ariespublication	a383154xPUB13246	en_AU
local.identifier.citationvolume	128	en_AU
local.identifier.doi	10.1007/s11263-020-01306-1	en_AU
local.identifier.scopusID	2-s2.0-85084132977
local.identifier.thomsonID	WOS:000528435100001
local.publisher.url	https://link.springer.com/	en_AU
local.type.status	Published Version	en_AU

Downloads

Original bundle

Now showing 1 - 1 of 1

Name:: s11263-020-01306-1.pdf
Size:: 15.88 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

ANU Research Publications

Cultural advice

DGPose: Deep Generative Models for Human Body Analysis

Downloads

Original bundle

Collections