Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

DGPose: Deep Generative Models for Human Body Analysis

dc.contributor.authorde Bem, Rodrigo
dc.contributor.authorGhosh, Arnab
dc.contributor.authorAjanthan, Thalaiyasingam
dc.contributor.authorMiksik, Ondrej
dc.contributor.authorBoukhayma, Adnane
dc.contributor.authorSiddharth, N
dc.contributor.authorTorr, Philip
dc.date.accessioned2024-01-09T23:00:46Z
dc.date.available2024-01-09T23:00:46Z
dc.date.issued2020
dc.date.updated2022-09-25T08:16:28Z
dc.description.abstractDeep generative modelling for human body analysis is an emerging problem with many interesting applications. However, the latent space learned by such approaches is typically not interpretable, resulting in less flexibility. In this work, we present deep generative models for human body analysis in which the body pose and the visual appearance are disentangled. Such a disentanglement allows independent manipulation of pose and appearance, and hence enables applications such as pose-transfer without specific training for such a task. Our proposed models, the Conditional-DGPose and the Semi-DGPose, have different characteristics. In the first, body pose labels are taken as conditioners, from a fully-supervised training set. In the second, our structured semi-supervised approach allows for pose estimation to be performed by the model itself and relaxes the need for labelled data. Therefore, the Semi-DGPose aims for the joint understanding and generation of people in images. It is not only capable of mapping images to interpretable latent representations but also able to map these representations back to the image space. We compare our models with relevant baselines, the ClothNet-Body and the Pose Guided Person Generation networks, demonstrating their merits on the Human3.6M, ChictopiaPlus and DeepFashion benchmarksen_AU
dc.description.sponsorshipThis work was supported by the ERC grant ERC-2012-AdG 321162-HELIOS, EPSRC grant Seebibyte EP/M013774/1 and EPSRC/MURI grant EP/N019474/1. We would also like to acknowledge the Royal Academy of Engineering and FiveAI. Rodrigo de Bem is a CAPES Foundation scholarship holder (Process no: 99999.013296/2013-02, Ministry of Education, Brazil).en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.issn0920-5691en_AU
dc.identifier.urihttp://hdl.handle.net/1885/311292
dc.language.isoen_AUen_AU
dc.provenanceThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.en_AU
dc.publisherSpringeren_AU
dc.rights© 2020 The authorsen_AU
dc.rights.licenseCreative Commons Attribution licenceen_AU
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en_AU
dc.sourceInternational Journal of Computer Visionen_AU
dc.subjectDeep generative modelsen_AU
dc.subjectSemi-supervised learning ·en_AU
dc.subjectHuman pose estimationen_AU
dc.subjectVariational autoencodersen_AU
dc.subjectVariational autoencodersen_AU
dc.titleDGPose: Deep Generative Models for Human Body Analysisen_AU
dc.typeJournal articleen_AU
dcterms.accessRightsOpen Accessen_AU
local.bibliographicCitation.lastpage1563en_AU
local.bibliographicCitation.startpage1537en_AU
local.contributor.affiliationde Bem, Rodrigo, University of Oxforden_AU
local.contributor.affiliationGhosh, Arnab, University of Oxforden_AU
local.contributor.affiliationAjanthan, Thalaiyasingam, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationMiksik, Ondrej, University of Oxforden_AU
local.contributor.affiliationBoukhayma, Adnane, University of Oxforden_AU
local.contributor.affiliationSiddharth, N, University of Oxforden_AU
local.contributor.affiliationTorr, Philip, University of Oxforden_AU
local.contributor.authoruidAjanthan, Thalaiyasingam, u5478870en_AU
local.description.notesImported from ARIESen_AU
local.identifier.absfor460304 - Computer visionen_AU
local.identifier.ariespublicationa383154xPUB13246en_AU
local.identifier.citationvolume128en_AU
local.identifier.doi10.1007/s11263-020-01306-1en_AU
local.identifier.scopusID2-s2.0-85084132977
local.identifier.thomsonIDWOS:000528435100001
local.publisher.urlhttps://link.springer.com/en_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s11263-020-01306-1.pdf
Size:
15.88 MB
Format:
Adobe Portable Document Format
Description:
abcd