The IKEA ASM Dataset: Understanding people assembling furniture through actions, objects and pose
| dc.contributor.author | Ben Shabat, Yizhak | |
| dc.contributor.author | Yu, Xin | |
| dc.contributor.author | Saleh, Fatemehsadat | |
| dc.contributor.author | Campbell, Dylan | |
| dc.contributor.author | Rodriguez Opazo, Cristian | |
| dc.contributor.author | Li, Hongdong | |
| dc.contributor.author | Gould, Stephen | |
| dc.coverage.spatial | Virtual, Waikoloa, HI, USA | |
| dc.date.accessioned | 2024-01-31T00:48:45Z | |
| dc.date.created | January 5-9, 2021 | |
| dc.date.issued | 2021 | |
| dc.date.updated | 2022-10-02T07:18:48Z | |
| dc.description.abstract | The availability of a large labeled dataset is a key requirement for applying deep learning methods to solve various computer vision tasks. In the context of understanding human activities, existing public datasets, while large in size, are often limited to a single RGB camera and provide only per-frame or per-clip action annotations. To enable richer analysis and understanding of human activities, we introduce IKEA ASM - a three million frame, multi-view, furniture assembly video dataset that includes depth, atomic actions, object segmentation, and human poses. Additionally, we benchmark prominent methods for video action recognition, object segmentation and human pose estimation tasks on this challenging dataset. The dataset enables the development of holistic methods, which integrate multi-modal and multi-view data to better perform on these tasks. | en_AU |
| dc.description.sponsorship | This work was funded by the Australian Research Council Centre of Excellence for Robotic Vision. | en_AU |
| dc.format.mimetype | application/pdf | en_AU |
| dc.identifier.isbn | 978-1-6654-0477-8 | en_AU |
| dc.identifier.uri | http://hdl.handle.net/1885/312455 | |
| dc.language.iso | en_AU | en_AU |
| dc.provenance | https://www.ieee.org/publications/rights/author-posting-policy.html..."The policy reaffirms the principle that authors are free to post their own version of their IEEE periodical or conference articles on their personal Web sites, those of their employers, or their funding agencies for the purpose of meeting public availability requirements prescribed by their funding agencies. " from the publisher site (as at 31 Jan 2024) © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works | |
| dc.publisher | IEEE | en_AU |
| dc.relation | http://purl.org/au-research/grants/arc/CE140100016 | en_AU |
| dc.relation.ispartofseries | 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 | en_AU |
| dc.rights | © 2021 IEEE | en_AU |
| dc.source | Proceedings of the 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 | en_AU |
| dc.source.uri | https://wacv2021.thecvf.com/home | en_AU |
| dc.title | The IKEA ASM Dataset: Understanding people assembling furniture through actions, objects and pose | en_AU |
| dc.type | Conference paper | en_AU |
| dcterms.accessRights | Open Access | |
| local.bibliographicCitation.lastpage | 858 | en_AU |
| local.bibliographicCitation.startpage | 846 | en_AU |
| local.contributor.affiliation | Ben Shabat, Yizhak, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Yu, Xin, University of Technology Sydney | en_AU |
| local.contributor.affiliation | Saleh, Fatemehsadat, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Campbell, Dylan, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Rodriguez Opazo, Cristian, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Li, Hongdong, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Gould, Stephen, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.authoruid | Ben Shabat, Yizhak, u1086420 | en_AU |
| local.contributor.authoruid | Saleh, Fatemehsadat, u5704022 | en_AU |
| local.contributor.authoruid | Campbell, Dylan, u5436050 | en_AU |
| local.contributor.authoruid | Rodriguez Opazo, Cristian, u5419700 | en_AU |
| local.contributor.authoruid | Li, Hongdong, u4056952 | en_AU |
| local.contributor.authoruid | Gould, Stephen, u4971180 | en_AU |
| local.description.notes | Imported from ARIES | en_AU |
| local.description.refereed | Yes | |
| local.identifier.absfor | 461103 - Deep learning | en_AU |
| local.identifier.absfor | 460304 - Computer vision | en_AU |
| local.identifier.ariespublication | a383154xPUB24253 | en_AU |
| local.identifier.doi | 10.1109/WACV48630.2021.00089 | en_AU |
| local.identifier.scopusID | WACV)2-s2.0-85116103082 | |
| local.publisher.url | https://www.ieee.org/ | en_AU |
| local.type.status | Accepted Version | en_AU |
Downloads
Original bundle
1 - 1 of 1