Transductive learning for zero-shot object detection
| dc.contributor.author | Rahman, Shafin | |
| dc.contributor.author | Khan, Salman Hameed | |
| dc.contributor.author | Barnes, Nick | |
| dc.contributor.editor | Lee, Kyoung Mu | |
| dc.contributor.editor | Forsyth, David | |
| dc.contributor.editor | Pollefeys, Marc | |
| dc.contributor.editor | Tang, Xiaoou | |
| dc.coverage.spatial | Seoul South Korea | |
| dc.date.accessioned | 2023-07-24T23:42:22Z | |
| dc.date.created | Oct 27-Nov 2 2019 | |
| dc.date.issued | 2019 | |
| dc.date.updated | 2022-05-29T08:16:33Z | |
| dc.description.abstract | Zero-shot object detection (ZSD) is a relatively unexplored research problem as compared to the conventional zero-shot recognition task. ZSD aims to detect previously unseen objects during inference. Existing ZSD works suffer from two critical issues: (a) large domain-shift between the source (seen) and target (unseen) domains since the two distributions are highly mismatched. (b) the learned model is biased against unseen classes, therefore in generalized ZSD settings, where both seen and unseen objects co-occur during inference, the learned model tends to misclassify unseen to seen categories. This brings up an important question: How effectively can a transductive setting address the aforementioned problems? To the best of our knowledge, we are the first to propose a transductive zero-shot object detection approach that convincingly reduces the domain-shift and model-bias against unseen classes. Our approach is based on a self-learning mechanism that uses a novel hybrid pseudo-labeling technique. It progressively updates learned model parameters by associating unlabeled data samples to their corresponding classes. During this process, our technique makes sure that knowledge that was previously acquired on the source domain is not forgotten. We report significant 'relative' improvements of 34.9% and 77.1% in terms of mAP and recall rates over the previous best inductive models on MSCOCO dataset. | en_AU |
| dc.description.sponsorship | This work was supported in part by NH&MRC Project grant #1082358 | en_AU |
| dc.format.mimetype | application/pdf | en_AU |
| dc.identifier.isbn | 9781728148038 | en_AU |
| dc.identifier.uri | http://hdl.handle.net/1885/294523 | |
| dc.language.iso | en_AU | en_AU |
| dc.publisher | IEEE, Institute of Electrical and Electronics Engineers | en_AU |
| dc.relation | http://purl.org/au-research/grants/nhmrc/1082358 | en_AU |
| dc.relation.ispartofseries | 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019 | en_AU |
| dc.rights | © 2019 IEEE | en_AU |
| dc.source | Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019 | en_AU |
| dc.title | Transductive learning for zero-shot object detection | en_AU |
| dc.type | Conference paper | en_AU |
| local.bibliographicCitation.lastpage | 6090 | en_AU |
| local.bibliographicCitation.startpage | 6081 | en_AU |
| local.contributor.affiliation | Rahman, Shafin, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.affiliation | Khan, Salman, Academic Portfolio, ANU | en_AU |
| local.contributor.affiliation | Barnes, Nick, College of Engineering and Computer Science, ANU | en_AU |
| local.contributor.authoruid | Rahman, Shafin, u5929575 | en_AU |
| local.contributor.authoruid | Khan, Salman, u1029115 | en_AU |
| local.contributor.authoruid | Barnes, Nick, u4591576 | en_AU |
| local.description.embargo | 2099-12-31 | |
| local.description.notes | Imported from ARIES | en_AU |
| local.description.refereed | Yes | |
| local.identifier.absfor | 460300 - Computer vision and multimedia computation | en_AU |
| local.identifier.ariespublication | a383154xPUB11590 | en_AU |
| local.identifier.doi | 10.1109/ICCV.2019.00618 | en_AU |
| local.identifier.scopusID | 2-s2.0-85081924703 | |
| local.identifier.thomsonID | WOS:000548549201020 | |
| local.publisher.url | https://www.ieee.org/ | en_AU |
| local.type.status | Published Version | en_AU |
Downloads
Original bundle
1 - 1 of 1
Loading...
- Name:
- Transductive_Learning_for_Zero-Shot_Object_Detection.pdf
- Size:
- 603.46 KB
- Format:
- Adobe Portable Document Format
- Description: