Learning dynamic hierarchical models for anytime scene labeling
| dc.contributor.author | Liu, Buyu | en |
| dc.contributor.author | He, Xuming | en |
| dc.date.accessioned | 2025-12-17T15:40:50Z | |
| dc.date.available | 2025-12-17T15:40:50Z | |
| dc.date.issued | 2016 | en |
| dc.description.abstract | With increasing demand for efficient image and video analysis, test-time cost of scene parsing becomes critical for many large-scale or time-sensitive vision applications. We propose a dynamic hierarchical model for anytime scene labeling that allows us to achieve flexible tradeoffs between efficiency and accuracy in pixel-level prediction. In particular, our approach incorporates the cost of feature computation and model inference, and optimizes the model performance for any given test-time budget by learning a sequence of image-adaptive hierarchical models. We formulate this anytime representation learning as a Markov Decision Process with a discrete-continuous state-action space. A high-quality policy of feature and model selection is learned based on an approximate policy iteration method with action proposal mechanism. We demonstrate the advantages of our dynamic non-myopic anytime scene parsing on three semantic segmentation datasets, which achieves 90% of the state-of-the-art performances by using 15% of their overall costs. | en |
| dc.description.sponsorship | DATA61 (formerly NICTA) is funded by the Australian Government through the Department of Communications and the Australian Research Council through the ICT Centre for Excellence Program. We thank NVIDIA Corporation for the donation of GPUs used in this research. | en |
| dc.description.status | Peer-reviewed | en |
| dc.format.extent | 17 | en |
| dc.identifier.isbn | 9783319464657 | en |
| dc.identifier.issn | 0302-9743 | en |
| dc.identifier.scopus | 84990050134 | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733796095 | |
| dc.language.iso | en | en |
| dc.publisher | Springer Verlag | en |
| dc.relation.ispartof | Computer Vision - 14th European Conference, ECCV 2016, Proceedings | en |
| dc.relation.ispartofseries | 14th European Conference on Computer Vision, ECCV 2016 | en |
| dc.relation.ispartofseries | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | en |
| dc.rights | Publisher Copyright: © Springer International Publishing AG 2016. | en |
| dc.title | Learning dynamic hierarchical models for anytime scene labeling | en |
| dc.type | Conference paper | en |
| dspace.entity.type | Publication | en |
| local.bibliographicCitation.lastpage | 666 | en |
| local.bibliographicCitation.startpage | 650 | en |
| local.contributor.affiliation | Liu, Buyu; School of Engineering, ANU College of Systems and Society, The Australian National University | en |
| local.contributor.affiliation | He, Xuming; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.identifier.ariespublication | a383154xPUB4414 | en |
| local.identifier.doi | 10.1007/978-3-319-46466-4_39 | en |
| local.identifier.essn | 1611-3349 | en |
| local.identifier.pure | 9d57a8cb-0cab-4d77-ad68-9d724467d79c | en |
| local.identifier.url | https://www.scopus.com/pages/publications/84990050134 | en |
| local.type.status | Published | en |