A Multi-modal Graphical Model for Scene Analysis
| dc.contributor.author | Taghavi Namin, Sarah | |
| dc.contributor.author | Najafi, Mohammad | |
| dc.contributor.author | Salzmann, Mathieu | |
| dc.contributor.author | Petersson, Lars | |
| dc.coverage.spatial | Honolulu, USA | |
| dc.date.accessioned | 2016-06-14T23:20:02Z | |
| dc.date.created | January 5-9 2015 | |
| dc.date.issued | 2015 | |
| dc.date.updated | 2016-06-14T08:45:02Z | |
| dc.description.abstract | In this paper, we introduce a multi-modal graphical model to address the problem of semantic segmentation using 2D-3D data exhibiting extensive many-to-one correspondences. Existing methods often impose a hard correspondence between the 2D and 3D data, where the 2D and 3D corresponding regions are forced to receive identical labels. This results in performance degradation due to misalignments, 3D-2D projection errors and occlusions. We address this issue by defining a graph over the entire set of data that models soft correspondences between the two modalities. This graph encourages each region in a modality to leverage the information from its corresponding regions in the other modality to better estimate its class label. We evaluate our method on a publicly available dataset and outperform the state of the art. Additionally, to demonstrate the ability of our model to support multiple correspondences for objects in 3D and 2D domains, we introduce a new multi-modal dataset, which is composed of panoramic images and LIDAR data, and features a rich set of many-to-one correspondences. | |
| dc.identifier.isbn | 9781479966820 | |
| dc.identifier.uri | http://hdl.handle.net/1885/103164 | |
| dc.publisher | IEEE | |
| dc.relation.ispartofseries | 2015 15th IEEE Winter Conference on Applications of Computer Vision, WACV 2015 | |
| dc.source | Proceedings - 2015 IEEE Winter Conference on Applications of Computer Vision, WACV 2015 | |
| dc.title | A Multi-modal Graphical Model for Scene Analysis | |
| dc.type | Conference paper | |
| local.bibliographicCitation.lastpage | 1013 | |
| local.bibliographicCitation.startpage | 1006 | |
| local.contributor.affiliation | Taghavi Namin, Sarah, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Najafi, Mohammad, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Salzmann, Mathieu, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Petersson, Lars, College of Engineering and Computer Science, ANU | |
| local.contributor.authoruid | Taghavi Namin, Sarah, u5105580 | |
| local.contributor.authoruid | Najafi, Mohammad, u4938496 | |
| local.contributor.authoruid | Salzmann, Mathieu, u5214770 | |
| local.contributor.authoruid | Petersson, Lars, u4048690 | |
| local.description.embargo | 2037-12-31 | |
| local.description.notes | Imported from ARIES | |
| local.description.refereed | Yes | |
| local.identifier.absfor | 080000 - INFORMATION AND COMPUTING SCIENCES | |
| local.identifier.absfor | 080104 - Computer Vision | |
| local.identifier.absseo | 970108 - Expanding Knowledge in the Information and Computing Sciences | |
| local.identifier.ariespublication | U3488905xPUB5393 | |
| local.identifier.doi | 10.1109/WACV.2015.139 | |
| local.identifier.scopusID | 2-s2.0-84925431004 | |
| local.type.status | Published Version |
Downloads
Original bundle
- Name: 01_Taghavi+Namin_A_Multi-modal_Graphical_Model_2015.pdf
- Size: 874.29 KB
- Format: Adobe Portable Document Format