A Multi-modal Graphical Model for Scene Analysis

Taghavi Namin, Sarah; Najafi, Mohammad; Salzmann, Mathieu; Petersson, Lars

A Multi-modal Graphical Model for Scene Analysis

dc.contributor.author	Taghavi Namin, Sarah
dc.contributor.author	Najafi, Mohammad
dc.contributor.author	Salzmann, Mathieu
dc.contributor.author	Petersson, Lars
dc.coverage.spatial	Honolulu, USA
dc.date.accessioned	2016-06-14T23:20:02Z
dc.date.created	January 5-9 2015
dc.date.issued	2015
dc.date.updated	2016-06-14T08:45:02Z
dc.description.abstract	In this paper, we introduce a multi-modal graphical model to address the problems of semantic segmentation using 2D-3D data exhibiting extensive many-to-one correspondences. Existing methods often impose a hard correspondence between the 2D and 3D data, where the 2D and 3D corresponding regions are forced to receive identical labels. This results in performance degradation due to misalignments, 3D-2D projection errors and occlusions. We address this issue by defining a graph over the entire set of data that models soft correspondences between the two modalities. This graph encourages each region in a modality to leverage the information from its corresponding regions in the other modality to better estimate its class label. We evaluate our method on a publicly available dataset and beat the state-of-the-art. Additionally, to demonstrate the ability of our model to support multiple correspondences for objects in 3D and 2D domains, we introduce a new multi-modal dataset, which is composed of panoramic images and LIDAR data, and features a rich set of many-to-one correspondences.
dc.identifier.isbn	9781479966820
dc.identifier.uri	http://hdl.handle.net/1885/103164
dc.publisher	IEEE
dc.relation.ispartofseries	2015 15th IEEE Winter Conference on Applications of Computer Vision, WACV 2015
dc.source	Proceedings - 2015 IEEE Winter Conference on Applications of Computer Vision, WACV 2015
dc.title	A Multi-modal Graphical Model for Scene Analysis
dc.type	Conference paper
local.bibliographicCitation.lastpage	1013
local.bibliographicCitation.startpage	1006
local.contributor.affiliation	Taghavi Namin, Sarah, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Najafi, Mohammad, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Salzmann, Mathieu, College of Engineering and Computer Science, ANU
local.contributor.affiliation	Petersson, Lars, College of Engineering and Computer Science, ANU
local.contributor.authoruid	Taghavi Namin, Sarah, u5105580
local.contributor.authoruid	Najafi, Mohammad, u4938496
local.contributor.authoruid	Salzmann, Mathieu, u5214770
local.contributor.authoruid	Petersson, Lars, u4048690
local.description.embargo	2037-12-31
local.description.notes	Imported from ARIES
local.description.refereed	Yes
local.identifier.absfor	080000 - INFORMATION AND COMPUTING SCIENCES
local.identifier.absfor	080104 - Computer Vision
local.identifier.absseo	970108 - Expanding Knowledge in the Information and Computing Sciences
local.identifier.ariespublication	U3488905xPUB5393
local.identifier.doi	10.1109/WACV.2015.139
local.identifier.scopusID	2-s2.0-84925431004
local.type.status	Published Version

Downloads

Original bundle

Now showing 1 - 1 of 1

Name:: 01_Taghavi+Namin_A_Multi-modal_Graphical_Model_2015.pdf
Size:: 874.29 KB
Format:: Adobe Portable Document Format

Download

Collections

ANU Research Publications