A Multi-modal Graphical Model for Scene Analysis

dc.contributor.authorTaghavi Namin, Sarah
dc.contributor.authorNajafi, Mohammad
dc.contributor.authorSalzmann, Mathieu
dc.contributor.authorPetersson, Lars
dc.coverage.spatialHonolulu, USA
dc.date.accessioned2016-06-14T23:20:02Z
dc.date.createdJanuary 5-9 2015
dc.date.issued2015
dc.date.updated2016-06-14T08:45:02Z
dc.description.abstractIn this paper, we introduce a multi-modal graphical model to address the problems of semantic segmentation using 2D-3D data exhibiting extensive many-to-one correspondences. Existing methods often impose a hard correspondence between the 2D and 3D data, where the 2D and 3D corresponding regions are forced to receive identical labels. This results in performance degradation due to misalignments, 3D-2D projection errors and occlusions. We address this issue by defining a graph over the entire set of data that models soft correspondences between the two modalities. This graph encourages each region in a modality to leverage the information from its corresponding regions in the other modality to better estimate its class label. We evaluate our method on a publicly available dataset and beat the state-of-the-art. Additionally, to demonstrate the ability of our model to support multiple correspondences for objects in 3D and 2D domains, we introduce a new multi-modal dataset, which is composed of panoramic images and LIDAR data, and features a rich set of many-to-one correspondences.
dc.identifier.isbn9781479966820
dc.identifier.urihttp://hdl.handle.net/1885/103164
dc.publisherIEEE
dc.relation.ispartofseries2015 15th IEEE Winter Conference on Applications of Computer Vision, WACV 2015
dc.sourceProceedings - 2015 IEEE Winter Conference on Applications of Computer Vision, WACV 2015
dc.titleA Multi-modal Graphical Model for Scene Analysis
dc.typeConference paper
local.bibliographicCitation.lastpage1013
local.bibliographicCitation.startpage1006
local.contributor.affiliationTaghavi Namin, Sarah, College of Engineering and Computer Science, ANU
local.contributor.affiliationNajafi, Mohammad, College of Engineering and Computer Science, ANU
local.contributor.affiliationSalzmann, Mathieu, College of Engineering and Computer Science, ANU
local.contributor.affiliationPetersson, Lars, College of Engineering and Computer Science, ANU
local.contributor.authoruidTaghavi Namin, Sarah, u5105580
local.contributor.authoruidNajafi, Mohammad, u4938496
local.contributor.authoruidSalzmann, Mathieu, u5214770
local.contributor.authoruidPetersson, Lars, u4048690
local.description.embargo2037-12-31
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.absfor080000 - INFORMATION AND COMPUTING SCIENCES
local.identifier.absfor080104 - Computer Vision
local.identifier.absseo970108 - Expanding Knowledge in the Information and Computing Sciences
local.identifier.ariespublicationU3488905xPUB5393
local.identifier.doi10.1109/WACV.2015.139
local.identifier.scopusID2-s2.0-84925431004
local.type.statusPublished Version

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
01_Taghavi+Namin_A_Multi-modal_Graphical_Model_2015.pdf
Size:
874.29 KB
Format:
Adobe Portable Document Format