Learning to Generate Object Segment Proposals with Multi-modal Cues

Zhang, Haoyang; He, Xuming; Porikli, Fatih

Learning to Generate Object Segment Proposals with Multi-modal Cues

Date

2017

Authors

Zhang, Haoyang

He, Xuming

Porikli, Fatih

Publisher

Springer International Publishing AG

Abstract

This paper presents a learning-based object segmentation proposal generation method for stereo images. Unlike existing methods which mostly rely on low-level appearance cue and handcrafted similarity functions to group segments, our method makes use of learned deep features and designed geometric features to represent a region, as well as a learned similarity network to guide the grouping process. Given an initial segmentation hierarchy, we sequentially merge adjacent regions in each level based on their affinity measured by the similarity network. This merging process generates new segmentation hierarchies, which are then used to produce a pool of regional proposals by taking region singletons, pairs, triplets and 4-tuples from them. In addition, we learn a ranking network that predicts the objectness score of each regional proposal and diversify the ranking based on Maximum Marginal Relevance measures. Experiments on the Cityscapes dataset show that our approach performs significantly better than the baseline and the current state-of-the-art.

URI

http://hdl.handle.net/1885/241203

Collections

ANU Research Publications

Type

Conference paper

Book Title

Computer Vision – ACCV 2016

DOI

10.1007/978-3-319-54181-5_8

Restricted until

2099-12-31

Downloads

File

Description

01_Zhang_Learning_to_Generate_Object_2017.pdf (1.9 MB)

Full item page

Cultural advice

Learning to Generate Object Segment Proposals with Multi-modal Cues

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads