Liu, Beyang; Gould, Stephen; Koller, Daphne
We consider the problem of estimating the depth of each pixel in a scene from a single monocular image. Unlike traditional approaches [18, 19], which attempt to map from appearance features to depth directly, we first perform a semantic segmentation of the scene and use the semantic labels to guide the 3D reconstruction. This approach provides several advantages: By knowing the semantic class of a pixel or region, depth and geometry constraints can be easily enforced (e.g., "sky" is far away...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.