Saleh, Fatemehsadat; Ali Akbarian, Mohammad Sadegh; Salzmann, Mathieu; Petersson, Lars; Alvarez, Jose; Gould, Stephen
Pixel-level annotations are expensive and time-consuming to obtain. Hence, weak supervision using only image tags could have a significant impact on semantic segmentation. Recently, CNN-based methods have been proposed that fine-tune pre-trained networks using image tags. Without additional information, this leads to poor localization accuracy. This problem, however, was alleviated by making use of objectness priors to generate foreground/background masks. Unfortunately, these priors either require...