Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Partially-supervised image captioning

dc.contributor.authorAnderson, Peter
dc.contributor.authorGould, Stephen
dc.contributor.authorJohnson, Mark
dc.contributor.editorGrauman, K
dc.contributor.editorCesa-Bianchi, N
dc.contributor.editorGarnett, R
dc.contributor.editorBengio, S
dc.contributor.editorLarochelle, H
dc.contributor.editorWallach, H
dc.coverage.spatialMontreal, Canada
dc.date.accessioned2024-02-12T22:37:02Z
dc.date.createdDecember 2-8 2018
dc.date.issued2018
dc.date.updated2022-10-02T07:19:29Z
dc.description.abstractImage captioning models are becoming increasingly successful at describing the content of images in restricted domains. However, if these models are to function in the wild - for example, as assistants for people with impaired vision - a much larger number and variety of visual concepts must be understood. To address this problem, we teach image captioning models new visual concepts from labeled images and object detection datasets. Since image labels and object classes can be interpreted as partial captions, we formulate this problem as learning from partially-specified sequence data. We then propose a novel algorithm for training sequence models, such as recurrent neural networks, on partially-specified sequences which we represent using finite state automata. In the context of image captioning, our method lifts the restriction that previously required image captioning models to be trained on paired image-sentence corpora only, or otherwise required specialized model architectures to take advantage of alternative data modalities. Applying our approach to an existing neural captioning model, we achieve state of the art results on the novel object captioning task using the COCO dataset. We further show that we can train a captioning model to describe new visual concepts from the Open Images dataset while maintaining competitive COCO evaluation scores.en_AU
dc.description.sponsorshipThis research was supported by a Google award through the Natural Language Understanding Focused Program, CRP 8201800363 from Data61/CSIRO, and under the Australian Research Council’s Discovery Projects funding scheme (project number DP160102156).en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.urihttp://hdl.handle.net/1885/313418
dc.language.isoen_AUen_AU
dc.publisherNeural Information Processing Systems Foundationen_AU
dc.relationhttp://purl.org/au-research/grants/arc/DP160102156en_AU
dc.relation.ispartofseries32nd Conference on Neural Information Processing Systems, NeurIPS 2018en_AU
dc.rights© 2018 Neural Information Processing Systems Foundationen_AU
dc.sourceAdvances in Neural Information Processing Systemsen_AU
dc.source.urihttps://proceedings.neurips.cc/paper_files/paper/2018en_AU
dc.titlePartially-supervised image captioningen_AU
dc.typeConference paperen_AU
dcterms.accessRightsFree Access via publisher websiteen_AU
local.bibliographicCitation.lastpage1886en_AU
local.bibliographicCitation.startpage1875en_AU
local.contributor.affiliationAnderson, Peter, Georgia Techen_AU
local.contributor.affiliationGould, Stephen, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationJohnson, Mark, Macquarie Universityen_AU
local.contributor.authoruidGould, Stephen, u4971180en_AU
local.description.embargo2099-12-31
local.description.notesImported from ARIESen_AU
local.description.refereedYes
local.identifier.absfor460200 - Artificial intelligenceen_AU
local.identifier.ariespublicationu3102795xPUB1743en_AU
local.identifier.citationvolume31en_AU
local.identifier.scopusID2-s2.0-85064841019
local.publisher.urlhttps://proceedings.neurips.cc/en_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
NeurIPS-2018-partially-supervised-image-captioning-Paper.pdf
Size:
3.71 MB
Format:
Adobe Portable Document Format
Description:
abcd