Proposal-free temporal moment localization of a natural-language query in video using guided attention

Rodriguez Opazo, Cristian; Marrese-Taylor, Edison; Saleh, Fatemehsadat; Li, Hongdong; Gould, Stephen

Proposal-free temporal moment localization of a natural-language query in video using guided attention

Date

2020

Authors

Rodriguez Opazo, Cristian

Marrese-Taylor, Edison

Saleh, Fatemehsadat

Li, Hongdong

Gould, Stephen

Publisher

IEEE

Abstract

This paper studies the problem of temporal moment localization in a long untrimmed video using natural language as the query. Given an untrimmed video and a query sentence, the goal is to determine the start and end of the relevant visual moment in the video that corresponds to the query sentence. While most previous works have tackled this by a propose-and-rank approach, we introduce a more efficient, end-to-end trainable, and proposal-free approach that is built upon three key components: a dynamic filter which adaptively transfers language information to visual domain attention map, a new loss function to guide the model to attend the most relevant part of the video, and soft labels to cope with annotation uncertainties. Our method is evaluated on three standard benchmark datasets, Charades-STA, TACoS and ActivityNet-Captions. Experimental results show our method outperforms state-of-the-art methods on these datasets, confirming the effectiveness of the method. We believe the proposed dynamic filter-based guided attention mechanism will prove valuable for other vision and language tasks as well.

URI

http://hdl.handle.net/1885/311631

Collections

ANU Research Publications

Source

2020 IEEE Winter Conference on Applications of Computer Vision (WACV)

Type

Conference paper

Access Statement

Open Access

DOI

10.1109/WACV45572.2020.9093328

Downloads

File

Description

1908.07236.pdf (2.63 MB)

Full item page

Cultural advice

Proposal-free temporal moment localization of a natural-language query in video using guided attention

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads