Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Spatial feature learning for robust binaural sound source localization using a composite feature vector

Loading...
Thumbnail Image

Date

Authors

Wu, Xiang
Talagala, Dumidu S.
Zhang, Wen
Abhayapala, Thushara

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Abstract

The performance of binaural speech source localization systems can be significantly impacted by an imperfect selection of spatial localization cues, due to the limited bandwidth of speech, and the effects of noise. In order to mitigate these impacts, this paper presents a novel method that combines a deterministic localization approach with a spatial feature learning process. Here, we (i) obtain a composite feature vector derived from analysing the mutual information between different spatial cues and (ii) estimate the optimum feature combination that minimizes the angular localization error in three dimensional space. The performance of the proposed mutual information based feature learning approach is evaluated for a range of speech inputs and noise conditions. We also demonstrate that the proposed approach improves the localization accuracy and its robustness, with respect to traditional localization algorithms, especially in the relatively low signal-to-noise ratio localization scenarios.

Description

Keywords

Citation

Source

Book Title

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 March 2016

Entity type

Access Statement

License Rights

Restricted until

abcd