Skip navigation
Skip navigation

Binaural localization of speech sources in the median plane using cepstral hrtf extraction

Talagala, Dumidu; Wu, Xiang; Zhang, Wen; Abhayapala, Thushara

Description

In binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be...[Show more]

dc.contributor.authorTalagala, Dumidu
dc.contributor.authorWu, Xiang
dc.contributor.authorZhang, Wen
dc.contributor.authorAbhayapala, Thushara
dc.coverage.spatialLisbon Portugal
dc.date.accessioned2015-12-10T22:23:14Z
dc.date.createdSeptember 1-5 2014
dc.identifier.isbn9780992862619
dc.identifier.urihttp://hdl.handle.net/1885/52687
dc.description.abstractIn binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.
dc.publisherIEEE
dc.relation.ispartofseries22nd European Signal Processing Conference, EUSIPCO 2014
dc.sourceEuropean Signal Processing Conference
dc.titleBinaural localization of speech sources in the median plane using cepstral hrtf extraction
dc.typeConference paper
local.description.notesImported from ARIES
local.description.refereedYes
dc.date.issued2014
local.identifier.absfor090609 - Signal Processing
local.identifier.ariespublicationa383154xPUB253
local.type.statusPublished Version
local.contributor.affiliationTalagala, Dumidu, College of Engineering and Computer Science, ANU
local.contributor.affiliationWu, Xiang, College of Engineering and Computer Science, ANU
local.contributor.affiliationZhang, Wen, College of Engineering and Computer Science, ANU
local.contributor.affiliationAbhayapala, Thushara, College of Engineering and Computer Science, ANU
local.description.embargo2037-12-31
local.bibliographicCitation.startpage2055
local.bibliographicCitation.lastpage2059
dc.date.updated2015-12-09T09:05:53Z
local.identifier.scopusID2-s2.0-84911942485
CollectionsANU Research Publications

Download

File Description SizeFormat Image
01_Talagala_Binaural_localization_of_2014.pdf167.31 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  19 May 2020/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator