Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Binaural localization of speech sources in the median plane using cepstral hrtf extraction

dc.contributor.authorTalagala, Dumidu
dc.contributor.authorWu, Xiang
dc.contributor.authorZhang, Wen
dc.contributor.authorAbhayapala, Thushara
dc.coverage.spatialLisbon Portugal
dc.date.accessioned2015-12-10T22:23:14Z
dc.date.createdSeptember 1-5 2014
dc.date.issued2014
dc.date.updated2015-12-09T09:05:53Z
dc.description.abstractIn binaural systems, source localization in the median plane is challenging due to the difficulty of exploring the spectral cues of the head-related transfer function (HRTF) independently of the source spectra. This paper presents a method of extracting the HRTF spectral cues using cepstral analysis for speech source localization in the median plane. Binaural signals are preprocessed in the cepstral domain so that the fine spectral structure of speech and the HRTF spectral envelope can be easily separated. We introduce (i) a truncated cepstral transformation to extract the relevant localization cues, and (ii) a mechanism to normalize the effects of the time varying speech spectra. The proposed method is evaluated and compared with a convolution based localization method using a speech corpus of multiple speakers. The results suggest that the proposed method fully exploits the available spectral cues for robust speaker independent binaural source localization in the median plane.
dc.identifier.isbn9780992862619
dc.identifier.urihttp://hdl.handle.net/1885/52687
dc.publisherIEEE
dc.relation.ispartofseries22nd European Signal Processing Conference, EUSIPCO 2014
dc.sourceEuropean Signal Processing Conference
dc.titleBinaural localization of speech sources in the median plane using cepstral hrtf extraction
dc.typeConference paper
local.bibliographicCitation.lastpage2059
local.bibliographicCitation.startpage2055
local.contributor.affiliationTalagala, Dumidu, College of Engineering and Computer Science, ANU
local.contributor.affiliationWu, Xiang, College of Engineering and Computer Science, ANU
local.contributor.affiliationZhang, Wen, College of Engineering and Computer Science, ANU
local.contributor.affiliationAbhayapala, Thushara, College of Engineering and Computer Science, ANU
local.contributor.authoruidTalagala, Dumidu, u4689954
local.contributor.authoruidWu, Xiang, u4914406
local.contributor.authoruidZhang, Wen, u2580478
local.contributor.authoruidAbhayapala, Thushara, u9701943
local.description.embargo2037-12-31
local.description.notesImported from ARIES
local.description.refereedYes
local.identifier.absfor090609 - Signal Processing
local.identifier.ariespublicationa383154xPUB253
local.identifier.scopusID2-s2.0-84911942485
local.type.statusPublished Version

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
01_Talagala_Binaural_localization_of_2014.pdf
Size:
167.31 KB
Format:
Adobe Portable Document Format
abcd