Exploring sub-band cepstral distances for more robust speaker classification
Date
Authors
Osanai, Takashi
Kinoshita, Yuko
Clermont, Frantz
Journal Title
Journal ISSN
Volume Title
Publisher
The Australasian Speech Science and Technology Association, Inc.
Abstract
This paper presents the first of two-part exploration into the potential of parametric cepstral distance (PCD) as a forensic voice comparison feature, based on Japanese vowel data collected from 306 male native speakers under microphone and mobile transmission conditions. The behaviours of PCDs were closely examined by altering sub-band settings, and we found the behaviour of PCDs to correspond well to what is known about formants, which suggests that PCDs are relatable to articulatory gestures. Comparison between sub-band and full-band PCD revealed that limiting the band range to a specific frequency region makes the feature more robust against channel mismatch, encouraging further examination of this potential feature.
Description
Citation
Collections
Source
Proceedings of the 17th Australasian International Conference on Speech Science and Technology
Type
Book Title
Entity type
Access Statement
License Rights
DOI
Restricted until
Downloads
File
Description