Exploring sub-band cepstral distances for more robust speaker classification

Authors

Osanai, Takashi
Kinoshita, Yuko
Clermont, Frantz

Publisher

The Australasian Speech Science and Technology Association, Inc.

Abstract

This paper presents the first part of a two-part exploration into the potential of parametric cepstral distance (PCD) as a forensic voice comparison feature, based on Japanese vowel data collected from 306 male native speakers under microphone and mobile transmission conditions. We examined the behaviour of PCDs closely by altering the sub-band settings and found that it corresponds well to what is known about formants, which suggests that PCDs are relatable to articulatory gestures. A comparison between sub-band and full-band PCDs revealed that limiting the band range to a specific frequency region makes the feature more robust against channel mismatch, which encourages further examination of this potential feature.

Source

Proceedings of the 17th Australasian International Conference on Speech Science and Technology
