Exploring the discriminatory potential of F0 distribution parameters in traditional forensic speaker recognition

Kinoshita, Yuko; Ishihara, Shunichi; Rose, Philip

Description

Despite its many prima facie attractive properties for forensic speaker recognition, F0 is regarded as having limited forensic value due to its large within-speaker variability. However, its forensic use to date has been limited mostly to its long-term mean and standard deviation. This paper examines the discriminatory potential, within a Likelihood Ratio-based approach, of additional parametric features from the distribution of long-term F0: its skew, kurtosis, modal F0 and modal density....

dc.contributor.author: Kinoshita, Yuko
dc.contributor.author: Ishihara, Shunichi
dc.contributor.author: Rose, Philip
dc.date.accessioned: 2015-12-10T22:30:56Z
dc.identifier.issn: 1748-8893
dc.identifier.uri: http://hdl.handle.net/1885/55307
dc.description.abstract: Despite its many prima facie attractive properties for forensic speaker recognition, F0 is regarded as having limited forensic value due to its large within-speaker variability. However, its forensic use to date has been limited mostly to its long-term mean and standard deviation. This paper examines the discriminatory potential, within a Likelihood Ratio-based approach, of additional parametric features from the distribution of long-term F0: its skew, kurtosis, modal F0 and modal density. Motivated by the observation that the shape of the long-term F0 distribution shows less within-speaker occasion-to-occasion difference, we report a forensic discrimination experiment with non-contemporaneous speech samples from 201 male Japanese speakers. Using a multivariate Likelihood Ratio as discriminant distance with the six LTF0 distribution parameters, an equal error rate of 10.7% is obtained from 201 target and 80400 non-target trials. We also investigate how the EER degrades as a function of the amount of voiced speech.
dc.publisher: Equinox Publishing Ltd
dc.source: The International Journal of Speech, Language and the Law
dc.subject: Keywords: Duration; F0; Forensic speaker recognition; Japanese; Likelihood ratio; Multivariate likelihood ratio; Non-contemporaneous recordings; Spontaneous speech; Univariate likelihood ratio
dc.title: Exploring the discriminatory potential of F0 distribution parameters in traditional forensic speaker recognition
dc.type: Journal article
local.description.notes: Imported from ARIES
local.identifier.citationvolume: 16
dc.date.issued: 2009
local.identifier.absfor: 200404 - Laboratory Phonetics and Speech Science
local.identifier.ariespublication: u9803255xPUB324
local.type.status: Published Version
local.contributor.affiliation: Kinoshita, Yuko, University of Canberra
local.contributor.affiliation: Ishihara, Shunichi, College of Asia and the Pacific, ANU
local.contributor.affiliation: Rose, Philip, College of Arts and Social Sciences, ANU
local.description.embargo: 2037-12-31
local.bibliographicCitation.issue: 1
local.bibliographicCitation.startpage: 91
local.bibliographicCitation.lastpage: 111
local.identifier.doi: 10.1558/ijsll.v16i1.91
dc.date.updated: 2016-02-24T12:12:11Z
local.identifier.scopusID: 2-s2.0-70349732908
local.identifier.thomsonID: 000271741600004
Collections: ANU Research Publications

Download

File: 01_Kinoshita_Exploring_the_discriminatory_2009.pdf
Size: 738.55 kB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
