Cantonese forensic voice comparison with higher‐level features: Likelihood ratio‐based validation using F‐pattern and tonal F0 trajectories over a disyllabic hexaphone

dc.contributor.authorRose, Philen
dc.contributor.authorWang, Xiaoen
dc.date.accessioned2026-01-01T08:42:06Z
dc.date.available2026-01-01T08:42:06Z
dc.date.issued2016en
dc.description.abstractA pilot experiment relating to estimation of strength of evidence in forensic voice comparison is described which explores the use of higher-level features extracted over a disyllabic word as a whole, rather than over individual monosyllables as conventionally practiced. The trajectories of the first three formants and tonal F0 of the hexaphonic disyllabic Cantonese word daihyat 'first' from controlled but natural non-contemporaneous recordings of 23 male speakers are modeled with polynomials, and multivariate likelihood ratios estimated from their coefficients. Evaluation with the log likelihood ratio cost validity metric Cllr shows an optimum performance is obtained, surprisingly, with lower order polynomials, with F2 requiring a cubic fit, and F1 and F3 quadratic. Fusion of F-pattern and tonal F0 results in considerable improvement over the individual features, reducing the Cllr to ca. 0.1. The forensic potential of the daihyat data is demonstrated by fusion with two other higher-level features: the F-pattern of Cantonese /i/ and short-term F0, which reduces the Cllr still further to 0.03. Important pros and cons of higher-level features and likelihood ratios are discussed, the latter illustrated with data from Japanese, and two varieties of English in real forensic casework.en
dc.description.statusPeer-revieweden
dc.format.extent8en
dc.identifier.scopus85073247802en
dc.identifier.urihttps://hdl.handle.net/1885/733799171
dc.language.isoenen
dc.relation.ispartofseriesSpeaker and Language Recognition Workshop, Odyssey 2016en
dc.rightsPublisher Copyright: © Odyssey 2016: Speaker and Language Recognition Workshop. All rights reserved.en
dc.subjectCantoneseen
dc.subjectF-pattern trajectoriesen
dc.subjectForensic voice comparisonen
dc.subjectHigher-level featuresen
dc.subjectLikelihood ratioen
dc.subjectTonal F0 trajectoryen
dc.titleCantonese forensic voice comparison with higher‐level features: Likelihood ratio‐based validation using F‐pattern and tonal F0 trajectories over a disyllabic hexaphoneen
dc.typeConference paperen
dspace.entity.typePublicationen
local.bibliographicCitation.lastpage333en
local.bibliographicCitation.startpage326en
local.contributor.affiliationRose, Phil; Sch of Culture History & Lang, School of Culture, History & Language, ANU College of Asia & the Pacific, The Australian National Universityen
local.contributor.affiliationWang, Xiao; Australian National Universityen
local.identifier.ariespublicationa383154xPUB38746en
local.identifier.doi10.21437/Odyssey.2016-47en
local.identifier.pure8c90954e-66c2-45f3-b5bc-0b0dc997dd36en
local.identifier.urlhttps://www.scopus.com/pages/publications/85073247802en
local.type.statusPublisheden

Downloads