Correlating cepstra with formant frequencies: Implications for phonetically-informed forensic voice comparison
dc.contributor.author | Hughes, Vincent | |
dc.contributor.author | Clermont, Frantz | |
dc.contributor.author | Harrison, Philip | |
dc.coverage.spatial | Shanghai, China (online) | |
dc.date.accessioned | 2024-01-05T02:55:43Z | |
dc.date.created | 25-29 October 2020 | |
dc.date.issued | 2020 | |
dc.date.updated | 2022-09-18T08:16:19Z | |
dc.description.abstract | A significant question for forensic voice comparison, and for speaker recognition more generally, is the extent to which different input features capture complementary speaker-specific information. Understanding complementarity allows us to make predictions about how combining methods using different features may produce better overall performance. In forensic contexts, it is also important to be able to explain to courts what information the underlying features are actually capturing. This paper addresses these issues by examining the extent to which MFCCs and LPCCs can predict F0, F1, F2, and F3 values using data extracted from the midpoint of the vocalic portion of the hesitation marker um for 89 speakers of standard southern British English. By-speaker correlations were calculated using multiple linear regression and performance was assessed using mean rho (?) values. Results show that the first two formants were more accurately predicted than F3 or F0. LPCCs consistently produced stronger correlations with the linguistic features than MFCCs, while increasing cepstral order up to 16 also increased the strength of the correlations. There was, however, considerable variability across speakers in terms of the accuracy of the predictions. We discuss the implications of these findings for forensic voice comparison. | en_AU |
dc.format.mimetype | application/pdf | en_AU |
dc.identifier.isbn | 9781713820697 | en_AU |
dc.identifier.uri | http://hdl.handle.net/1885/311183 | |
dc.language.iso | en_AU | en_AU |
dc.publisher | International Speech Communication Association | en_AU |
dc.relation.ispartofseries | 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) | en_AU |
dc.rights | © 2020 ISCA | en_AU |
dc.source | 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020): Cognitive Intelligence for Speech Processing | en_AU |
dc.title | Correlating cepstra with formant frequencies: Implications for phonetically-informed forensic voice comparison | en_AU |
dc.type | Conference paper | en_AU |
dcterms.accessRights | Free Access via publisher website | en_AU |
local.bibliographicCitation.lastpage | 1862 | en_AU |
local.bibliographicCitation.startpage | 1858 | en_AU |
local.contributor.affiliation | Hughes, Vincent, University of York | en_AU |
local.contributor.affiliation | Clermont, Frantz, College of Asia and the Pacific, ANU | en_AU |
local.contributor.affiliation | Harrison, Philip, University of York | en_AU |
local.contributor.authoremail | u3674215@anu.edu.au | en_AU |
local.contributor.authoruid | Clermont, Frantz, u3674215 | en_AU |
local.description.embargo | 2099-12-31 | |
local.description.notes | Imported from ARIES | en_AU |
local.description.refereed | Yes | |
local.identifier.absfor | 470401 - Applied linguistics and educational linguistics | en_AU |
local.identifier.ariespublication | a383154xPUB16957 | en_AU |
local.identifier.doi | 10.21437/Interspeech.2020-2216 | en_AU |
local.identifier.scopusID | 2-s2.0-85098154787 | |
local.identifier.uidSubmittedBy | a383154 | en_AU |
local.publisher.url | https://www.isca-speech.org/archive/interspeech_2020/hughes20_interspeech.html | en_AU |
local.type.status | Published Version | en_AU |
Downloads
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Correlating cepstra with formant frequencies.pdf
- Size:
- 901.01 KB
- Format:
- Adobe Portable Document Format
- Description: