A Comparative Study of 2D and 3D Lip Tracking Methods for AV ASR

dc.contributor.authorGöcke, Rolanden
dc.contributor.authorAsthana, Akshayen
dc.date.accessioned2025-12-31T17:41:34Z
dc.date.available2025-12-31T17:41:34Z
dc.date.issued2008en
dc.description.abstractOver the past two decades, many algorithms have been proposed to detect and track a human face and its facial features. Of particular interest to the Automatic Speech Recognition (ASR) community are algorithms that can track the shape of the lips, as such visual speech input can then be used in an auditory-visual (AV) ASR system to improve the recognition accuracy of traditional audio-only ASR systems, particularly in the presence of acoustic noise. Despite the large number of face and lip tracking algorithms that have been proposed over the years, there is a lack of a comparative study that evaluates such algorithms in the context of AV ASR performance. In this paper, the performance of various 2D and 3D lip tracking algorithms is compared from a point of view of AV ASR. In particular, the focus of this study is on algorithms that use explicit lip models. A number of variants of the recently popular Active Appearance Models (AAMs) are compared with a 3D lip tracking algorithm that uses stereo vision. All performance evaluations are made using the AVOZES data corpus.en
dc.description.statusPeer-revieweden
dc.format.extent6en
dc.identifier.scopus84859899509en
dc.identifier.urihttps://hdl.handle.net/1885/733797399
dc.language.isoenen
dc.relation.ispartofseries2008 International Conference on Auditory-Visual Speech Processing, AVSP 2008en
dc.rightsPublisher Copyright: Copyright © 2008 AVISA.en
dc.subjectActive appearance modelen
dc.subjectAuditory-visual automatic speech recognitionen
dc.subjectLip trackingen
dc.titleA Comparative Study of 2D and 3D Lip Tracking Methods for AV ASRen
dc.typeConference paperen
dspace.entity.typePublicationen
local.bibliographicCitation.lastpage240en
local.bibliographicCitation.startpage235en
local.contributor.affiliationGöcke, Roland; School of Engineering, ANU College of Systems and Society, The Australian National Universityen
local.contributor.affiliationAsthana, Akshay; School of Engineering, ANU College of Systems and Society, The Australian National Universityen
local.identifier.ariespublicationu2505865xPUB52en
local.identifier.pure02f733d8-e308-495a-91c2-1127da2ad990en
local.identifier.urlhttps://www.scopus.com/pages/publications/84859899509en
local.type.statusPublisheden

Downloads