Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

Speech denoising in multi-noise source environments using multiple microphone devices via Relative Transfer Matrix

Loading...
Thumbnail Image

Date

Authors

Kumar, Manish
Birnie, Lachlan
Abhayapala, Thushara
Arcos Holzinger, Sandra
BASTINE, AMY
Grixti-Cheng, Daniel
Samarasinghe, Prasanga

Journal Title

Journal ISSN

Volume Title

Publisher

Access Statement

Research Projects

Organizational Units

Journal Issue

Abstract

Speech denoising is a challenging problem when there are multiple active noise sources. This paper introduces a novel blind denoising approach using the Relative Transfer Matrix (ReTM) as a spatial feature of noise source locations and the environment in multi-microphone settings. The ReTM is a generalization of Relative Transfer Function (ReTF) for simultaneously active sources and multiple receivers. We allocate receivers into two multichannel groups and formulate the ReTM to describe the spatial mapping between them. The ReTM with respect to noise sources is estimated blindly using covariance matrices of microphone recordings during speech-free intervals. We use the ReTM to estimate the noise at one group of microphones from the other. The estimated noise is then subtracted from the incoming signal to achieve speech denoising. We illustrate the effectiveness of the proposed algorithm through simulations and experimental recordings. The method does not require prior knowledge of the number of speech and noise sources, nor source and microphone locations, and can be extended to a configuration with more than three microphones.

Description

Keywords

Citation

Source

2024 European Signal Processing Conference (EUSIPCO)

Book Title

Entity type

Publication

Access Statement

License Rights

DOI

Restricted until

abcd