Multi-source doa estimation through pattern recognition of the modal coherence of a reverberant soundfield

dc.contributor.authorFahim, Abdullah
dc.contributor.authorSamarasinghe, Prasanga
dc.contributor.authorAbhayapala, Thushara
dc.date.accessioned2024-02-20T02:34:53Z
dc.date.issued2020
dc.date.updated2022-10-02T07:20:22Z
dc.description.abstractWe propose a novel multi-source direction of arrival (DOA) estimation technique using a convolutional neural network algorithm which learns the modal coherence patterns of an incident soundfield through measured spherical harmonic coefficients. We train our model for individual time-frequency bins in the short-time Fourier transform spectrum by analyzing the unique snapshot of modal coherence for each desired direction. The proposed method is capable of estimating simultaneously active multiple sound sources on a 3D space using a single-source training scheme. This single-source training scheme reduces the training time and resource requirements as well as allows the reuse of the same trained model for different multi-source combinations. The method is evaluated against various simulated and practical noisy and reverberant environments with varying acoustic criteria and found to outperform the baseline methods in terms of DOA estimation accuracy. Furthermore, the proposed algorithm allows independent training of azimuth and elevation during a full DOA estimation over 3D space which significantly improves its training efficiency without affecting the overall estimation accuracy.en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.issn2329-9290en_AU
dc.identifier.urihttp://hdl.handle.net/1885/313766
dc.language.isoen_AUen_AU
dc.publisherIEEE Signal Processing Societyen_AU
dc.rights© 2019 IEEEen_AU
dc.sourceIEEE/ACM Transactions on Audio, Speech, and Language Processingen_AU
dc.subjectConvolutional neural networken_AU
dc.subjectDOA estimationen_AU
dc.subjectspatial audio processingen_AU
dc.subjectspherical harmonicsen_AU
dc.titleMulti-source doa estimation through pattern recognition of the modal coherence of a reverberant soundfielden_AU
dc.typeJournal articleen_AU
local.bibliographicCitation.lastpage14en_AU
local.bibliographicCitation.startpage1en_AU
local.contributor.affiliationFahim, Abdullah, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationSamarasinghe, Prasanga, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationAbhayapala, Thushara, College of Engineering and Computer Science, ANUen_AU
local.contributor.authoruidFahim, Abdullah, u5898301en_AU
local.contributor.authoruidSamarasinghe, Prasanga, u4801876en_AU
local.contributor.authoruidAbhayapala, Thushara, u9701943en_AU
local.description.embargo2099-12-31
local.description.notesImported from ARIESen_AU
local.identifier.absfor400607 - Signal processingen_AU
local.identifier.absfor460302 - Audio processingen_AU
local.identifier.ariespublicationu6269649xPUB711en_AU
local.identifier.citationvolume28en_AU
local.identifier.doi10.1109/TASLP.2019.2960734en_AU
local.identifier.scopusID2-s2.0-85078725718
local.identifier.thomsonIDWOS:000526685200004
local.publisher.urlhttps://ieeexplore.ieee.org/en_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Multi-Source_DOA_Estimation_Through_Pattern_Recognition_of_the_Modal_Coherence_of_a_Reverberant_Soundfield.pdf
Size:
2 MB
Format:
Adobe Portable Document Format
Description: