BLIND MULTI-CHANNEL SPEECH SEPARATION USING SPATIAL ESTIMATION IN TWO-SPEAKER ENVIRONMENTS
DOI:
https://doi.org/10.15625/0866-708X/48/4/1176
ABSTRACT
This paper investigates the problem of separating speech from a mixture of two speech signals in a room environment when no source localization information is available. Because this source information is missing, the use of a spatial detector comes at the expense of permutation ambiguity. To resolve the ambiguity, a correlation-based permutation alignment algorithm is employed to group the beamformer outputs into the correct sources. Evaluations using recordings from a real room environment show that the proposed beamformer offers a good level of interference suppression whilst maintaining low distortion of the desired source.
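The abstract does not spell out the alignment procedure, so the following is only a minimal sketch of one common correlation-based approach, not the authors' algorithm. It assumes that separation is carried out independently in each frequency bin of a short-time Fourier transform, that exactly two outputs are produced per bin, and that the amplitude envelopes of one speaker are correlated across bins. The function name `align_permutations` and the array layout of `Y` are hypothetical.

```python
import numpy as np

def align_permutations(Y):
    """Resolve the per-bin output ordering for a two-source separator.

    Y: complex array of shape (n_freq, 2, n_frames) holding the two
       separated outputs in every frequency bin (assumed layout).
    Returns the aligned copy of Y and a boolean array marking the bins
    whose two outputs were swapped.
    """
    env = np.abs(Y)                      # amplitude envelopes per bin
    aligned = Y.copy()
    swapped = np.zeros(Y.shape[0], dtype=bool)
    ref = env[0].copy()                  # bin 0 defines the source order
    for f in range(1, Y.shape[0]):
        # 2x2 block of correlations: reference source i vs. output j
        c = np.corrcoef(ref, env[f])[:2, 2:]
        keep_score = c[0, 0] + c[1, 1]
        swap_score = c[0, 1] + c[1, 0]
        if swap_score > keep_score:
            aligned[f] = Y[f, ::-1]
            swapped[f] = True
        # running mean of the envelopes aligned so far
        ref = (ref * f + np.abs(aligned[f])) / (f + 1)
    return aligned, swapped
```

Calling `aligned, swapped = align_permutations(Y)` returns the reordered outputs, which can then be passed to an inverse transform. The running-mean reference is a design choice made here for simplicity; comparing only neighbouring bins is an equally plausible variant.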
License
Vietnam Journal of Sciences and Technology (VJST) is an open access, peer-reviewed journal. All publications are free for everyone to read and download. In addition, articles are published under the terms of the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA) Licence, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and the ShareAlike terms are followed.
Copyright on any research article published in VJST is retained by the respective author(s), without restrictions. Authors grant VAST Journals System a license to publish the article and identify itself as the original publisher. By giving VJST permission to publish their research work, whether via the VJST journal portal or another channel, the author(s) agree to all the terms and conditions of the https://creativecommons.org/licenses/by-sa/4.0/ License and to the terms and conditions set by VJST.
Authors are responsible for securing all necessary copyright permissions for the use of third-party materials in their manuscripts.