Published April 1, 2022 | Version v1
Publication

Solving permutations in frequency-domain for blind separation of an arbitrary number of speech sources

Description

Blind separation of speech sources in reverberant environments is usually performed in the time-frequency domain, which gives rise to the permutation problem: the estimated sources may be ordered differently in different frequency components. A two-stage method to solve permutations with an arbitrary number of sources is proposed. The suggested procedure is based on the spectral consistency of the sources. In the first stage, frequency bins are compared with each other, while in the second stage the neighboring frequencies are emphasized. Experiments for perfect separation situations and for live recordings show that the proposed method improves on the results of existing approaches.
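To make the permutation problem concrete, the following is a minimal Python sketch of a generic two-stage alignment in the spirit described above. It is not the authors' algorithm: the use of amplitude-envelope correlation as the consistency measure, the exhaustive search over permutations (practical only for a small number of sources), and all names and parameters (`align_permutations`, `n_iter`, `n_neighbors`) are assumptions made for illustration.

```python
from itertools import permutations

import numpy as np


def _normalize(x):
    # Zero-mean, unit-norm rows, so that dot products are correlation coefficients.
    x = x - x.mean(axis=-1, keepdims=True)
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + 1e-12)


def _best_perm(ref, cand):
    # corr[i, j] = correlation between reference row i and candidate row j.
    corr = _normalize(ref) @ _normalize(cand).T
    n = corr.shape[0]
    rows = np.arange(n)
    # Exhaustive search: pick the ordering p that maximizes sum_i corr[i, p[i]].
    return max(permutations(range(n)), key=lambda p: corr[rows, list(p)].sum())


def align_permutations(env, n_iter=3, n_neighbors=3):
    """Return perms of shape (n_freq, n_src) so that env[f, perms[f]] is aligned.

    env: array of shape (n_freq, n_src, n_frames) holding the amplitude
    envelopes of the separated outputs in each frequency bin (assumed input).
    """
    n_freq, n_src, _ = env.shape
    perms = np.tile(np.arange(n_src), (n_freq, 1))

    # Stage 1 (global): compare every bin against a reference built from all
    # bins, starting from bin 0 and refining the reference after each pass.
    ref = env[0].copy()
    for _ in range(n_iter):
        for f in range(n_freq):
            perms[f] = _best_perm(ref, env[f])
        aligned = np.stack([env[f, perms[f]] for f in range(n_freq)])
        ref = aligned.mean(axis=0)

    # Stage 2 (local): sweep upward in frequency and re-check each bin against
    # the mean of its already aligned lower neighbors.
    for f in range(1, n_freq):
        lo = max(0, f - n_neighbors)
        local_ref = np.mean([env[g, perms[g]] for g in range(lo, f)], axis=0)
        perms[f] = _best_perm(local_ref, env[f])

    return perms
```

Given separated time-frequency outputs `Y` of shape `(n_freq, n_src, n_frames)`, one would call `perms = align_permutations(np.abs(Y))` and reorder each bin as `Y[f, perms[f]]`. The resulting ordering is consistent across frequency bins only up to one global permutation of the sources.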

Abstract

Ministerio de Ciencia e Innovación (España) TEC2011-23559

Additional details

Created:
March 25, 2023
Modified:
December 1, 2023