Published May 12, 2019 | Version v1
Conference paper

Speech enhancement with variational autoencoders and alpha-stable distributions

Contributors

Others:

Description

his paper focuses on single-channel semi-supervised speech en-hancement. We learn a speaker-independent deep generative speechmodel using the framework of variational autoencoders. The noisemodel remains unsupervised because we do not assume prior knowl-edge of the noisy recording environment. In this context, our con-tribution is to propose a noise model based on alpha-stable distribu-tions, instead of the more conventional Gaussian non-negative ma-trix factorization approach found in previous studies. We develop aMonte Carlo expectation-maximization algorithm for estimating themodel parameters at test time. Experimental results show the supe-riority of the proposed approach both in terms of perceptual qualityand intelligibility of the enhanced speech signal.

Abstract

International audience

Additional details

Identifiers

URL
https://inria.hal.science/hal-02005106
URN
urn:oai:HAL:hal-02005106v1

Origin repository

Origin repository
UNICA