Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS

Q Liu; W Wang; PJB Jackson

doi:10.1007/978-3-642-15995-4_17

Back

Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS

Conference presentation

Open access

Peer reviewed

Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS

Q Liu, W Wang and PJB Jackson

Lecture Notes in Computer Science (LNCS 6365), Vol.6365/2, pp.131-139

9th International Conference on Latent Variable Analysis and Signal Separation (formerly the International Conference on Independent Component Analysis and Signal Separation) (St. Malo, France, 27/09/2010 - 30/09/2010)

27/09/2010

DOI: https://doi.org/10.1007/978-3-642-15995-4_17

Abstract

Recent studies show that visual information contained in visual speech can be helpful for the performance enhancement of audio-only blind source separation (BSS) algorithms. Such information is exploited through the statistical characterisation of the coherence between the audio and visual speech using, e.g. a Gaussian mixture model (GMM). In this paper, we present two new contributions. An adapted expectation maximization (AEM) algorithm is proposed in the training process to model the audio-visual coherence upon the extracted features. The coherence is exploited to solve the permutation problem in the frequency domain using a new sorting scheme. We test our algorithm on the XM2VTS multimodal database. The experimental results show that our proposed algorithm outperforms traditional audio-only BSS.

Files and links (2)

pdf

LiuWangJackson_LVA10640.01 kBDownload View

TextSRIDA, Open Access

url

http://dx.doi.org/10.1007/978-3-642-15995-4_17View

Published (Version of record)

Metrics

304 File views/ downloads

43 Record Views

Details

Title: Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS
Creators: Q Liu
W Wang
PJB Jackson
Contributors: Springer (null)
Publication Details: Lecture Notes in Computer Science (LNCS 6365), Vol.6365/2, pp.131-139
Conference: 9th International Conference on Latent Variable Analysis and Signal Separation (formerly the International Conference on Independent Component Analysis and Signal Separation) (St. Malo, France, 27/09/2010 - 30/09/2010)
Date published: 27/09/2010
Date submitted: 14/11/2011
Identifiers: 99512968602346
Academic Unit: School of Computer Science and Electronic Engineering
Resource Type: Conference presentation

Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS

Abstract

Files and links (2)

Metrics

Details

Usage Policy