Multimodal speech separation

B Rivet; J Chambers

doi:10.1007/978-3-642-11509-7_1

Back

Journal article

Peer reviewed

Multimodal speech separation

B Rivet and J Chambers

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.5933 L, pp.1-11

30/04/2010

DOI: https://doi.org/10.1007/978-3-642-11509-7_1

Abstract

The work of Bernstein and Benoît has confirmed that it is advantageous to use multiple senses, for example to employ both audio and visual modalities, in speech perception. As a consequence, looking at the speaker's face can be useful to better hear a speech signal in a noisy environment and to extract it from competing sources, as originally identified by Cherry, who posed the so-called "Cocktail Party" problem. To exploit the intrinsic coherence between audition and vision within a machine, the method of blind source separation (BSS) is particularly attractive. © 2010 Springer-Verlag.

Metrics

12 Record Views

Details

Title: Multimodal speech separation
Creators: B Rivet
J Chambers
Publication Details: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol.5933 L, pp.1-11
Date published: 30/04/2010
Date submitted: 17/05/2017
Identifiers: 99514297802346
Academic Unit: University of Surrey
Resource Type: Journal article

Multimodal speech separation

Abstract

Metrics

Details

Usage Policy