Improving the perceptual quality of ideal binary masked speech

L Lightburn; Enzo De Sena; A Moore; PA Naylor; M Brookes

doi:10.1109/ICASSP.2017.7952238

Back

Improving the perceptual quality of ideal binary masked speech

Conference presentation

Peer reviewed

Improving the perceptual quality of ideal binary masked speech

L Lightburn, Enzo De Sena, A Moore, PA Naylor and M Brookes

Proceedings of ICASSP 2017

IEEE

ICASSP 2017 - 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (New Orleans, USA, 05/03/2017 - 09/03/2017)

19/06/2017

DOI: https://doi.org/10.1109/ICASSP.2017.7952238

Abstract

— Binary mask

speech quality

speech intelligibility

speech enhancement

speech presence probability

Music and Media

It is known that applying a time-frequency binary mask to very noisy speech can improve its intelligibility but results in poor perceptual quality. In this paper we propose a new approach to applying a binary mask that combines the intelligibility gains of conventional binary masking with the perceptual quality gains of a classical speech enhancer. The binary mask is not applied directly as a time-frequency gain as in most previous studies. Instead, the mask is used to supply prior information to a classical speech enhancer about the probability of speech presence in different time-frequency regions. Using an oracle ideal binary mask, we show that the proposed method results in a higher predicted quality than other methods of applying a binary mask whilst preserving the improvements in predicted intelligibility.

Files and links (2)

url

http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000002View

url

http://www.ieee-icassp2017.org/View

Organisation

Metrics

21 Record Views

Details

Title: Improving the perceptual quality of ideal binary masked speech
Creators: L Lightburn
Enzo De Sena
A Moore
PA Naylor
M Brookes
Contributors: IEEE
Publication Details: Proceedings of ICASSP 2017
Conference: ICASSP 2017 - 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (New Orleans, USA, 05/03/2017 - 09/03/2017)
Publisher: IEEE
Date published: 19/06/2017
Date submitted: 18/01/2017
Identifiers: 99516172402346
Copyright: (c) 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Academic Unit: Department of Music and Media
Resource Type: Conference presentation

Improving the perceptual quality of ideal binary masked speech

Abstract

Files and links (2)

Metrics

Details

Usage Policy