A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions

Qingju Liu; Wenwu Wang; Philip Jackson; Yan Tang

Back

Conference presentation

Open access

Peer reviewed

A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions

Qingju Liu, Wenwu Wang, Philip Jackson and Yan Tang

Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO)

2017 25th European Signal Processing Conference (EUSIPCO) (Kos Island, Greece, 02/09/2017)

02/09/2017

Abstract

Deep neural networks (DNN) have recently been shown to give state-of-the-art performance in monaural speech enhancement. However in the DNN training process, the perceptual difference between different components of the DNN output is not fully exploited, where equal importance is often assumed. To address this limitation, we have proposed a new perceptually-weighted objective function within a feedforward DNN framework, aiming to minimize the perceptual difference between the enhanced speech and the target speech. A perceptual weight is integrated into the proposed objective function, and has been tested on two types of output features: spectra and ideal ratio masks. Objective evaluations for both speech quality and speech intelligibility have been performed. Integration of our perceptual weight shows consistent improvement on several noise levels and a variety of different noise types.

Files and links (2)

pdf

A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions1.08 MBDownload View

Open Access

url

https://www.eusipco2017.org/View

Published (Version of record)

Metrics

321 File views/ downloads

91 Record Views

Details

Title: A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions
Creators: Qingju Liu
Wenwu Wang
Philip Jackson
Yan Tang
Publication Details: Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO)
Conference: 2017 25th European Signal Processing Conference (EUSIPCO) (Kos Island, Greece, 02/09/2017)
Date published: 02/09/2017
Date submitted: 26/07/2017
Grant note: Funder: Engineering and Physical Sciences Research Council (EPSRC) | Grant Title: Grant S3A: Future Spatial Audio for an Immersive Listener Experience at Home | Grant ID: EP/L000539/1
Identifiers: 99515002702346
Academic Unit: School of Computer Science and Electronic Engineering
Resource Type: Conference presentation

A Perceptually-Weighted Deep Neural Network for Monaural Speech Enhancement in Various Background Noise Conditions

Abstract

Files and links (2)

Metrics

Details

Usage Policy