Iterative deep neural networks for speaker-independent binaural blind speech separation

Qingju Liu; Yong Xu; Philip Jackson; Wenwu Wang; Philip Coleman

doi:10.1109/ICASSP.2018.8462603

Back

Iterative deep neural networks for speaker-independent binaural blind speech separation

Conference proceeding

Open access

Peer reviewed

Iterative deep neural networks for speaker-independent binaural blind speech separation

Qingju Liu, Yong Xu, Philip Jackson, Wenwu Wang and Philip Coleman

ICASSP 2018 Proceedings

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (Calgary, Alberta, Canada, 15/04/2018 - 20/04/2018)

2018

DOI: https://doi.org/10.1109/ICASSP.2018.8462603

Abstract

Deep neural network

binaural blind speech separation

spectral and spatial

iterative DNN

In this paper, we propose an iterative deep neural network (DNN)-based binaural source separation scheme, for recovering two concurrent speech signals in a room environment. Besides the commonly-used spectral features, the DNN also takes non-linearly wrapped binaural spatial features as input, which are refined iteratively using parameters estimated from the DNN output via a feedback loop. Different DNN structures have been tested, including a classic multilayer perception regression architecture as well as a new hybrid network with both convolutional and densely-connected layers. Objective evaluations in terms of PESQ and STOI showed consistent improvement over baseline methods using traditional binaural features, especially when the hybrid DNN architecture was employed. In addition, our proposed scheme is robust to mismatches between the training and testing data.

Files and links (2)

pdf

2595LIU482.62 kBDownload View

Text Open Access

url

https://2018.ieeeicassp.org/View

Published (Version of record)

Metrics

553 File views/ downloads

71 Record Views

Details

Title: Iterative deep neural networks for speaker-independent binaural blind speech separation
Creators: Qingju Liu
Yong Xu
Philip Jackson
Wenwu Wang
Philip Coleman
Publication Details: ICASSP 2018 Proceedings
Conference: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (Calgary, Alberta, Canada, 15/04/2018 - 20/04/2018)
Publisher: IEEE
Date published: 2018
Date submitted: 13/04/2018
Grant note: Funder: EPSRC | Grant ID: EP/L000539/1
Identifiers: 99516313102346
Copyright: © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Academic Unit: School of Computer Science and Electronic Engineering
Resource Type: Conference proceeding

Iterative deep neural networks for speaker-independent binaural blind speech separation

Abstract

Files and links (2)

Metrics

Details

Usage Policy