Perceptual quality of audio separated using sigmoidal masks

T Stokes; C Hummersone; T Brookes; A Mason

Back

Perceptual quality of audio separated using sigmoidal masks

Conference presentation

Open access

Peer reviewed

Perceptual quality of audio separated using sigmoidal masks

T Stokes, C Hummersone, T Brookes and A Mason

137th Audio Engineering Society Convention 2014, pp.167-173

137th Audio Engineering Society Convention 2014 (Los Angeles, USA)

09/10/2014

Abstract

Media

Separation of underdetermined audio mixtures is often performed in the Time-Frequency (TF) domain by masking each TF element according to its target-to-mixture ratio. This work uses sigmoidal functions to map the target-to-mixture ratio to mask values. The series of functions used encompasses the ratio mask and an approximation of the binary mask. Mixtures are chosen to represent a range of different amounts of TF overlap, then separated and evaluated using objective measures. PEASS results show improved interferer suppression and artifact scores can be achieved using softer masking than that applied by binary or ratio masks. The improvement in these scores gives an improved overall perceptual score; this observation is repeated at multiple TF resolutions.

Files and links (1)

pdf

PERCEPTUAL QUALITY OF AUDIO SEPARATED USING SIGMOIDAL MASKS_1416.81 kBDownload View

Text Open Access

Metrics

55 File views/ downloads

40 Record Views

Details

Title: Perceptual quality of audio separated using sigmoidal masks
Creators: T Stokes
C Hummersone
T Brookes
A Mason
Publication Details: 137th Audio Engineering Society Convention 2014, pp.167-173
Conference: 137th Audio Engineering Society Convention 2014 (Los Angeles, USA)
Date published: 09/10/2014
Date submitted: 06/10/2015
Identifiers: 99515651502346
Copyright: © 2014 Audio Engineering Society. This paper was peer-reviewed as a complete manuscript for presentation at this Convention. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society.
Academic Unit: Department of Music and Media
Resource Type: Conference presentation

Perceptual quality of audio separated using sigmoidal masks

Abstract

Files and links (1)

Metrics

Details

Usage Policy