Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

Yang Liu; Yong Xu; Peipei Wu; Wenwu Wang

doi:10.1109/TMM.2023.3301221

Back

Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

Journal article

Open access

Peer reviewed

Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

Yang Liu, Yong Xu, Peipei Wu and Wenwu Wang

IEEE transactions on multimedia, pp.2544-2559

01/09/2023

DOI: https://doi.org/10.1109/TMM.2023.3301221

Abstract

Atmospheric measurements

Audio-Visual Tracking

Filtering algorithms

Information filters

Par-ticle Flow

Particle measurements

Radio frequency

SMC-PHD Filter

Target tracking

Visualization

Particle flow (PF) is a method originally proposed for single target tracking, and used recently to address the weight degeneracy problem of the sequential Monte Carlo probability hypothesis density (SMC-PHD) filter for audio-visual (AV) multi-speaker tracking, where the particle flow is calculated by using only the measurements near the particle, assuming that the target is detected, as in a recent method based on non-zero particle flow (NPF), i.e. the AV-NPF-SMC-PHD filter. This, however, can be problematic when occlusion happens and the occluded speaker may not be detected. To address this issue, we propose a new method where the labels of the particles are estimated using the likelihood function, and the particle flow is calculated in terms of the selected particles with the same labels. As a result, the particles associated with detected speakers and undetected speakers are distinguished based on the particle labels. With this novel method, named as AV-LPF-SMC-PHD, the speaker states can be estimated as the weighted mean of the labelled particles, which is computationally more efficient than using a clustering method as in the AV-NPF-SMC-PHD filter. The proposed algorithm is compared systematically with several baseline tracking methods using the AV16.3, AVDIAR and CLEAR datasets, and is shown to offer improved tracking accuracy with a lower computational cost.

Files and links (1)

pdf

Liu_etal_TMM_2023 (1)14.63 MBDownload View

Author's Accepted Manuscript Open Access

Metrics

15 Record Views

Details

Title: Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
Creators: Yang Liu - Seattle University
Yong Xu
Peipei Wu - University of Surrey
Wenwu Wang - University of Surrey
Publication Details: IEEE transactions on multimedia, pp.2544-2559
Publisher: IEEE
Date published: 01/09/2023
Date accepted: 10/07/2023
Grant note: SIGNetS
Identifiers: 99822202902346
Copyright: © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.”
Academic Unit: School of Computer Science and Electronic Engineering
Language: English
Resource Type: Journal article

Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking

Abstract

Files and links (1)

Metrics

Details

Usage Policy