Logo image
Open Research University homepage
Surrey researchers Sign in
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization
Journal article   Open access   Peer reviewed

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Davide Berghi and Philip J. B. Jackson
IEEE/ACM transactions on audio, speech, and language processing, Vol.32, pp.1-12
25/12/2023

Abstract

active speaker detection and localization Faces Feature extraction Location awareness microphone array multichannel selfsupervised learning Speech processing Task analysis Training Visualization
pdf
Leveraging Visual Supervision - AAM2.01 MBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access

Metrics

Details

Logo image

Usage Policy