Surrey researchers Sign in
Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization
Journal article   Open access  Peer reviewed

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Davide Berghi and Philip J. B. Jackson
IEEE/ACM transactions on audio, speech, and language processing, Vol.32, pp.1-12
25/12/2023

Abstract

active speaker detection and localization Faces Feature extraction Location awareness microphone array multichannel selfsupervised learning Speech processing Task analysis Training Visualization
pdf
Leveraging Visual Supervision - AAM2.01 MBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access

Metrics

3 File views/ downloads
24 Record Views

Details

Usage Policy