Logo image
Open Research University homepage
Surrey researchers Sign in
Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection
Conference proceeding   Open access

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Davide Berghi, Peipei Wu, Jinzheng Zhao, Wenwu Wang and Philip J. B. Jackson
Proceedings of the ICASSP 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
ICASSP 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024) (14/04/2024–19/04/2024)
18/03/2024

Abstract

microphone array 360 video sound event localization and detection audio-visual fusion cross-modal attention
pdf
Fusion of Audio and Visual Embeddings - AAM292.66 kBDownloadView
Author's Accepted Manuscript Open Access
url
https://2024.ieeeicassp.org/View
Conference website

Metrics

49 File views/ downloads
139 Record Views

Details

Logo image

Usage Policy