Logo image
Open Research University homepage
Surrey researchers Sign in
Integrating Spatial and Semantic Embeddings for Stereo Sound Event Localization in Videos
Conference paper   Open access

Integrating Spatial and Semantic Embeddings for Stereo Sound Event Localization in Videos

Davide Berghi and Philip J B Jackson
Proceedings of the 10th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2025)
Proceedings - DCASE, DCASE
10th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2025) (Barcelona, Spain, 29/10/2025–31/10/2025)
2025

Abstract

Sound Event Localization and Detection Stereo Sounds Audio-Visual Machine Learning Multimodal Localization Audio Understanding
pdf
DCASE2025_Workshop_CameraReady5.00 MBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access
url
https://dcase.community/workshop2025/indexView
Event WebsiteConference website

Metrics

1 Record Views

Details

Logo image

Usage Policy