Logo image
Open Research University homepage
Surrey researchers Sign in
PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection
Journal article   Open access   Peer reviewed

PSELDNets: Pre-trained Neural Networks on a Large-scale Synthetic Dataset for Sound Event Localization and Detection

Jinbo Hu, Yin Cao, Ming Wu, Fang Kang, Feiran Yang, Wenwu Wang, Mark D. Plumbley and Jun Yang
IEEE Transactions on Audio, Speech and Language Processing, Vol.33, pp.2845 -2860
08/07/2025

Abstract

Adaptation models Computational modeling data-efficient fine-tuning Foundation models Location awareness Ontologies pre-trained SELD networks Sound event localization and detection (SELD) Spectrogram Synthetic data Training Transformers Acoustics
pdf
TASLPRO35874462.34 MBDownloadView
Author's Accepted Manuscript Open Access

Metrics

4 Record Views

Details

Logo image

Usage Policy