Logo image
Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
Journal article   Peer reviewed

Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation

Davide Berghi and Philip J. B. Jackson
IEEE Signal Processing Letters, Vol.In Press(In Press)
14/04/2026

Abstract

Sound Event Localization and Detection Sound Source Localization Reverberation machine listening sound event localisation and detection Distance Estimation

Sound event localization and detection (SELD) involves predicting active sound event classes over time while estimating their positions. The localization subtask in SELD is usually treated as a direction of arrival estimation problem, ignoring source distance. Only recently, SELD was extended to 3D by incorporating distance estimation, enabling the prediction of sound event positions in 3D space (3D SELD). However, existing methods lack input features specifically designed for distance estimation. We address this gap by introducing two novel reverberation-based feature formats: one using the direct-to-reverberant ratio (DRR) and another leveraging signal autocorrelation to capture early reflections. We extensively evaluate and benchmark these features on the STARSS23 dataset, combining them with established SELD features for sound event detection (SED) and direction-of-arrival estimation (DOAE), and testing across different network architectures. Our proposed features, applicable to both FOA and MIC formats, achieve state-of-the-art distance estimation, enhancing overall 3D SELD performance.

pdf
camera_readySPL2025522.88 kB
Author's Accepted Manuscript Restricted. Access maybe granted on request., This file will be open access upon publication. CC BY V4.0
url
https://github.com/dberghi/SELD-distance-featuresView
GitHub repository containing the feature extraction code

Metrics

1 Record Views

Details

Logo image

Usage Policy