Abstract
As a multi-label classification task, audio tagging aims to predict the presence or absence of certain sound events in an audio recording. Existing works on audio tagging do not explicitly consider the probabilities of co-occurrence between sound events, which we term label dependencies in this study. To address this issue, we propose to model the label dependencies via a graph-based method, where each node of the graph represents a label. An adjacency matrix is constructed by mining the statistical relations between labels to represent the graph structure, and a graph convolutional network (GCN) is employed to learn node representations by propagating information between neighboring nodes based on the adjacency matrix, thereby implicitly modeling the label dependencies. The generated node representations are then applied to the acoustic representations for classification. Experiments on Audioset show that our method achieves a state-of-the-art mean average precision (mAP) of 0.434.
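The pipeline sketched in the abstract — mine a label co-occurrence adjacency matrix, propagate label embeddings through a GCN, then use the resulting node representations as classifiers over acoustic embeddings — can be illustrated with a minimal toy example. This sketch uses NumPy in place of a deep-learning framework; all shapes, the binarization threshold, the single GCN layer, and the random (untrained) weights are illustrative assumptions, not details from the paper.

```python
import numpy as np

# Toy multi-hot labels for 3 clips over 4 sound-event classes (hypothetical data).
Y = np.array([[1, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1]], dtype=float)   # shape: (clips, labels)

# Mine statistical relations: A[i, j] estimates P(label j | label i).
cooc = Y.T @ Y                               # raw co-occurrence counts
counts = np.diag(cooc).copy()                # per-label occurrence counts
A = cooc / np.maximum(counts[:, None], 1.0)  # conditional probabilities
A = (A >= 0.5).astype(float)                 # binarize (0.5 is an assumed threshold)

# Symmetric normalization: A_hat = D^{-1/2} (A + I) D^{-1/2}.
A_tilde = A + np.eye(A.shape[0])
d = A_tilde.sum(axis=1)
D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
A_hat = D_inv_sqrt @ A_tilde @ D_inv_sqrt

# One GCN layer: node_reps = ReLU(A_hat H W), propagating information
# between neighboring label nodes.
rng = np.random.default_rng(0)
H = rng.normal(size=(4, 8))                  # initial node (label) features
W = rng.normal(size=(8, 16))                 # layer weights (random, untrained)
node_reps = np.maximum(A_hat @ H @ W, 0.0)   # shape: (labels, dim)

# Apply node representations to acoustic embeddings for classification:
# each label's representation acts as a classifier over the audio features.
x = rng.normal(size=(2, 16))                 # acoustic embeddings for 2 clips
logits = x @ node_reps.T                     # per-label scores, shape (2, 4)
print(logits.shape)                          # → (2, 4)
```

In a real system the GCN weights and the initial node features (e.g. word embeddings of the label names) would be trained jointly with the audio encoder; the key point shown here is that the adjacency matrix lets each label's classifier absorb information from statistically related labels.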