DCASE 2018 Challenge Surrey Cross-task convolutional neural network baseline

Qiuqiang Kong; Turab Iqbal; Yong Xu; Wenwu Wang; Mark D Plumbley

doi:10.48550/arXiv.1808.00773

Back

DCASE 2018 Challenge Surrey Cross-task convolutional neural network baseline

Conference proceeding

Open access

DCASE 2018 Challenge Surrey Cross-task convolutional neural network baseline

Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang and Mark D Plumbley

DCASE2018 Workshop

DCASE2018 Workshop on Detection and Classification of Acoustic Scenes and Events (Surrey, UK, 19/11/2018 - 20/11/2018)

2018

DOI: https://doi.org/10.48550/arXiv.1808.00773

Abstract

DCASE 2018 challenge

convolutional neural networks

open source

The Detection and Classiﬁcation of Acoustic Scenes and Events (DCASE) consists of ﬁve audio classiﬁcation and sound event detectiontasks: 1)Acousticsceneclassiﬁcation,2)General-purposeaudio tagging of Freesound, 3) Bird audio detection, 4) Weakly-labeled semi-supervised sound event detection and 5) Multi-channel audio classiﬁcation. In this paper, we create a cross-task baseline system for all ﬁve tasks based on a convlutional neural network (CNN): a “CNN Baseline” system. We implemented CNNs with 4 layers and 8 layers originating from AlexNet and VGG from computer vision. We investigated how the performance varies from task to task with the same conﬁguration of neural networks. Experiments show that deeper CNN with 8 layers performs better than CNN with 4 layers on all tasks except Task 1. Using CNN with 8 layers, we achieve an accuracy of 0.680 on Task 1, an accuracy of 0.895 and a mean average precision (MAP) of 0.928 on Task 2, an accuracy of 0.751 andanareaunderthecurve(AUC)of0.854onTask3,asoundevent detectionF1scoreof20.8%onTask4,andanF1scoreof87.75%on Task 5. We released the Python source code of the baseline systems under the MIT license for further research.

Files and links (2)

pdf

DCASE 2018 CHALLENGE SURREY CROSS-TASK CONVOLUTIONAL NEURAL NETWORK BASELINE162.06 kBDownload View

Text Open Access

url

http://dcase.community/workshop2018/View

Published (Version of record)

Metrics

42 File views/ downloads

45 Record Views

Details

Title: DCASE 2018 Challenge Surrey Cross-task convolutional neural network baseline
Creators: Qiuqiang Kong
Turab Iqbal
Yong Xu
Wenwu Wang
Mark D Plumbley
Publication Details: DCASE2018 Workshop
Conference: DCASE2018 Workshop on Detection and Classification of Acoustic Scenes and Events (Surrey, UK, 19/11/2018 - 20/11/2018)
Date published: 2018
Date submitted: 09/10/2018
Identifiers: 99515283702346
Academic Unit: School of Computer Science and Electronic Engineering
Resource Type: Conference proceeding

DCASE 2018 Challenge Surrey Cross-task convolutional neural network baseline

Abstract

Files and links (2)

Metrics

Details

Usage Policy