Surrey researchers Sign in
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification
Journal article   Open access   Peer reviewed

ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification

IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol.32, pp.3684-3693
15/07/2024

Abstract

Audio Classification Audio Spectrogram Computational modeling Context modeling Group Masked Model Learning Image reconstruction Self-supervised Learning Similarity learning Spectrogram Task analysis Transformers Vision Transformers
pdf
ASLP_ASiT_CameraReady526.67 kBDownloadView
Author's Accepted Manuscript Open Access

Metrics

2 File views/ downloads
6 Record Views

Details

Usage Policy