Abstract—Convolutional neural networks (CNNs) have shown state-of-the-art performance in various applications. However, CNNs are resource-hungry due to their high computational complexity and large memory storage requirements. Recent efforts toward achieving computational and memory efficiency in CNNs involve filter pruning methods, which eliminate some of the filters in a CNN based on the "importance" of the filters. The majority of existing filter pruning methods are either "active", which use a dataset and generate feature maps to quantify filter importance, or "passive", which compute filter importance using the entry-wise norm of the filters or by measuring similarity among filters without involving data. However, existing passive filter pruning methods eliminate filters with relatively smaller norms or similar filters without considering the significance of the filters in producing the node output, resulting in performance degradation. To address this, we present a passive filter pruning method in which the least significant filters, those with a relatively smaller contribution to producing the output, are pruned away by incorporating the operator norm of the filters. The proposed pruning method achieves better performance across various CNNs compared to existing passive filter pruning methods, and is more efficient than existing active filter pruning methods while achieving similar performance. The efficacy of the proposed pruning method is evaluated on audio scene classification and audio tagging tasks using various CNN architectures such as VGGish, DCASE21 Net and PANNs. The proposed pruning method reduces the number of computations and parameters of the unpruned CNNs by at least 40% and 50% respectively, improving inference latency while maintaining performance similar to that of the unpruned CNNs.
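As a rough illustration of the idea described above, the sketch below scores each convolutional filter by its operator norm (its largest singular value, taken after flattening the filter into a 2-D matrix) and selects the lowest-scoring filters for pruning. This is a minimal sketch, not the authors' implementation: the `(n_filters, in_ch, kh, kw)` weight layout, the flattening choice, and the function names `operator_norm_scores` and `prune_indices` are all assumptions made for illustration.

```python
import numpy as np

def operator_norm_scores(weights):
    """Score each filter by its operator (spectral) norm.

    weights: array of shape (n_filters, in_ch, kh, kw) -- an assumed
    layout for a convolutional layer's weight tensor.
    """
    n_filters = weights.shape[0]
    scores = np.empty(n_filters)
    for i in range(n_filters):
        # Flatten each filter to a 2-D matrix (one illustrative choice);
        # the operator norm is then its largest singular value.
        mat = weights[i].reshape(weights.shape[1], -1)
        scores[i] = np.linalg.norm(mat, 2)
    return scores

def prune_indices(weights, ratio=0.5):
    """Return indices of the least significant filters to prune,
    i.e. those with the smallest operator norms."""
    scores = operator_norm_scores(weights)
    k = int(len(scores) * ratio)
    return np.argsort(scores)[:k]
```

In a full pipeline, the returned indices would be used to remove the corresponding filters (and the matching input channels of the next layer), after which the smaller network is typically fine-tuned to recover accuracy.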