2D progressive fusion module for action recognition

Zhongwei Shen; Xiao-Jun Wu; Josef Kittler

doi:10.1016/j.imavis.2021.104122

Journal article

Peer reviewed

Image and vision computing, Vol.109, p.104122

31/05/2021

DOI: https://doi.org/10.1016/j.imavis.2021.104122

Abstract

Keywords: 2D CNN; action recognition; convergence; spatiotemporal modeling

Network convergence and recognition accuracy are both essential when applying Convolutional Neural Networks (CNNs) to human action recognition. Most deep learning methods neglect model convergence when striving to improve abstraction capability, so their performance degrades sharply when computing resources are limited. To mitigate this problem, we propose a structure named the 2D Progressive Fusion (2DPF) module, which is inserted after the 2D backbone CNN layers. 2DPF fuses features through a novel 2D convolution over the spatial and temporal dimensions, called variation attenuating convolution, and applies fusion techniques to improve both recognition accuracy and convergence. Experiments on several benchmarks (e.g., Something-Something V1 & V2, Kinetics400 & 600, AViD, UCF101) demonstrate the effectiveness of the proposed method.
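The abstract describes a 2D convolution that runs over a temporal and a spatial dimension of backbone features. The following is a minimal sketch of that general idea only, not the paper's variation attenuating convolution, whose definition this record does not give. It assumes single-channel features indexed as feat[t][h][w]; the function names, the toy feature values and the frame-averaging kernel are all hypothetical placeholders chosen for illustration.

```python
def conv2d_same(plane, kernel):
    """Zero-padded, same-size 2D cross-correlation on a list-of-lists plane."""
    rows, cols = len(plane), len(plane[0])
    kr, kc = len(kernel), len(kernel[0])
    pr, pc = kr // 2, kc // 2  # padding that keeps the output size unchanged
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = 0.0
            for a in range(kr):
                for b in range(kc):
                    y, x = i + a - pr, j + b - pc
                    if 0 <= y < rows and 0 <= x < cols:  # outside counts as zero
                        acc += kernel[a][b] * plane[y][x]
            out[i][j] = acc
    return out


def temporal_spatial_conv(feat, kernel):
    """Apply `kernel` over the (T, W) plane of feat[t][h][w], once per row h."""
    T, H, W = len(feat), len(feat[0]), len(feat[0][0])
    out = [[[0.0] * W for _ in range(H)] for _ in range(T)]
    for h in range(H):
        # Slice out the time-by-width plane for this row of the feature map.
        plane = [[feat[t][h][w] for w in range(W)] for t in range(T)]
        mixed = conv2d_same(plane, kernel)
        for t in range(T):
            for w in range(W):
                out[t][h][w] = mixed[t][w]
    return out


# Hypothetical per-frame features: T=4 frames, H=2, W=3; frame t holds value t.
feat = [[[float(t) for _ in range(3)] for _ in range(2)] for t in range(4)]
# Placeholder 3x1 kernel (time x width) averaging three neighbouring frames.
kernel = [[1 / 3], [1 / 3], [1 / 3]]
fused = temporal_spatial_conv(feat, kernel)
```

In this sketch each output frame mixes information from its temporal neighbours while the tensor keeps its shape, which is why such a layer could be placed after per-frame 2D backbone layers. A practical version would use learned multi-channel kernels in a deep learning framework.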

Metrics

11 Record Views

5 Times Cited - Web of Science

Details

Title: 2D progressive fusion module for action recognition
Creators: Zhongwei Shen (Jiangnan University); Xiao-Jun Wu (Jiangnan University); Josef Kittler (University of Surrey)
Publication Details: Image and vision computing, Vol.109, p.104122
Publisher: Elsevier B.V.
Date published: 31/05/2021
Grant note: EP/N007743/1 / EPSRC Programme; U1836218, 61672265, 61902153 / National Natural Science Foundation of China; B12018 / 111 Project of Ministry of Education of China; EP/R013616/1 / EPSRC/MURI/Dstl
Identifiers: 99771848502346
Academic Unit: School of Computer Science and Electronic Engineering
Language: English
Resource Type: Journal article
