Generative Data Augmentation for Skeleton Action Recognition

Xu Dong; Wanqing Li; Femi Adeyemi-Ejeye; Andrew Gilbert

Back

Generative Data Augmentation for Skeleton Action Recognition

Conference proceeding

Peer reviewed

Generative Data Augmentation for Skeleton Action Recognition

Xu Dong, Wanqing Li, Femi Adeyemi-Ejeye and Andrew Gilbert

20th IEEE International Conference on Automatic Face and Gesture Recognition

IEEE International Conference on Automatic Face and Gesture Recognition, 20th (Kyoto, Japan, 25/05/2026–29/05/2026)

02/04/2026

Abstract

Skeleton-based human action recognition is a powerful approach for understanding human behaviour from pose data, but collecting large-scale, diverse, and well-annotated 3D skeleton datasets is both expensive and labor-intensive. To address this challenge, we propose a conditional generative pipeline for data augmentation in skeleton action recognition. Our method learns the distribution of real skeleton sequences under the constraint of action labels, enabling the synthesis of diverse and high-fidelity data. Even with limited training samples, it can effectively generate skeleton sequences and achieve competitive recognition performance in low-data scenarios, demonstrating strong generalisation in downstream tasks. Specifically, we introduce a Transformer-based encoder–decoder architecture, combined with a generative refinement module and a dropout mechanism, to balance fidelity and diversity during sampling. Experiments on Hu-manAct12 and the refined NTU-RGBD (NTU-VIBE) dataset show that our approach consistently improves the accuracy of multiple skeleton-based action recognition models, validating its effectiveness in both few-shot and full-data settings. The source code can be found at here.

Files and links (1)

pdf

0173.13 MB

Author's Accepted Manuscript Embargoed Access, Embargo ends: 25/05/2026

Metrics

1 Record Views

Details

Title: Generative Data Augmentation for Skeleton Action Recognition
Creators: Xu Dong - University of Surrey, Music and Media
Wanqing Li (null)
Femi Adeyemi-Ejeye (null) - University of Surrey, Music and Media
Andrew Gilbert (null) - University of Surrey, Music and Media
Publication Details: 20th IEEE International Conference on Automatic Face and Gesture Recognition
Conference: IEEE International Conference on Automatic Face and Gesture Recognition, 20th (Kyoto, Japan, 25/05/2026–29/05/2026)
Publisher: IEEE
Date accepted for publication: 02/04/2026
Identifiers: 991127995502346
Academic Unit: Music and Media
Language: English
Resource Type: Conference proceeding

Generative Data Augmentation for Skeleton Action Recognition

Abstract

Files and links (1)

Metrics

Details

Usage Policy