Using Sign Language Production as Data Augmentation to enhance Sign Language Translation
Conference proceeding · Peer reviewed


IVA Adjunct '25: Adjunct Proceedings of the 25th ACM International Conference on Intelligent Virtual Agents, pp. 1-10
25th ACM International Conference on Intelligent Virtual Agents (Berlin, Germany)
30/09/2025

Abstract

Keywords: Computing methodologies · Sign Language Translation · Data Augmentation · Sign Language Production · Generative Models · Computer Vision
Machine learning models fundamentally rely on large quantities of high-quality data. Collecting the necessary data for these models can be challenging due to cost, scarcity, and privacy restrictions. Signed languages are visual languages used by the deaf community and are considered low-resource languages. Sign language datasets are often orders of magnitude smaller than their spoken language counterparts. Sign Language Production (SLP) is the task of generating sign language videos from spoken language sentences, while Sign Language Translation (SLT) is the reverse translation task. Here, we propose leveraging recent advancements in SLP to augment existing sign language datasets and enhance the performance of SLT models. For this, we utilize three techniques: a skeleton-based approach to production, sign stitching, and two photo-realistic generative models, SignGAN and SignSplat. We evaluate the effectiveness of these techniques in enhancing the performance of SLT models by generating variation in the signer's appearance and the motion of the skeletal data. Our results demonstrate that the proposed methods can effectively augment existing datasets and enhance the performance of SLT models by up to 19%, paving the way for more robust and accurate SLT systems, even in resource-constrained environments.
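The skeletal-motion augmentation the abstract describes could be sketched minimally as random perturbation of pose keypoint sequences. The function below is an illustrative assumption, not the authors' implementation: the name `augment_skeleton` and the specific transforms (global scaling plus per-joint Gaussian jitter) are hypothetical stand-ins for the paper's SLP-based generation.

```python
import numpy as np

def augment_skeleton(poses, noise_std=0.01, scale_range=(0.9, 1.1), seed=None):
    """Produce one augmented copy of a skeletal pose sequence.

    poses: array of shape (frames, joints, dims) of keypoint coordinates.
    Applies a random global scale (body-size variation) and per-joint
    Gaussian jitter (motion variation), returning a new array.
    """
    rng = np.random.default_rng(seed)
    scale = rng.uniform(*scale_range)                  # signer-size variation
    jitter = rng.normal(0.0, noise_std, poses.shape)   # per-joint motion noise
    return poses * scale + jitter
```

Generating several such copies per training sequence would expand a small sign language dataset with plausible motion variation, which is the general role SLP plays here before photo-realistic rendering.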


Has preprint
