Abstract
Sign Language Translation (SLT) aims to automatically convert visual sign language videos into spoken language text and vice versa.
While recent years have seen rapid progress, the true sources of performance improvements often remain unclear. Do reported gains come from methodological novelty, or from the choice of a different backbone, training optimizations, hyperparameter tuning, or even differences in how evaluation metrics are calculated? This paper presents a comprehensive study of recent gloss-free SLT models by re-implementing key contributions in a unified codebase. We ensure fair comparison by standardizing preprocessing, video encoders, and training setups across all methods. Our analysis shows that many of the performance gains reported in the literature diminish when models are evaluated under consistent conditions, suggesting that implementation details and evaluation setups play a significant role in determining results. We make the codebase publicly available here to support transparency and reproducibility in SLT research.