Abstract
In natural language processing (NLP) of spoken languages, word embeddings
have been shown to be a useful method for encoding the meaning of words. Sign
languages are visual languages and therefore require sign embeddings that
capture both the visual and linguistic semantics of signs. Unlike many common
approaches to sign recognition, we focus on explicitly creating sign embeddings that bridge the
gap between sign language and spoken language. We propose a learning framework
to derive Learnt Contrastive Concept (LCC) embeddings for sign language, a
weakly supervised contrastive approach to learning sign embeddings. We train a
vocabulary of embeddings based on the linguistic labels of sign videos.
Additionally, we develop a conceptual similarity loss which utilises word
embeddings from NLP methods to create sign embeddings with better
correspondence between sign language and spoken language. These learnt
representations allow the model to automatically localise signs in time. Our
approach achieves state-of-the-art keypoint-based sign recognition performance
on the WLASL and BOBSL datasets.