Abstract
Self-supervised learning has gained prominence due to its efficacy at learning powerful representations from unlabelled data that achieve excellent performance on many challenging downstream tasks. However, supervision-free pretext tasks are challenging to design and are usually modality-specific. Although there is a rich literature of self-supervised methods for either spatial (such as images) or temporal (such as sound or text) data modalities, a common pretext task that benefits both is largely missing. In this paper, we are interested in defining a self-supervised pretext task for sketches and handwriting data. This data is uniquely characterised by its existence in dual modalities of rasterized images and vector coordinate sequences. We address and exploit this dual representation by proposing two novel cross-modal translation pretext tasks for self-supervised feature learning: Vectorization and Rasterization. Vectorization learns to map image space to vector coordinates, and rasterization maps vector coordinates to image space. We show that our learned encoder modules benefit both raster-based and vector-based downstream approaches to analysing hand-drawn data. Empirical evidence shows that our novel pretext tasks surpass existing single- and multi-modal self-supervision methods.
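To make the two cross-modal translation pretext tasks concrete, the following is a minimal PyTorch sketch of how they could be set up: a vectorization module that encodes a raster image and decodes a stroke-coordinate sequence, and a rasterization module that encodes a stroke sequence and decodes an image. All module choices, layer sizes, and names here are illustrative assumptions, not the architecture used in the paper.

```python
# Illustrative sketch only: architectures and dimensions are assumptions,
# not the authors' exact design.
import torch
import torch.nn as nn

class Vectorization(nn.Module):
    """Pretext task 1: raster image -> vector coordinate sequence."""
    def __init__(self, feat_dim=256, point_dim=5, max_len=100):
        super().__init__()
        # CNN encoder over the raster image (the encoder reused for
        # raster-based downstream tasks)
        self.image_encoder = nn.Sequential(
            nn.Conv2d(1, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim),
        )
        # RNN decoder emitting one (dx, dy, pen-state) tuple per step
        self.decoder = nn.GRU(feat_dim, feat_dim, batch_first=True)
        self.head = nn.Linear(feat_dim, point_dim)
        self.max_len = max_len

    def forward(self, image):
        z = self.image_encoder(image)                      # (B, feat_dim)
        steps = z.unsqueeze(1).repeat(1, self.max_len, 1)  # feed z at every step
        out, _ = self.decoder(steps)
        return self.head(out)                              # (B, max_len, point_dim)

class Rasterization(nn.Module):
    """Pretext task 2: vector coordinate sequence -> raster image."""
    def __init__(self, point_dim=5, feat_dim=256):
        super().__init__()
        # RNN encoder over the stroke sequence (the encoder reused for
        # vector-based downstream tasks)
        self.seq_encoder = nn.GRU(point_dim, feat_dim, batch_first=True)
        # Up-convolutional decoder producing a 64x64 image
        self.decoder = nn.Sequential(
            nn.Linear(feat_dim, 64 * 8 * 8), nn.ReLU(),
            nn.Unflatten(1, (64, 8, 8)),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1), nn.Sigmoid(),
        )

    def forward(self, seq):
        _, h = self.seq_encoder(seq)        # h: (1, B, feat_dim)
        return self.decoder(h.squeeze(0))   # (B, 1, 64, 64)
```

In this reading, each pretext task is an encoder-decoder translation between the two modalities; after self-supervised training, the image encoder and the sequence encoder are the components carried over to downstream raster-based and vector-based tasks, respectively.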