Fast Text vs. Non-text Classification of Images

Jiri Kralicek; Jiri Matas

doi:10.1007/978-3-030-86337-1_2

Back

Fast Text vs. Non-text Classification of Images

Conference proceeding

Peer reviewed

Fast Text vs. Non-text Classification of Images

Jiri Kralicek and Jiri Matas

DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, Vol.12824, pp.18-32

Lecture Notes in Computer Science

01/01/2021

DOI: https://doi.org/10.1007/978-3-030-86337-1_2

Abstract

Computer Science, Artificial Intelligence

Computer Science, Software Engineering

Imaging Science & Photographic Technology

Science & Technology

Computer Science

Technology

We propose a fast method for classifying images as containing text, or with no scene text. The typical application is in processing large image streams, as encountered in social networks, for detection and recognition of scene text. The proposed classifier efficiently removes non-text images from consideration, thus allowing to apply the potentially computationally heavy scene text detection and OCR on only a fraction of the images. The proposed method, called Fast-Text-Classifier (FTC), utilizes a MobileNetV2 architecture as a feature extractor for fast inference. The text vs. non-text prediction is based on a block-level approach. FTC achieves 94.2% F-measure, 0.97 area under the ROC curve, and 74.8 ms and 8.6 ms inference times for CPU and GPU, respectively. A dataset of 1M images, automatically annotated with masks indicating text presence, is introduced and made public at http://cmp.felk.cvut.cz/data/twitter1M.

Metrics

19 Record Views

2 Times Cited - Web of Science

Details

Title: Fast Text vs. Non-text Classification of Images
Creators: Jiri Kralicek - Czech Technical University in Prague
Jiri Matas - Czech Technical University in Prague
Contributors: J Llados (Editor)
D Lopresti (Editor)
S Uchida (Editor)
Publication Details: DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, Vol.12824, pp.18-32
Series: Lecture Notes in Computer Science
Publisher: Springer Nature
Number of pages: 15
Date published: 01/01/2021
Grant note: CZ.02.1.01/-0.0/0.0/16 019/0000765 / Research Center for Informatics SGS20/171/OHK3/-3T/13 / CTU student grant
Identifiers: 99822313002346
Academic Unit: School of Computer Science and Electronic Engineering
Language: English
Resource Type: Conference proceeding

Fast Text vs. Non-text Classification of Images

Abstract

Metrics

Details

Usage Policy