Auxiliary Classifier based Residual RNN for Image Captioning

Ozkan Cayli; Volkan Kilic; Aytug Onan; Wenwu Wang

doi:10.23919/EUSIPCO55093.2022.9909624

Back

Conference proceeding

Auxiliary Classifier based Residual RNN for Image Captioning

Ozkan Cayli, Volkan Kilic, Aytug Onan and Wenwu Wang

2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), pp.1126-1130

European Signal Processing Conference

01/01/2022

DOI: https://doi.org/10.23919/EUSIPCO55093.2022.9909624

Abstract

Computer Science, Software Engineering

Engineering, Electrical & Electronic

Imaging Science & Photographic Technology

Science & Technology

Acoustics

Computer Science

Engineering

Technology

Telecommunications

Image captioning aims to generate a description of visual contents with natural language automatically. This is useful in several potential applications, such as image understanding and virtual assistants. With recent advances in deep neural networks, natural and semantic text generation has been improved in image captioning. However, maintaining the gradient flow between neurons in consecutive layers becomes challenging as the network gets deeper. In this paper, we propose to integrate an auxiliary classifier in the residual recurrent neural network, which enables the gradient flow to reach the bottom layers for enhanced caption generation. Experiments on the MSCOCO and VizWiz datasets demonstrate the advantage of our proposed approach over the state-of-the-art approaches in several performance metrics.

Metrics

28 Record Views

2 Times Cited - Web of Science

Details

Title: Auxiliary Classifier based Residual RNN for Image Captioning
Creators: Ozkan Cayli - Izmir Kâtip Çelebi University
Volkan Kilic - Izmir Kâtip Çelebi University
Aytug Onan - Izmir Kâtip Çelebi University
Wenwu Wang - University of Surrey, Centre for Vision, Speech & Signal Processing (CVSSP)
Publication Details: 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), pp.1126-1130
Series: European Signal Processing Conference
Publisher: IEEE
Number of pages: 5
Date published: 01/01/2022
Grant note: 120N995; 623805725 / Scientific and Technological Research Council of Turkey (TUBITAK)-British Council; Turkiye Bilimsel ve Teknolojik Arastirma Kurumu (TUBITAK) 2021-ODL-MUMF-0006 / scientific research projects coordination unit of Izmir Katip Celebi University; Izmir Katip Celebi University
Identifiers: 99822203002346
Academic Unit: School of Computer Science and Electronic Engineering
Language: English
Resource Type: Conference proceeding

Auxiliary Classifier based Residual RNN for Image Captioning

Abstract

Metrics

Details

Usage Policy