Finding the Right Words: Investigating Machine-Generated Video Description Quality Using a Corpus-Based Approach
Journal article   Open access  Peer reviewed


Sabine Braun and Kim Linda Starr
Journal of Audiovisual Translation
31/12/2019

Abstract

This paper examines first steps in identifying and compiling human-generated corpora for the purpose of determining the quality of computer-generated video descriptions. This is part of a study whose general ambition is to broaden the reach of accessible audiovisual content through semi-automation of its description for the benefit of both end-users (content consumers) and industry professionals (content creators). Working in parallel with machine-derived video and image description datasets created for the purposes of advancing computer vision research, such as Microsoft COCO (Lin et al., 2015) and TGIF (Li et al., 2016), we examine the usefulness of audio descriptive texts as a direct comparator. Cognisant of the limitations of this approach, we also explore alternative human-generated video description datasets, including bespoke content description. Our research forms part of the MeMAD (Methods for Managing Audiovisual Data) project, funded by the EU Horizon 2020 programme.

Keywords: computer vision, machine learning, accessibility, audiovisual content, audio description, content description, content retrieval, video description, audiovisual translation, MeMAD

URL: https://www.jatjournal.org/index.php/jat/article/view/103

Metrics

39 file views/downloads
64 record views
