Multimodal Information Fusion

N Poh; J Kittler

doi:10.1016/B978-0-12-374825-6.00017-4


Journal article

pp.153-169

01/12/2010

DOI: https://doi.org/10.1016/B978-0-12-374825-6.00017-4

Abstract

This chapter gives an overview of multimodal information fusion from the machine-learning perspective. Humans interact with each other using different modalities of communication, including speech, gestures, and documents. It is therefore natural that human-computer interaction (HCI) should facilitate the same multimodal form of communication. To capture this information, one uses different types of sensors: for example, microphones to capture the audio signal, cameras to capture live video images, and 3D sensors to directly capture surface information in real time. In each of these cases, commercial off-the-shelf (COTS) devices are already available and can be readily deployed for HCI applications. Examples of HCI applications include audio-visual speech recognition, gesture recognition, emotion recognition, and person recognition using biometrics. © 2010 Elsevier Ltd. All rights reserved.

Metrics

17 Record Views

11 Times Cited - Web of Science

Details

Title: Multimodal Information Fusion
Creators: N Poh; J Kittler
Publication Details: pp.153-169
Date published: 01/12/2010
Date submitted: 17/05/2017
Identifiers: 99512832802346
Academic Unit: University of Surrey
Resource Type: Journal article
