Logo image
Large-scale graph indexing using binary embeddings of node contexts for information spotting in document image databases
Journal article   Peer reviewed

Large-scale graph indexing using binary embeddings of node contexts for information spotting in document image databases

Pau Riba, Josep Llados, Alicia Fornes and Anjan Dutta
Pattern recognition letters, Vol.87, pp.203-211
01/02/2017

Abstract

Computer Science Computer Science, Artificial Intelligence Science & Technology Technology
Graph-based representations are experiencing a growing usage in visual recognition and retrieval due to their representational power in front of classical appearance-based representations. However, retrieving a query graph from a large dataset of graphs implies a high computational complexity. The most important property for a large-scale retrieval is the search time complexity to be sub-linear in the number of database examples. With this aim, in this paper we propose a graph indexation formalism applied to visual retrieval. A binary embedding is defined as hashing keys for graph nodes. Given a database of labeled graphs, graph nodes are complemented with vectors of attributes representing their local context. Then, each attribute vector is converted to a binary code applying a binary-valued hash function. Therefore, graph retrieval is formulated in terms of finding target graphs in the database whose nodes have a small Hamming distance from the query nodes, easily computed with bitwise logical operators. As an application example, we validate the performance of the proposed methods in different real scenarios such as handwritten word spotting in images of historical documents or symbol spotting in architectural floor plans. (C) 2016 Elsevier B.V. All rights reserved.

Metrics

Details

Logo image

Usage Policy