Multi-modal Similarity Retrieval with Distributed Key-value Store

Investor logo

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

NOVÁK David

Year of publication 2015
Type Article in Periodical
Magazine / Source MOBILE NETWORKS & APPLICATIONS
MU Faculty or unit

Faculty of Informatics

Citation
Doi http://dx.doi.org/10.1007/s11036-014-0561-4
Field Informatics
Keywords Similarity search; Multi-modal search; Big Data; Scalability; Distributed hash table
Description We propose a system architecture for large-scale similarity search in various types of digital data. The architecture combines contemporary highly-scalable distributed data stores with recent efficient similarity indexes and also with other types of search indexes. The system enables various types of data access by distance-based similarity queries, standard term and attribute queries, and advanced queries combining several search aspects (modalities). The first part of this work describes the generic architecture and similarity index PPP-Codes, which is suitable for our system. In the second part, we describe two specific instances of this architecture that manage two large collections of digital images and provide content-based visual search, keyword search, attribute-based access, and their combinations. The first collection is the CoPhIR benchmark with 106 million images accessed by MPEG7 visual descriptors and the second collection contains 20 million images with complex features obtained from deep convolutional neural network.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.