M-Chord: A Scalable Distributed Similarity Search Structure

Investor logo

Warning

This publication doesn't include Faculty of Economics and Administration. It includes Faculty of Informatics. Official publication website can be found on muni.cz.
Authors

NOVÁK David ZEZULA Pavel

Year of publication 2006
Type Article in Proceedings
Conference InfoScale '06: Proceedings of the 1st international conference on Scalable information systems
MU Faculty or unit

Faculty of Informatics

Citation
Web http://doi.acm.org/10.1145/1146847.1146866
Field Informatics
Keywords distributed data structures; peer-to-peer; similarity search; indexing
Description The need for a retrieval based not on the attribute values but on the very data content has recently led to rise of the metric-based similarity search. The computational complexity of such a retrieval and large volumes of processed data call for distributed processing which allows to achieve scalability. In this paper, we propose M-Chord, a distributed data structure for metric-based similarity search. The structure takes advantage of the idea of a vector index method iDistance in order to transform the issue of similarity searching into the problem of interval search in one dimension. The proposed peer-to-peer organization, based on the Chord protocol, distributes the storage space and parallelizes the execution of similarity queries. Promising features of the structure are validated by experiments on the prototype implementation and two real-life datasets.
Related projects:

You are running an old browser version. We recommend updating your browser to its latest version.