MapReduce Based Personalized Locality Sensitive Hashing for Similarity Joins on Large Scale Data

المؤلفون المشاركون

Wang, Jingjing
Lin, Chen

المصدر

Computational Intelligence and Neuroscience

العدد

المجلد 2015، العدد 2015 (31 ديسمبر/كانون الأول 2015)، ص ص. 1-13، 13ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2015-04-30

دولة النشر

مصر

عدد الصفحات

13

التخصصات الرئيسية

الأحياء

الملخص EN

Locality Sensitive Hashing (LSH) has been proposed as an efficient techniquefor similarity joins for high dimensional data.

The efficiency and approximationrate of LSH depend on the number of generated false positive instances and falsenegative instances.

In many domains, reducing the number of false positives iscrucial.

Furthermore, in some application scenarios, balancing false positives andfalse negatives is favored.

To address these problems, in this paper we proposePersonalized Locality Sensitive Hashing (PLSH), where a new banding scheme isembedded to tailor the number of false positives, false negatives, and the sum ofboth.

PLSH is implemented in parallel using MapReduce framework to deal withsimilarity joins on large scale data.

Experimental studies on real and simulated dataverify the efficiency and effectiveness of our proposed PLSH technique, comparedwith state-of-the-art methods.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Wang, Jingjing& Lin, Chen. 2015. MapReduce Based Personalized Locality Sensitive Hashing for Similarity Joins on Large Scale Data. Computational Intelligence and Neuroscience،Vol. 2015, no. 2015, pp.1-13.
https://search.emarefa.net/detail/BIM-1057674

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Wang, Jingjing& Lin, Chen. MapReduce Based Personalized Locality Sensitive Hashing for Similarity Joins on Large Scale Data. Computational Intelligence and Neuroscience No. 2015 (2015), pp.1-13.
https://search.emarefa.net/detail/BIM-1057674

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Wang, Jingjing& Lin, Chen. MapReduce Based Personalized Locality Sensitive Hashing for Similarity Joins on Large Scale Data. Computational Intelligence and Neuroscience. 2015. Vol. 2015, no. 2015, pp.1-13.
https://search.emarefa.net/detail/BIM-1057674

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1057674