Efficient and Scalable Graph Similarity Joins in MapReduce

Co-authors

Zhang, Weiming
Zhao, Xiang
Chen, Yifan
Xiao, Chuan
Tang, Jiuyang

Source

The Scientific World Journal

Issue

Vol. 2014, Issue 2014 (31 December 2014), pp. 1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2014-07-08

Country of Publication

Egypt

Number of Pages

11

Main Subjects

Medicine
Information Technology and Computer Science

Abstract EN

With the emergence of massive graph-modeled data, graph similarity joins have become important because of their wide range of applications, including data cleaning and near-duplicate detection.

This paper considers graph similarity joins with edit distance constraints, which return the pairs of graphs whose edit distance is no larger than a given threshold.
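The join described above can be written compactly. The notation below is an assumption made for illustration and is not taken from the record: $\mathcal{G}$ is a single collection of graphs (a self-join is assumed), $\tau$ is the distance threshold, and $\mathrm{GED}(g_i, g_j)$ is the graph edit distance, that is, the minimum number of edit operations needed to transform $g_i$ into $g_j$.

```latex
\[
  \mathrm{Join}(\mathcal{G}, \tau)
  = \bigl\{\, (g_i, g_j) \;\bigm|\; g_i, g_j \in \mathcal{G},\ i < j,\ \mathrm{GED}(g_i, g_j) \le \tau \,\bigr\}
\]
```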

Leveraging the MapReduce programming model, we propose MGSJoin, a scalable algorithm following the filtering-verification framework for efficient graph similarity joins.

It filters out unpromising candidates by counting the graph signatures that two graphs have in common.
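The idea of counting overlapping signatures can be sketched in plain Python, imitating a map step (build an inverted index) and a reduce step (count shared signatures per pair). This is only an illustrative sketch: the record does not say how signatures are defined or what overlap bound MGSJoin uses, so the string signatures, the function name `count_filter`, and the parameter `min_overlap` are all hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def count_filter(signatures, min_overlap):
    """Return pairs of graph ids that share at least `min_overlap` signatures.

    signatures  -- dict mapping a graph id to a list of hashable signatures
                   (a multiset; what a signature is remains an assumption here)
    min_overlap -- hypothetical overlap threshold, must be at least 1 because
                   pairs with no shared signature are never generated
    """
    # "Map" step: inverted index from signature to {graph id: multiplicity}.
    index = defaultdict(dict)
    for gid, sigs in signatures.items():
        for sig, count in Counter(sigs).items():
            index[sig][gid] = count

    # "Reduce" step: each signature contributes the smaller multiplicity
    # to the overlap of every pair of graphs that contain it.
    overlap = Counter()
    for postings in index.values():
        for a, b in combinations(sorted(postings), 2):
            overlap[(a, b)] += min(postings[a], postings[b])

    # Only pairs that reach the threshold survive as candidates.
    return {pair for pair, shared in overlap.items() if shared >= min_overlap}


if __name__ == "__main__":
    toy = {
        "g1": ["C-C", "C-O", "C-C"],
        "g2": ["C-C", "C-C", "C-N"],
        "g3": ["N-N"],
    }
    print(count_filter(toy, min_overlap=2))
```

Pairs that survive this step would still need to be verified with an exact distance computation; the filter only reduces how many pairs reach that stage.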

Because the filtering phase can generate too many key-value pairs, spectral Bloom filters are introduced to reduce their number.
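For readers unfamiliar with the data structure, a spectral Bloom filter replaces the bits of an ordinary Bloom filter with counters, so it can estimate how often an item was inserted. The sketch below shows only the basic structure with minimum-selection estimates; it does not reproduce how MGSJoin applies the filter, and the class name, the SHA-256 based hashing, and the parameters are assumptions made for illustration.

```python
import hashlib


class SpectralBloomFilter:
    """Minimal counter-based Bloom filter with minimum-selection estimates."""

    def __init__(self, num_counters, num_hashes):
        self.counters = [0] * num_counters
        self.num_hashes = num_hashes

    def _positions(self, item):
        # Derive one counter position per hash function by salting SHA-256
        # with the hash index (an illustrative choice, not from the paper).
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % len(self.counters)

    def add(self, item):
        # Increment each distinct position once per insertion.
        for pos in set(self._positions(item)):
            self.counters[pos] += 1

    def estimate(self, item):
        # The smallest counter is an upper bound on the true frequency:
        # collisions can inflate counters but never decrease them.
        return min(self.counters[pos] for pos in self._positions(item))


if __name__ == "__main__":
    sbf = SpectralBloomFilter(num_counters=1024, num_hashes=3)
    for sig in ["C-C", "C-C", "C-C", "C-O"]:
        sbf.add(sig)
    print(sbf.estimate("C-C"), sbf.estimate("C-O"), sbf.estimate("N-N"))
```

The estimate can exceed the true count when counters collide, but it never falls below it, which is what makes such a structure usable as a compact summary of signature frequencies.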

Furthermore, we integrate the multiway join strategy to speed up verification, for which a MapReduce-based method for graph edit distance (GED) calculation is proposed.

The superior efficiency and scalability of the proposed algorithms are demonstrated by extensive experimental results.

American Psychological Association (APA) Citation Style

Chen, Yifan & Zhao, Xiang & Xiao, Chuan & Zhang, Weiming & Tang, Jiuyang. 2014. Efficient and Scalable Graph Similarity Joins in MapReduce. The Scientific World Journal, Vol. 2014, no. 2014, pp. 1-11.
https://search.emarefa.net/detail/BIM-1050894

Modern Language Association (MLA) Citation Style

Chen, Yifan…[et al.]. Efficient and Scalable Graph Similarity Joins in MapReduce. The Scientific World Journal No. 2014 (2014), pp. 1-11.
https://search.emarefa.net/detail/BIM-1050894

American Medical Association (AMA) Citation Style

Chen, Yifan & Zhao, Xiang & Xiao, Chuan & Zhang, Weiming & Tang, Jiuyang. Efficient and Scalable Graph Similarity Joins in MapReduce. The Scientific World Journal. 2014. Vol. 2014, no. 2014, pp. 1-11.
https://search.emarefa.net/detail/BIM-1050894

Data Type

Articles

Text Language

English

Notes

Includes bibliographical references

Record ID

BIM-1050894