Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection

Co-authors

Choi, Mi-Jung
Kim, Sang-Pil
Gil, Myeong-Sun
Kim, Hajin
Moon, Yang-Sae
Won, Hee-Sun

Source

Security and Communication Networks

Issue

Vol. 2017, no. 2017 (31 December 2017), pp. 1-12, 12 p.

Publisher

Hindawi Publishing Corporation

Publication date

2017-03-28

Country of publication

Egypt

Number of pages

12

Main subjects

Information Technology and Computer Science

Abstract (EN)

The risk of information disclosure has increased significantly in recent years.

Accordingly, privacy-preserving data mining (PPDM) is being actively studied as a way to obtain accurate mining results while preserving data privacy.

Here we focus on secure similar document detection (SSDD), which identifies similar documents held by two parties without either party disclosing its own sensitive documents to the other.

In this paper, we propose an efficient two-step protocol that exploits feature selection as a lower-dimensional transformation, and we present discriminative feature selections to maximize the performance of the protocol.

The proposed protocol consists of two steps: the filtering step and the postprocessing step.
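The two steps can be illustrated with a minimal plaintext sketch. It assumes Euclidean distance and a fixed similarity tolerance, neither of which is stated in this abstract, and it omits the secure computation entirely; the function names and parameters are hypothetical. The filter is safe under this assumption because the squared distance over a subset of dimensions never exceeds the squared distance over all dimensions, so filtering cannot discard a truly similar document.

```python
def squared_distance(u, v, dims=None):
    """Squared Euclidean distance, optionally restricted to the given dimensions."""
    indices = range(len(u)) if dims is None else dims
    return sum((u[i] - v[i]) ** 2 for i in indices)


def two_step_search(query, documents, dims, tolerance):
    """Return indices of documents whose distance to `query` is at most `tolerance`.

    Assumption: Euclidean distance; this is a plaintext illustration,
    not the secure protocol itself.
    """
    threshold = tolerance ** 2

    # Step 1 (filtering): compare only the selected dimensions.
    # The partial distance is a lower bound of the full distance,
    # so any document pruned here cannot be similar.
    candidates = [
        i for i, doc in enumerate(documents)
        if squared_distance(query, doc, dims) <= threshold
    ]

    # Step 2 (postprocessing): compute the full distance for survivors only.
    return [
        i for i in candidates
        if squared_distance(query, documents[i]) <= threshold
    ]
```

In a secure setting, the saving would come from running the expensive secure distance computation in full dimensionality only for the candidates that survive the cheaper low-dimensional filter.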

For the feature selection, we first consider the simplest one, random projection (RP), and propose its two-step solution, SSDD-RP.

We then present two discriminative feature selections and their solutions: SSDD-LF, which selects a few dimensions that are locally frequent in the current query vector, and SSDD-GF, which selects dimensions that are globally frequent in the set of all document vectors.

Finally, we propose a hybrid solution, SSDD-HF, which takes advantage of both SSDD-LF and SSDD-GF.
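The selection strategies named above might be sketched as follows. This is an illustration under assumptions, not the authors' implementation: documents are taken to be term-frequency vectors, RP is simplified to picking k dimensions at random, and "frequent" is read as "largest frequency value". All function names are hypothetical. The hybrid SSDD-HF is left out because the abstract does not say how the two selections are combined.

```python
import random


def select_random(num_dims, k, seed=0):
    """RP-style selection (simplified): k dimensions chosen uniformly at random."""
    rng = random.Random(seed)
    return sorted(rng.sample(range(num_dims), k))


def select_locally_frequent(query, k):
    """LF-style selection: the k dimensions with the largest values in the query vector."""
    ranked = sorted(range(len(query)), key=lambda i: query[i], reverse=True)
    return sorted(ranked[:k])


def select_globally_frequent(documents, k):
    """GF-style selection: the k dimensions with the largest totals over all documents."""
    totals = [sum(column) for column in zip(*documents)]
    ranked = sorted(range(len(totals)), key=lambda i: totals[i], reverse=True)
    return sorted(ranked[:k])
```

The intuition is that dimensions with large values tend to contribute more to the distance between vectors, so selecting them should prune more dissimilar documents in the filtering step than a random choice would.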

We empirically show that the proposed two-step protocol significantly outperforms the previous one-step protocol by three or four orders of magnitude.

American Psychological Association (APA) citation style

Kim, Sang-Pil & Gil, Myeong-Sun & Kim, Hajin & Choi, Mi-Jung & Moon, Yang-Sae & Won, Hee-Sun. 2017. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks, Vol. 2017, no. 2017, pp. 1-12.
https://search.emarefa.net/detail/BIM-1203073

Modern Language Association (MLA) citation style

Kim, Sang-Pil…[et al.]. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks No. 2017 (2017), pp. 1-12.
https://search.emarefa.net/detail/BIM-1203073

American Medical Association (AMA) citation style

Kim, Sang-Pil & Gil, Myeong-Sun & Kim, Hajin & Choi, Mi-Jung & Moon, Yang-Sae & Won, Hee-Sun. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks. 2017. Vol. 2017, no. 2017, pp. 1-12.
https://search.emarefa.net/detail/BIM-1203073

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record number

BIM-1203073