Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection
Joint Authors
Choi, Mi-Jung
Kim, Sang-Pil
Gil, Myeong-Sun
Kim, Hajin
Moon, Yang-Sae
Won, Hee-Sun
Source
Security and Communication Networks
Issue
Vol. 2017, Issue 2017 (31 Dec. 2017), pp.1-12, 12 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2017-03-28
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Information Technology and Computer Science
Abstract EN
The risk of information disclosure has recently been increasing significantly.
Accordingly, privacy-preserving data mining (PPDM) is being actively studied to obtain accurate mining results while preserving data privacy.
We focus here on secure similar document detection (SSDD), which identifies similar documents held by two parties when neither party discloses its own sensitive documents to the other.
In this paper, we propose an efficient two-step protocol that exploits feature selection as a lower-dimensional transformation, and we present discriminative feature selections to maximize the performance of the protocol.
The proposed protocol consists of two steps: the filtering step and the postprocessing step.
For the feature selection, we first consider the simplest one, random projection (RP), and propose its two-step solution, SSDD-RP.
We then present two discriminative feature selections and their solutions: SSDD-LF, which selects a few dimensions that are locally frequent in the current query vector, and SSDD-GF, which selects dimensions that are globally frequent in the set of all document vectors.
We finally propose a hybrid one, SSDD-HF, which takes advantage of both SSDD-LF and SSDD-GF.
We empirically show that the proposed two-step protocol significantly outperforms the previous one-step protocol by three or four orders of magnitude.
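The filtering and postprocessing steps described in the abstract can be illustrated with a minimal, non-secure sketch. It assumes Euclidean distance with a tolerance `eps` as the similarity test (the abstract does not name the measure) and plain NumPy arithmetic in place of the secure two-party computation. All function names here are hypothetical, RP is approximated by picking dimensions uniformly at random, and the hybrid SSDD-HF is left out because the abstract does not say how it combines the two selections.

```python
import numpy as np

def select_random(dim, k, rng):
    """RP-style selection (assumption): k dimensions chosen uniformly at random."""
    return rng.choice(dim, size=k, replace=False)

def select_locally_frequent(query, k):
    """LF-style selection: the k dimensions with the largest values in the query."""
    return np.argsort(query)[::-1][:k]

def select_globally_frequent(docs, k):
    """GF-style selection: the k dimensions with the largest totals over all documents."""
    return np.argsort(docs.sum(axis=0))[::-1][:k]

def two_step_search(query, docs, dims, eps):
    """Return (indices of similar documents, indices that survived filtering)."""
    # Step 1 (filtering): distance on the selected dimensions only.
    # A sum of squared differences over a subset of dimensions never exceeds
    # the sum over all dimensions, so pruning here causes no false dismissals.
    partial = np.linalg.norm(docs[:, dims] - query[dims], axis=1)
    candidates = np.flatnonzero(partial <= eps)
    # Step 2 (postprocessing): full-dimensional distance on the survivors.
    full = np.linalg.norm(docs[candidates] - query, axis=1)
    return candidates[full <= eps], candidates

# Usage with synthetic term-frequency vectors (illustrative data only).
rng = np.random.default_rng(0)
docs = rng.integers(0, 5, size=(100, 50)).astype(float)
query = docs[7].copy()
dims = select_locally_frequent(query, 5)
result, candidates = two_step_search(query, docs, dims, eps=1.0)
```

In this sketch the expensive full-dimensional comparison runs only on the candidates that pass the cheap low-dimensional filter, which is the source of the speedup the abstract reports; how many candidates survive depends on which dimensions are selected.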
American Psychological Association (APA)
Kim, Sang-Pil & Gil, Myeong-Sun & Kim, Hajin & Choi, Mi-Jung & Moon, Yang-Sae & Won, Hee-Sun. 2017. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks, Vol. 2017, no. 2017, pp.1-12.
https://search.emarefa.net/detail/BIM-1203073
Modern Language Association (MLA)
Kim, Sang-Pil…[et al.]. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks No. 2017 (2017), pp.1-12.
https://search.emarefa.net/detail/BIM-1203073
American Medical Association (AMA)
Kim, Sang-Pil & Gil, Myeong-Sun & Kim, Hajin & Choi, Mi-Jung & Moon, Yang-Sae & Won, Hee-Sun. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks. 2017. Vol. 2017, no. 2017, pp.1-12.
https://search.emarefa.net/detail/BIM-1203073
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1203073