Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection

Joint Authors

Choi, Mi-Jung
Kim, Sang-Pil
Gil, Myeong-Sun
Kim, Hajin
Moon, Yang-Sae
Won, Hee-Sun

Source

Security and Communication Networks

Issue

Vol. 2017, Issue 2017 (31 Dec. 2017), pp.1-12, 12 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2017-03-28

Country of Publication

Egypt

No. of Pages

12

Main Subjects

Information Technology and Computer Science

Abstract EN

Recently, the risk of information disclosure is increasing significantly.

Accordingly, privacy-preserving data mining (PPDM) is being actively studied to obtain accurate mining results while preserving the data privacy.

We here focus on secure similar document detection (SSDD), which identifies similar documents of two parties when each party does not disclose its own sensitive documents to the another party.

In this paper, we propose an efficient two-step protocol that exploits a feature selection as a lower-dimensional transformation, and we present discriminative feature selections to maximize the performance of the protocol.

The proposed protocol consists of two steps: the filtering step and the postprocessing step.

For the feature selection, we first consider the simplest one, random projection (RP), and propose its two-step solution, SSDD-RP.

We then present two discriminative feature selections and their solutions: SSDD-LF which selects a few dimensions locally frequent in the current querying vector and SSDD-GF which selects ones globally frequent in the set of all document vectors.

We finally propose a hybrid one, SSDD-HF, which takes advantage of both SSDD-LF and SSDD-GF.

We empirically show that the proposed two-step protocol significantly outperforms the previous one-step protocol by three or four orders of magnitude.

American Psychological Association (APA)

Kim, Sang-Pil& Gil, Myeong-Sun& Kim, Hajin& Choi, Mi-Jung& Moon, Yang-Sae& Won, Hee-Sun. 2017. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks،Vol. 2017, no. 2017, pp.1-12.
https://search.emarefa.net/detail/BIM-1203073

Modern Language Association (MLA)

Kim, Sang-Pil…[et al.]. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks No. 2017 (2017), pp.1-12.
https://search.emarefa.net/detail/BIM-1203073

American Medical Association (AMA)

Kim, Sang-Pil& Gil, Myeong-Sun& Kim, Hajin& Choi, Mi-Jung& Moon, Yang-Sae& Won, Hee-Sun. Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document Detection. Security and Communication Networks. 2017. Vol. 2017, no. 2017, pp.1-12.
https://search.emarefa.net/detail/BIM-1203073

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1203073