Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining

المؤلفون المشاركون

Li, Lulu
Zeng, Qingtian
Liu, Cong
Zhang, Feng

المصدر

Scientific Programming

العدد

المجلد 2020، العدد 2020 (31 ديسمبر/كانون الأول 2020)، ص ص. 1-15، 15ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2020-07-07

دولة النشر

مصر

عدد الصفحات

15

التخصصات الرئيسية

الرياضيات

الملخص EN

Source code similarity detection has extensive applications in computer programming teaching and software intellectual property protection.

In the teaching of computer programming courses, students may utilize some complex source code obfuscation techniques, e.g., opaque predicates, loop unrolling, and function inlining and outlining, to reduce the similarity between code fragments and avoid the plagiarism detection.

Existing source code similarity detection approaches only consider static features of source code, making it difficult to cope with more complex code obfuscation techniques.

In this paper, we propose a novel source code similarity detection approach by considering the dynamic features at runtime of source code using process mining.

More specifically, given two pieces of source code, their running logs are obtained by source code instrumentation and execution.

Next, process mining is used to obtain the flow charts of the two pieces of source code by analyzing their collected running logs.

Finally, similarity of the two pieces of source code is measured by computing the similarity of these two flow charts.

Experimental results show that the proposed approach can deal with more complex obfuscation techniques including opaque predicates and loop unrolling as well as function inlining and outlining, which cannot be handled by existing work properly.

Therefore, we argue that our approach can defeat commonly used code obfuscation techniques more effectively for source code similarity detection than the existing state-of-the-art approaches.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Zhang, Feng& Li, Lulu& Liu, Cong& Zeng, Qingtian. 2020. Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining. Scientific Programming،Vol. 2020, no. 2020, pp.1-15.
https://search.emarefa.net/detail/BIM-1209263

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Zhang, Feng…[et al.]. Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining. Scientific Programming No. 2020 (2020), pp.1-15.
https://search.emarefa.net/detail/BIM-1209263

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Zhang, Feng& Li, Lulu& Liu, Cong& Zeng, Qingtian. Flow Chart Generation-Based Source Code Similarity Detection Using Process Mining. Scientific Programming. 2020. Vol. 2020, no. 2020, pp.1-15.
https://search.emarefa.net/detail/BIM-1209263

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1209263