Malicious Domain Names Detection Algorithm Based on N-Gram

Joint Authors

Zeng, Xiang-Yan
Chang, Zhaobin
Bao, Guangbin
Hong, Zhao

Source

Journal of Computer Networks and Communications

Issue

Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-9, 9 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2019-02-03

Country of Publication

Egypt

No. of Pages

9

Main Subjects

Information Technology and Computer Science

Abstract EN

Malicious domain name attacks have become a serious issue for Internet security.

In this study, a malicious domain names detection algorithm based on N-Gram is proposed.

The top 100,000 domain names in Alexa 2013 are used in the N-Gram method.

Each domain name excluding the top-level domain is segmented into substrings according to its domain level with the lengths of 3, 4, 5, 6, and 7.

The substring set of the 100,000 domain names is established, and the weight value of a substring is calculated according to its occurrence number in the substring set.

To detect a malicious attack, the domain name is also segmented by the N-Gram method and its reputation value is calculated based on the weight values of its substrings.

Finally, the judgment of whether the domain name is malicious is made by thresholding.

In the experiments on Alexa 2017 and Malware domain list, the proposed detection algorithm yielded an accuracy rate of 94.04%, a false negative rate of 7.42%, and a false positive rate of 6.14%.

The time complexity is lower than other popular malicious domain names detection algorithms.

American Psychological Association (APA)

Hong, Zhao& Chang, Zhaobin& Bao, Guangbin& Zeng, Xiang-Yan. 2019. Malicious Domain Names Detection Algorithm Based on N-Gram. Journal of Computer Networks and Communications،Vol. 2019, no. 2019, pp.1-9.
https://search.emarefa.net/detail/BIM-1172286

Modern Language Association (MLA)

Hong, Zhao…[et al.]. Malicious Domain Names Detection Algorithm Based on N-Gram. Journal of Computer Networks and Communications No. 2019 (2019), pp.1-9.
https://search.emarefa.net/detail/BIM-1172286

American Medical Association (AMA)

Hong, Zhao& Chang, Zhaobin& Bao, Guangbin& Zeng, Xiang-Yan. Malicious Domain Names Detection Algorithm Based on N-Gram. Journal of Computer Networks and Communications. 2019. Vol. 2019, no. 2019, pp.1-9.
https://search.emarefa.net/detail/BIM-1172286

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1172286