Preprocessing Method for Encrypted Traffic Based on Semisupervised Clustering

Joint Authors

Liu, Liang
Niu, Weina
Zheng, Rongfeng
Liu, Jiayong
Liao, Shan
Li, Kai

Source

Security and Communication Networks

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-13, 13 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-07-27

Country of Publication

Egypt

No. of Pages

13

Main Subjects

Information Technology and Computer Science

Abstract EN

The explosive growth in network traffic in recent times has resulted in increased processing pressure on network intrusion detection systems.

In addition, there is a lack of reliable methods for preprocessing network traffic generated by benign applications that do not steal users’ data from their devices.

To alleviate these problems, this study analyzed the differences between benign and malicious traffic produced by benign applications and malware, respectively.

To fully express these differences, this study proposed a new set of statistical features for training a clustering model.

Furthermore, to mine the communication channels generated by benign applications in batches, a semisupervised clustering method was adopted.

Using a small number of labeled samples, our method aggregated historical network traffic into two types of clusters.

The cluster that did not contain labeled malicious samples was regarded as a benign traffic cluster.

The experimental results were compared using four types of clustering algorithms.

The density-based spatial clustering of applications with noise (DBSCAN) clustering algorithm was selected to mine benign communication channels.

We also compared our method with two other methods, and the results demonstrated that the benign channels mined through our method were more reliable.

Finally, using our method, 1,811 benign transport layer security (TLS) channels were mined from 18,357 TLS communication channels.

The number of flows carried by these benign channels comprised 65.37% of the entire network flows, and no malicious flow was included in our results, which proves the effectiveness of our method.

American Psychological Association (APA)

Zheng, Rongfeng& Liu, Jiayong& Niu, Weina& Liu, Liang& Li, Kai& Liao, Shan. 2020. Preprocessing Method for Encrypted Traffic Based on Semisupervised Clustering. Security and Communication Networks،Vol. 2020, no. 2020, pp.1-13.
https://search.emarefa.net/detail/BIM-1208603

Modern Language Association (MLA)

Zheng, Rongfeng…[et al.]. Preprocessing Method for Encrypted Traffic Based on Semisupervised Clustering. Security and Communication Networks No. 2020 (2020), pp.1-13.
https://search.emarefa.net/detail/BIM-1208603

American Medical Association (AMA)

Zheng, Rongfeng& Liu, Jiayong& Niu, Weina& Liu, Liang& Li, Kai& Liao, Shan. Preprocessing Method for Encrypted Traffic Based on Semisupervised Clustering. Security and Communication Networks. 2020. Vol. 2020, no. 2020, pp.1-13.
https://search.emarefa.net/detail/BIM-1208603

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1208603