Incremental Learning for Malware Classification in Small Datasets

Joint Authors

Li, Jingmei
Xue, Di
Wu, Weifei
Wang, Jiaxiang

Source

Security and Communication Networks

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-12, 12 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-02-20

Country of Publication

Egypt

No. of Pages

12

Main Subjects

Information Technology and Computer Science

Abstract EN

Information security is an important research area.

As a very special yet important case, malware classification plays an important role in information security.

In the real world, the malware datasets are open-ended and dynamic, and new malware samples belonging to old classes and new classes are increasing continuously.

This requires the malware classification method to enable incremental learning, which can efficiently learn the new knowledge.

However, existing works mainly focus on feature engineering with machine learning as a tool.

To solve the problem, we present an incremental malware classification framework, named “IMC,” which consists of opcode sequence extraction, selection, and incremental learning method.

We develop an incremental learning method based on multiclass support vector machine (SVM) as the core component of IMC, named “IMCSVM,” which can incrementally improve its classification ability by learning new malware samples.

In IMC, IMCSVM adds the new classification planes (if new samples belong to a new class) and updates all old classification planes for new malware samples.

As a result, IMC can improve the classification quality of known malware classes by minimizing the prediction error and transfer the old model with known knowledge to classify unknown malware classes.

We apply the incremental learning method into malware classification, and the experimental results demonstrate the advantages and effectiveness of IMC.

American Psychological Association (APA)

Li, Jingmei& Xue, Di& Wu, Weifei& Wang, Jiaxiang. 2020. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks،Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1208471

Modern Language Association (MLA)

Li, Jingmei…[et al.]. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks No. 2020 (2020), pp.1-12.
https://search.emarefa.net/detail/BIM-1208471

American Medical Association (AMA)

Li, Jingmei& Xue, Di& Wu, Weifei& Wang, Jiaxiang. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks. 2020. Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1208471

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1208471