Incremental Learning for Malware Classification in Small Datasets
Joint Authors
Li, Jingmei
Xue, Di
Wu, Weifei
Wang, Jiaxiang
Source
Security and Communication Networks
Issue
Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-12, 12 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2020-02-20
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Information Technology and Computer Science
Abstract EN
Information security is an important research area.
As a very special yet important case, malware classification plays an important role in information security.
In the real world, the malware datasets are open-ended and dynamic, and new malware samples belonging to old classes and new classes are increasing continuously.
This requires the malware classification method to enable incremental learning, which can efficiently learn the new knowledge.
However, existing works mainly focus on feature engineering with machine learning as a tool.
To solve the problem, we present an incremental malware classification framework, named “IMC,” which consists of opcode sequence extraction, selection, and incremental learning method.
We develop an incremental learning method based on multiclass support vector machine (SVM) as the core component of IMC, named “IMCSVM,” which can incrementally improve its classification ability by learning new malware samples.
In IMC, IMCSVM adds the new classification planes (if new samples belong to a new class) and updates all old classification planes for new malware samples.
As a result, IMC can improve the classification quality of known malware classes by minimizing the prediction error and transfer the old model with known knowledge to classify unknown malware classes.
We apply the incremental learning method into malware classification, and the experimental results demonstrate the advantages and effectiveness of IMC.
American Psychological Association (APA)
Li, Jingmei& Xue, Di& Wu, Weifei& Wang, Jiaxiang. 2020. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks،Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1208471
Modern Language Association (MLA)
Li, Jingmei…[et al.]. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks No. 2020 (2020), pp.1-12.
https://search.emarefa.net/detail/BIM-1208471
American Medical Association (AMA)
Li, Jingmei& Xue, Di& Wu, Weifei& Wang, Jiaxiang. Incremental Learning for Malware Classification in Small Datasets. Security and Communication Networks. 2020. Vol. 2020, no. 2020, pp.1-12.
https://search.emarefa.net/detail/BIM-1208471
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1208471