Malware Family Classification Model Using User Defined Features and Representation Learning - Computational Intelligence in Data Science
Conference Papers Year : 2020

Malware Family Classification Model Using User Defined Features and Representation Learning

T. Gayathri
  • Function : Author
  • PersonId : 1117238
M. S. Vijaya
  • Function : Author
  • PersonId : 1117239

Abstract

Malware is very dangerous for system and network user. Malware identification is essential tasks in effective detecting and preventing the computer system from being infected, protecting it from potential information loss and system compromise. Commonly, there are 25 malware families exists. Traditional malware detection and anti-virus systems fail to classify the new variants of unknown malware into their corresponding families. With development of malicious code engineering, it is possible to understand the malware variants and their features for new malware samples which carry variability and polymorphism. The detection methods can hardly detect such variants but it is significant in the cyber security field to analyze and detect large-scale malware samples more efficiently. Hence it is proposed to develop an accurate malware family classification model contemporary deep learning technique. In this paper, malware family recognition is formulated as multi classification task and appropriate solution is obtained using representation learning based on binary array of malware executable files. Six families of malware have been considered here for building the models. The feature dataset with 690 instances is applied to deep neural network to build the classifier. The experimental results, based on a dataset of 6 classes of malware families and 690 malware files trained model provides an accuracy of over 86.8% in discriminating from malware families. The techniques provide better results for classifying malware into families.
Fichier principal
Vignette du fichier
507484_1_En_14_Chapter.pdf (343.36 Ko) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-03434789 , version 1 (18-11-2021)

Licence

Identifiers

Cite

T. Gayathri, M. S. Vijaya. Malware Family Classification Model Using User Defined Features and Representation Learning. 3rd International Conference on Computational Intelligence in Data Science (ICCIDS), Feb 2020, Chennai, India. pp.185-195, ⟨10.1007/978-3-030-63467-4_14⟩. ⟨hal-03434789⟩
136 View
28 Download

Altmetric

Share

More