Impact of Dataset Representation on Smartphone Malware Detection Performance - Trust Management VII
Conference Papers Year : 2013

Impact of Dataset Representation on Smartphone Malware Detection Performance

Abstract

Improving Smartphone anomaly-based malware detection techniques is widely studied in recent years. Previous studies explore three factors: dataset size, dataset type and normal profile model. These factors improve the performance, but increase computation complexity and the required memory space. In this paper we explore a new factor: the dataset representation. Dataset representation is the format adopted to organize and represent data. To investigate the impact of this factor, we examine four machine learning classifiers with three different dataset representations. Those dataset representations are: successive system calls, bag of system calls and patterns frequency system calls. The used dataset is a collection of system call traces of Smartphone executing Android 2.2. We analyse the performance of each classifier and deduce the influence of dataset representation on accuracy and false positive rates. The results show that the dataset representation has a potential impact on the performance of classifiers with low computational and memory cost.
Fichier principal
Vignette du fichier
978-3-642-38323-6_12_Chapter.pdf (478.2 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01468169 , version 1 (15-02-2017)

Licence

Identifiers

Cite

Abdelfattah Amamra, Chamseddine Talhi, Jean-Marc Robert. Impact of Dataset Representation on Smartphone Malware Detection Performance. 7th Trust Management (TM), Jun 2013, Malaga, Spain. pp.166-176, ⟨10.1007/978-3-642-38323-6_12⟩. ⟨hal-01468169⟩
583 View
108 Download

Altmetric

Share

More