%0 Conference Proceedings %T Semantic Set Analysis for Malware Detection %+ Vietnam National University – HCMC %+ University of Information Technology, Vietnam National University - HCMC %A Nhuong, Nguyen, Van %A Nhi, Vo, Yen %A Cam, Nguyen, Tan %A Phu, Mai, Xuan %A Tan, Cao, Dang %Z Part 9: Various Aspects of Computer Security %< avec comité de lecture %( Lecture Notes in Computer Science %B 13th IFIP International Conference on Computer Information Systems and Industrial Management (CISIM) %C Ho Chi Minh City, Vietnam %Y Khalid Saeed %Y Václav Snášel %I Springer %3 Computer Information Systems and Industrial Management %V LNCS-8838 %P 688-700 %8 2014-11-05 %D 2014 %R 10.1007/978-3-662-45237-0_62 %K Data mining algorithm for classification %K x86 instruction set %K obfuscation techniques %K malware detection %K semantic set %Z Computer Science [cs] %Z Humanities and Social Sciences/Library and information sciencesConference papers %X Nowadays, malware is growing rapidly through the last few years and becomes more and more sophisticated as well as dangerous. A striking malware is obfuscation malware that is very difficult to detect. This kind of malware can create new variants that are similar to original malware feature but different about code. In order to deal with such types of malware, many approaches have been proposed, however, some of these approaches are ineffective due to their limited detection range, huge overheads or manual stages. Malware detection based on signature, for example, cannot overcome the obfuscation techniques of malware. Likewise, the behavior-based methods have the natural problems of a monitoring system such as recovery costs and long-lasting detection time. In this paper, we propose a new method (semantic set method) to detect metamorphic malware effectively by using semantic set (a set of changed values of registers or variables allocated in memory when a program is executed). For more details, this semantic set is analyzed by n-gram separator and Naïve Bayes classifier to increase detection accuracy and reduce detection time. This system has been already experimented on different datasets and got the accuracy up to 98% and detection rate almost 100%. %G English %Z TC 8 %2 https://inria.hal.science/hal-01405667/document %2 https://inria.hal.science/hal-01405667/file/978-3-662-45237-0_62_Chapter.pdf %L hal-01405667 %U https://inria.hal.science/hal-01405667 %~ SHS %~ IFIP-LNCS %~ IFIP %~ IFIP-TC %~ IFIP-TC8 %~ IFIP-LNCS-8838 %~ IFIP-CISIM