Type of Publication: Journal Articles
Authors: Asaf Shabtai, Robert Moskovitch, Yuval Elovici, Chanan Glezer,
Title: Detection of Malicious Code by Applying Machine Learning Classifiers on Static Features – a State-of-the-Art Survey
Name of the Journal: Information Security Technical Report
Year: 2009
Volume: 14
Issue: 1
Pages: 16-29
Abstract: This research synthesizes a taxonomy for classifying detection methods of new malicious code by Machine Learning (ML) methods based on static features extracted from executables. The taxonomy is then operationalized to classify research on this topic and pinpoint critical open research issues in light of emerging threats. The article addresses various facets of the detection challenge, including: file representation and feature selection methods, classification algorithms, weighting ensembles, as well as the imbalance problem, active learning, and chronological evaluation. From the survey we conclude that a framework for detecting new malicious code in executable files can be designed to achieve very high accuracy while maintaining low false positives (i.e. misclassifying benign files as malicious). The framework should include training of multiple classifiers on various types of features (mainly OpCode and byte n-grams and Portable Executable Features), applying weighting algorithm on the classification results of the individual classifiers, as well as an active learning mechanism to maintain high detection accuracy. The training of classifiers should also consider the imbalance problem by generating classifiers that will perform accurately in a real-life situation where the percentage of malicious files among all files is estimated to be approximately 10%.
Last Updated: 1/10/2010 1:02:21 PM
Powered by Rami Palombo © 2005
Search in: Google Scholar  |  Scitation