Abstract-Malicious software, also known as malware, is a huge problem that costs consumers billions of dollars each year. To solve this problem, a significant amount of research has been dedicated towards detecting malware. In this paper, we introduce a genetic and evolutionary feature selection technique for the identification of HTML code associated with malware. We believe that there may be an association between malware and the HTML code that it is embedded in. Our results show that this technique outperforms previous techniques in terms of recognition accuracy as well as the total number of features needed for recognition.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.