Science Fair Projects

Malware Identification by Statistical Opcode Analysis

Abstract

The objective: This project determined the efficacy of statistical analysis of program assembly instruction (opcode) frequencies to identify Malware from Goodware.

Methods/Materials

Malware and Goodware binaries were obtained and a python script was created to extract opcode frequencies from specific parts of these files. Naive Bayes models and Kmeans based models were then trained using these executables. These models were tested using a different set of programs to determine their efficacy at identifying Malware from Goodware.

Results

The best Naive Bayes model had a recall of 1 for Malware and .8 for Goodware.

Conclusions/Discussion

Differences in opcode frequencies can differentiate Malware from Goodware. Certain instructions occur much more frequently in one group than in the other; these differences can be used to identify the two types of programs.

TThis project examines models that differentiate Malware from Goodware using the frequencies of program assembly instructions.

Science Fair Project done By Ryan P. Batterman

 

Related Projects : Quadrocopter Aerial Monocular Vision for Improved Autonomous Robot Navigation, Software Complexity Measurement, Innovating Credit Card Security through Smartphone Based Handshake Protocols, Fingerprinting, and Encryption, Developing a Computer Program That Effectively Mimics Human Creativity, Post-Disaster Response Using a Novel Adaptive Object Recognition Algorithm, Unbeatable PONG through Artificial Intelligence, Study and Analysis of Q-Learning Algorithm Parameters for Decision Making using a Developed Simulation Tool, Evolving Neural Networks to Play Mastermind, Programming for Optimal Error Reduction in Line-Following Applications, Artificial Intelligence

 

<<Back To Topics Page........................................................................................>> Next Topic

 

 

Copyright © sciencefairprojects.co.in 2013 through 2015