Research Article

An Efficient Convolutional Neural Network with Transfer Learning for Malware Classification

Table 1

Summary of the literature.

S. no.SchemeSummary

1[17](i) Classify types of malwares in three ways
 (1) Image-based
 (2) Dynamic
 (3) Static
2[18](i) Shown many deep convolutional neural networks (DCNs) potential in image processing
3[15, 19](i) Used the raw malware executable files to extract grayscale representations of malicious characteristics
(ii) By extracting visual elements from such photos, malware can be analyzed.
4[20](i) Worked with a malware picture dataset that included 9,342 malware samples from 25 distinct types.
(ii) They were the first to examine the use of byte graphs as grayscale pictures for automated malware categorization.
5[21](i) Utilized the approach of [20] to extract GIST characteristics from grayscale pictures and categorize them using the Euclidean distance as a metric.
(ii) However, their method has a considerable computational cost.
6[22](i) Built an effective texture-based feature vector from the malware images using the wavelet transform.
(ii) Conducted malware classification using a multiclass support vector machine with malware input as images.
7[24](i) Performed a simple CNN detected the variant of codes by turning them into grayscale images. Kalash et al. [25] classified malware by using two datasets, Malimg [15] and Microsoft [26].
(ii) Converted the malware binaries into malware images, their approach achieved high accuracy of 98.52% and 99.97%.
8[27](i) Used two datasets, Malimg dataset and BigData gathering to build CNN model with four layers.
9[28](i) Built a model by using deep transfer learning to classify two datasets, ImageNet [16] and Malimg [15].
(ii) Demonstrated high accuracy of 99.18%.
10[29](i) Used two different approaches of feature extractions to classify Windows API Calls database.
(ii) Shows that they depend on inverse document frequency vector and categorical vector.
(iii) Proposed method score high accuracy above 90.0%.
11[30](i) Presented one-dimensional CNNs to detect and classify malware families by using two datasets, Malimg [15] and Microsoft [26].