Academy & Industry Research Collaboration Center (AIRCC)

Volume 11, Number 04, March 2021

An Experience in Enhancing Machine Learning Classifier Against Low-Entropy Packed Malwares

  Authors

Shang-Wen Chen, Tzu-Hsien Chuang, Chin-Wei Tien and Chih-Wei Chen, Institute for Information Industry, Taiwan

  Abstract

Both benign applications and malwares would take packing for their different purposes to conceal the real part of the program processes. According to recent research reports, existing machine learning (ML) approach-based malware detection engines are difficult to effectively classify the packed malwares, especially when they are in low entropy packed.

Recently, we counted and found that the ratio of low-entropy packed ransomware is extremely high. This would cause a high error rate of the result on currently used ML approaches. Thus, we propose a new method to extract entropy-related features and use a stack model to build up an ML malware engine to effectively detect low-entropy packed malwares. We evaluate our method by using over 15,000 malware samples collected from VirusTotal and compare the result to related researches. This experience reports our adopted model and features can significantly lower the error rate of low-entropy packed detection from 11% to 1%.

  Keywords

Malware detection, low-entropy packing, machine learning classification.