Exposing the Limitations of Machine Learning for Malware Detection Under Concept Drift

Ahmed Abusnaina, Afsah Anwar, Muhammad Saad, Abdulrahman Alabduljabbar, Rhongho Jang, Saeed Salem, David Mohaisen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The arms race between malware authors and defenders is characterized by (1) mutations to the malware samples and (2) model retraining to detect those mutations. Due to an exponential growth in the number of new malware samples reported per day (1.5 million [4]), detection frameworks’ reliance on model retraining naturally increased. Model retraining is the de facto approach to counter malware mutations. In this paper, we question the efficacy of machine learning in the context of malware detection by exposing various limitations in the retraining approaches. We show that model retraining only provides a marginal performance improvement for malicious sample detection while simultaneously degrading the benign sample detection performance. To address various issues in malware detection, we investigate the efficiency of several model retraining approaches. Our proposed approaches allow the malware detectors to retrain models in time to enable malware family emergence detection while concurrently monitoring the evolving patterns of malware family mutations.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering – WISE 2024 - 25th International Conference, Proceedings
EditorsMahmoud Barhamgi, Hua Wang, Xin Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages273-289
Number of pages17
ISBN (Print)9789819605668
DOIs
StatePublished - 2025
Externally publishedYes
Event25th International Conference on Web Information Systems Engineering, WISE 2024 - Doha, Qatar
Duration: 2 Dec 20245 Dec 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15437 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Web Information Systems Engineering, WISE 2024
Country/TerritoryQatar
CityDoha
Period2/12/245/12/24

Keywords

  • Adversarial Machine Learning
  • Robust Malware Detection

Fingerprint

Dive into the research topics of 'Exposing the Limitations of Machine Learning for Malware Detection Under Concept Drift'. Together they form a unique fingerprint.

Cite this