A hybrid approach for adversarial attack detection based on sentiment analysis model using Machine learning

Rashid Amin, Rahma Gantassi, Naeem Ahmed, Asma Hassan Alshehri, Faisal S. Alsubaei, Jaroslav Frnda

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Natural Language Processing (NLP) is one of the main subfields of Machine Learning (ML) that deals with human language in intelligent applications. One of the biggest problems NLP models face is adversarial attacks, which lead to inaccurate predictions. To increase an NLP model's resilience, adversarial text must be used to study both attacks and defenses. Several strategies for detecting adversarial attacks have been proposed; nonetheless, they face several obstacles, such as low attack success rates on particular datasets. Some attack methods can already be defended against effectively by existing defensive strategies, so such attacks cannot probe the limitations of NLP models deeply enough to guide future advances in defense. Consequently, an adversarial attack strategy with a longer attack duration and better performance is required. First, we train a Convolutional Neural Network (CNN) on the IMDB dataset, which consists of movie reviews labeled with positive and negative sentiment; the CNN performs sentiment classification of the data. Next, adversarial examples are generated from the IMDB dataset using the Fast Gradient Sign Method (FGSM), a widely used and effective technique in adversarial machine learning. A Long Short-Term Memory (LSTM) model is then developed to identify adversarial attacks on sentiment analysis systems, trained on a combination of the original IMDB data and the FGSM-generated adversarial examples. The models are evaluated on standard metrics, including accuracy, precision, and F1-score, and achieve about 95.6% accuracy in detecting adversarial attacks.
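
As a rough illustration of the pipeline described in the abstract, the sketch below shows a CNN sentiment classifier, FGSM perturbations computed in the embedding space, and an LSTM detector that separates clean from perturbed inputs. This is not the authors' implementation: it assumes PyTorch, and the class names (SentimentCNN, AttackDetectorLSTM), the epsilon value, and the layer sizes are illustrative placeholders rather than reported hyperparameters.

# Minimal sketch of the CNN + FGSM + LSTM-detector pipeline (illustrative only).
import torch
import torch.nn as nn

class SentimentCNN(nn.Module):
    """Embedding + 1D convolution + max-pooling sentiment classifier."""
    def __init__(self, vocab_size, embed_dim=100, num_filters=128, kernel_size=5):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.conv = nn.Conv1d(embed_dim, num_filters, kernel_size, padding=2)
        self.fc = nn.Linear(num_filters, 2)           # positive / negative

    def forward(self, token_ids=None, embedded=None):
        # Accept either token ids or pre-computed embeddings, so FGSM can
        # perturb the continuous embedding space directly.
        x = self.embed(token_ids) if embedded is None else embedded
        h = torch.relu(self.conv(x.transpose(1, 2))).max(dim=2).values
        return self.fc(h)

def fgsm_embeddings(model, token_ids, labels, epsilon=0.05):
    """FGSM: shift embeddings along the sign of the loss gradient."""
    emb = model.embed(token_ids).detach().requires_grad_(True)
    loss = nn.functional.cross_entropy(model(embedded=emb), labels)
    loss.backward()
    return (emb + epsilon * emb.grad.sign()).detach()

class AttackDetectorLSTM(nn.Module):
    """LSTM that flags an embedded review as clean or adversarial."""
    def __init__(self, embed_dim=100, hidden_dim=128):
        super().__init__()
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, 2)            # clean / adversarial

    def forward(self, embedded_seq):
        _, (h_n, _) = self.lstm(embedded_seq)
        return self.fc(h_n[-1])

In the workflow described above, the detector would be trained on a mixture of clean IMDB embeddings (labeled clean) and FGSM-perturbed embeddings (labeled adversarial), and then evaluated with accuracy, precision, and F1-score.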

Original language: English
Article number: 101829
Journal: Engineering Science and Technology, an International Journal
Volume: 58
State: Published - Oct 2024

Keywords

  • Adversarial Attack
  • CNN
  • FGSM
  • LSTM
  • Natural Language Processing
