TY - JOUR
T1 - RoBERTa, ResNeXt and BiLSTM with self-attention
T2 - The ultimate trio for customer sentiment analysis
AU - Lak, Amir Jabbary
AU - Boostani, Reza
AU - Alenizi, Farhan A.
AU - Mohammed, Amin Salih
AU - Fakhrahmad, Seyed Mostafa
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/10
Y1 - 2024/10
N2 - Sentiment analysis of customer feedback is a pivotal component of Natural Language Processing (NLP), enabling businesses to gauge consumer emotions towards their products and services. The inherent variability of language introduces substantial challenges in the analysis of unstructured customer data. Sentiment analysis techniques generally fall into two categories: traditional methods, which involve the extraction of hand-crafted features and the application of machine learning algorithms, and end-to-end deep learning approaches that process raw data, transforming them layer by layer into high-level textual features. Each method presents its own balance between simplicity and speed versus analytical power and flexibility, and they all grapple with the need for extensive labeled data and computational resources. To address these challenges, this paper presents an innovative hybrid model that amalgamates RoBERTa, ResNeXt, BiLSTM, and self-attention mechanisms. This integration capitalizes on the collective strengths of these models to surmount the limitations inherent in each. The proposed model proves to be a powerful and efficient tool for sentiment analysis, demonstrating proficiency across various data types and tasks. We undertake a comprehensive evaluation of the proposed model's accuracy and training efficiency using four benchmark datasets. Our investigation also explores the impact of different similarity measures and convolutional neural network architectures on the model's performance. The results affirm that our model not only achieves high accuracy but also significantly reduces training time compared to RoBERTa's fine-tuning. Furthermore, the model exhibits exceptional domain adaptability, particularly when fine-tuned on the IMDb dataset following initial training on the Yelp Review Full dataset. The practicality of the proposed model is underscored by its reduced computational complexity and its adeptness at navigating semantic and syntactic nuances.
AB - Sentiment analysis of customer feedback is a pivotal component of Natural Language Processing (NLP), enabling businesses to gauge consumer emotions towards their products and services. The inherent variability of language introduces substantial challenges in the analysis of unstructured customer data. Sentiment analysis techniques generally fall into two categories: traditional methods, which involve the extraction of hand-crafted features and the application of machine learning algorithms, and end-to-end deep learning approaches that process raw data, transforming them layer by layer into high-level textual features. Each method presents its own balance between simplicity and speed versus analytical power and flexibility, and they all grapple with the need for extensive labeled data and computational resources. To address these challenges, this paper presents an innovative hybrid model that amalgamates RoBERTa, ResNeXt, BiLSTM, and self-attention mechanisms. This integration capitalizes on the collective strengths of these models to surmount the limitations inherent in each. The proposed model proves to be a powerful and efficient tool for sentiment analysis, demonstrating proficiency across various data types and tasks. We undertake a comprehensive evaluation of the proposed model's accuracy and training efficiency using four benchmark datasets. Our investigation also explores the impact of different similarity measures and convolutional neural network architectures on the model's performance. The results affirm that our model not only achieves high accuracy but also significantly reduces training time compared to RoBERTa's fine-tuning. Furthermore, the model exhibits exceptional domain adaptability, particularly when fine-tuned on the IMDb dataset following initial training on the Yelp Review Full dataset. The practicality of the proposed model is underscored by its reduced computational complexity and its adeptness at navigating semantic and syntactic nuances.
KW - BiLSTM
KW - Customer sentiment analysis
KW - ResNeXt
KW - RoBERTa
KW - Self-attention mechanism
UR - http://www.scopus.com/inward/record.url?scp=85199470010&partnerID=8YFLogxK
U2 - 10.1016/j.asoc.2024.112018
DO - 10.1016/j.asoc.2024.112018
M3 - Article
AN - SCOPUS:85199470010
SN - 1568-4946
VL - 164
JO - Applied Soft Computing
JF - Applied Soft Computing
M1 - 112018
ER -