TY - JOUR
T1 - Optimal Deep Hybrid Boltzmann Machine Based Arabic Corpus Classification Model
AU - Al Duhayyim, Mesfer
AU - Al-Onazi, Badriyya B.
AU - Nour, Mohamed K.
AU - Yafoz, Ayman
AU - Mehanna, Amal S.
AU - ISHFAQ YASEEN YASEEN, null
AU - Abdelmageed, Amgad Atta
AU - GOUSE PASHA MOHAMMED, null
N1 - Publisher Copyright:
© 2023 CRL Publishing. All rights reserved.
PY - 2023
Y1 - 2023
N2 - Natural Language Processing (NLP) for the Arabic language has gained much significance in recent years. The most commonly-utilized NLP task is the 'Text Classification' process. Its main intention is to apply the Machine Learning (ML) approaches for automatically classifying the textual files into one or more pre-defined categories. In ML approaches, the first and foremost crucial step is identifying an appropriate large dataset to test and train the method. One of the trending ML techniques, i.e., Deep Learning (DL) technique needs huge volumes of different types of datasets for training to yield the best outcomes. The current study designs a new Dice Optimization with a Deep Hybrid Boltzmann Machinebased Arabic Corpus Classification (DODHBM-ACC) model in this background. The presented DODHBM-ACC model primarily relies upon different stages of pre-processing and the word2vec word embedding process. For Arabic text classification, the DHBM technique is utilized. This technique is a hybrid version of the Deep Boltzmann Machine (DBM) and Deep Belief Network (DBN). It has the advantage of learning the decisive intention of the classification process. To adjust the hyperparameters of the DHBM technique, the Dice Optimization Algorithm (DOA) is exploited in this study. The experimental analysis was conducted to establish the superior performance of the proposed DODHBM-ACC model. The outcomes inferred the better performance of the proposed DODHBM-ACC model over other recent approaches.
AB - Natural Language Processing (NLP) for the Arabic language has gained much significance in recent years. The most commonly-utilized NLP task is the 'Text Classification' process. Its main intention is to apply the Machine Learning (ML) approaches for automatically classifying the textual files into one or more pre-defined categories. In ML approaches, the first and foremost crucial step is identifying an appropriate large dataset to test and train the method. One of the trending ML techniques, i.e., Deep Learning (DL) technique needs huge volumes of different types of datasets for training to yield the best outcomes. The current study designs a new Dice Optimization with a Deep Hybrid Boltzmann Machinebased Arabic Corpus Classification (DODHBM-ACC) model in this background. The presented DODHBM-ACC model primarily relies upon different stages of pre-processing and the word2vec word embedding process. For Arabic text classification, the DHBM technique is utilized. This technique is a hybrid version of the Deep Boltzmann Machine (DBM) and Deep Belief Network (DBN). It has the advantage of learning the decisive intention of the classification process. To adjust the hyperparameters of the DHBM technique, the Dice Optimization Algorithm (DOA) is exploited in this study. The experimental analysis was conducted to establish the superior performance of the proposed DODHBM-ACC model. The outcomes inferred the better performance of the proposed DODHBM-ACC model over other recent approaches.
KW - Arabic corpus
KW - deep learning
KW - dice optimization
KW - machine learning
KW - text classification
UR - http://www.scopus.com/inward/record.url?scp=85158854885&partnerID=8YFLogxK
U2 - 10.32604/csse.2023.034609
DO - 10.32604/csse.2023.034609
M3 - Article
AN - SCOPUS:85158854885
SN - 0267-6192
VL - 46
SP - 2755
EP - 2772
JO - Computer Systems Science and Engineering
JF - Computer Systems Science and Engineering
IS - 3
ER -