Gender identification using marginalised stacked denoising autoencoders on twitter data

Badriyya B. Al-Onazi, Mohamed K. Nour, Hassan Alshamrani, Mesfer Al Duhayyim, Heba Mohsen, Amgad Atta Abdelmageed, GOUSE PASHA MOHAMMED, ABU SARWAR ZAMANI

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Gender analysis of Twitter could reveal significant socio-cultural differences between female and male users. Efforts had been made to analyze and automatically infer gender formerly for more commonly spoken languages’ content, but, as we now know that limited work is being undertaken for Arabic. Most of the research works are done mainly for English and least amount of effort for non-English language. The study for Arabic demographic inference like gender is relatively uncommon for social networking users, especially for Twitter. Therefore, this study aims to design an optimal marginalized stacked denoising autoencoder for gender identification on Arabic Twitter (OMSDAE-GIAT) model. The presented OMSDAE-GIAR technique mainly concentrates on the identification and classification of gender exist in the Twitter data. To attain this, the OMSDAE- GIAT model derives initial stages of data pre-processing and word embedding. Next, the MSDAE model is exploited for the identification of gender into two classes namely male and female. In the final stage, the OMSDAE-GIAT technique uses enhanced bat optimization algorithm (EBOA) for parameter tuning process, showing the novelty of our work. The performance validation of the OMSDAE-GIAT model is inspected against an Arabic corpus dataset and the results are measured under distinct metrics. The comparison study reported the enhanced performance of the OMSDAE-GIAT model over other recent approaches.

Original languageEnglish
Pages (from-to)2529-2544
Number of pages16
JournalIntelligent Automation and Soft Computing
Volume36
Issue number3
DOIs
StatePublished - 2023

Keywords

  • Arabic corpus
  • Arabic twitter
  • Bat algorithm
  • Gender identification
  • Hybrid deep learning
  • Social media

Fingerprint

Dive into the research topics of 'Gender identification using marginalised stacked denoising autoencoders on twitter data'. Together they form a unique fingerprint.

Cite this