Author profiling from Romanized Urdu text using transfer learning models

Abid Ali, Muhammad Sohail khan, Muhammad Amin Khan, Sajid Ullah Khan, Faheem Khan

Research output: Contribution to journalArticlepeer-review

Abstract

This research concentrates on author profiling using transfer learning models for classifying age and gender. The investigation encompassed a diverse set of transfer learning techniques, including Roberta, BERT, ALBERT, Distil BERT, Distil Roberta, ELECTRA, and XLNet. Through meticulous evaluation using metrics such as the Matthews Correlation Coefficient, Accuracy, Precision, Recall, and F1 Score, the study examined the efficacy of these models. The curated dataset was divided for gender and age tasks, resulting in robust gender prediction with the XLNet model and age prediction with the BERT model. Notably, the XLNet model achieved the highest MCC (0.7946), Accuracy (0.8957), Precision (0.8992), Recall (0.8957), and F1 Score (0.8958) values in gender classification, while the BERT model excelled in age prediction with an MCC of (0.7338), Accuracy of (0.8220), Precision of (0.8324), Recall of (0.8220), and F1 Score of (0.8243). Visualized outcomes provide valuable insights into the model’s performance nuances, paving the way for their practical implementation. This research offers novel contributions to author profiling tasks, bridging the gap between theory and real-world applications.

Original languageEnglish
Pages (from-to)4455-4470
Number of pages16
JournalNeural Computing and Applications
Volume37
Issue number6
DOIs
StatePublished - Feb 2025

Keywords

  • Author profiling
  • BERT
  • Gender and age classification & Roman Urdu
  • Transfer learning
  • XLNet

Fingerprint

Dive into the research topics of 'Author profiling from Romanized Urdu text using transfer learning models'. Together they form a unique fingerprint.

Cite this