ARTC: feature selection using association rules for text classification

Mozamel M. Saeed, Zaher Al Aghbari

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Feature vectors are extracted to represent objects in many classification tasks, such as text classification. Due to the high dimensionality of these raw feature vectors, the classification efficiency and accuracy are reduced. Therefore, reducing the size of feature vectors by selecting the relevant features that better represent the objects is an important aspect in text classification. Feature selection not only reduces the dimensionality of the feature vectors, but also produces more efficient classification models with higher predictive power. In this paper, we propose ARTC, which is an effective feature selection method that is based on the extraction of association rules to classify text documents. The extracted association rules discover the hidden relationships and correlations between the relevant words within the textual documents of a class and a cross different classes. Consequently, each class of documents is represented by a small set of contrasting features that are more effective in text classification. Our experiments show that ARTC outperforms other relevant techniques in terms of classification performance and efficiency.

Original languageEnglish
Pages (from-to)22519-22529
Number of pages11
JournalNeural Computing and Applications
Volume34
Issue number24
DOIs
StatePublished - Dec 2022

Keywords

  • Association rules
  • Contrasting feature set
  • Feature selection
  • Text binary vector
  • Text classification

Fingerprint

Dive into the research topics of 'ARTC: feature selection using association rules for text classification'. Together they form a unique fingerprint.

Cite this