Effect of outliers on the coefficient of determination in multiple regression analysis with the application on the GPA for student

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

This study aims to solve the problem of contradiction between the statistical significance and real significance of regression parameters when using multiple linear regression analysis. In this regard, an algorithm was presented based on the simple and multiple of determination coefficient, and the sum of averages to estimate multiple outliers when outliers are real. Regression analysis was applied to a phenomenon, whose results are known in advance (The relationship between Semester average and Cumulative average). The results were misleading, and we cannot firmly stand on analysis results. Also, the regression model did not improve much when an increased sample size more than doubled, so the study presents an algorithm for finding a solution to this contradiction. After checking Ordinary Least Squares (OLS) assumptions, outliers were identified, based on Cook's distance because it was the best. The proposed algorithm was compared with some robust regression methods, [Weighted Least Squares, Fully Modified Least Squares, and Least Median of Squares]. The results proved that the proposed method is a robust solution for outliers’ estimation. Therefore, it is recommended to use the proposed algorithm to estimate multiple outliers on other similar phenomena (e.g., The algorithm can be applied to a credit card transaction control system in a bank), and also software Packages statistical for the proposed algorithm. Also, the novelty of this study can be observed by investigating testing the significance of outliers as most of the previous researchers were interested in diagnosing the outliers without checking its significance.

Original languageEnglish
Pages (from-to)30-37
Number of pages8
JournalInternational Journal of Advanced and Applied Sciences
Volume7
Issue number10
DOIs
StatePublished - Oct 2020

Keywords

  • Cumulative average
  • Determination coefficient
  • Outliers
  • Regression

Fingerprint

Dive into the research topics of 'Effect of outliers on the coefficient of determination in multiple regression analysis with the application on the GPA for student'. Together they form a unique fingerprint.

Cite this