Implication of Different Data Split Ratio on the Performance of Model in Price Prediction of Used Vehicles Using Regression Analysis

Alimul Haque, Shams Raza, Sultan Ahmad, Alamgir Hossain, HIKMAT AWAD ABDELJABER, A. E.M. Eljialy, Sultan Alanazi, Jabeen Nazeer

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

Introduction: artificial intelligence (AI) and Machine Learning have become buzzwords lately due to technological changes and data quality testing, especially in shape and finish analysis. Lots of research has been conducted for linear regression algorithms to predict the price in different sectors for share stock, rental properties, prices of used cars etc. This study provides suitable data split ratio for optimum cost estimation based on linear regression model. In present days there is an increasing demand for having own car for every middle-class family therefore this have given opportunity to motor vehicle business to offer wide range of used vehicle for re-sale especially companies like Maruti Suzuki, Tata motors & Mahendra motors in Indian motor vehicle industries. Therefore, it is important to know the current value of your car before spending your hard-earned money on any item. Objective: the objective of this paper is finding appropriate value of cars in Metropolitans or even in state capitals. Features like model, mileage, AC, seating capacities, fuel type automatic will be taken into account when doing this. This estimate is designed to help customers find the right options to suit their needs. Method: we have used a linear regression model to estimate the value of the respective car. Results: for doing this price prediction in this paper using liner regression we have tried to find the optimum accuracy of model by varying data split ratio for training and test data set and concluded with the result that 80/20 ratio is the best ratio with optimum model accuracy for business domain analysis with labelled data set. Conclusions: the findings underscore the importance of careful consideration when selecting a data split ratio for price prediction models in the used vehicle market. The insights gleaned from this study can inform future research and contribute to the development of more accurate and reliable regression models in similar domains.

Original languageEnglish
Article number425
JournalData and Metadata
Volume3
DOIs
StatePublished - 8 Feb 2024

Keywords

  • Artificial Intelligence
  • Commerce
  • Dataset
  • Linear Models
  • Machine Learning

Fingerprint

Dive into the research topics of 'Implication of Different Data Split Ratio on the Performance of Model in Price Prediction of Used Vehicles Using Regression Analysis'. Together they form a unique fingerprint.

Cite this