TY - JOUR
T1 - A Novel Price Prediction Service for E-Commerce Categorical Data
AU - Fathalla, Ahmed
AU - Salah, Ahmad
AU - Ali, Ahmed
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/4
Y1 - 2023/4
N2 - Most e-commerce data include items that belong to different categories, e.g., product types on Amazon and eBay. The accurate prediction of an item’s price on an e-commerce platform will facilitate the maximization of economic benefits for the seller and buyer. Consequently, the task of price prediction of e-commerce items can be seen as a multiple regression on categorical data. Performing multiple regression tasks with categorical independent variables is tricky since the observations of each product type might have different distribution shapes, whereas the distribution shape of all the data might not be representative of each group. In this vein, we propose a service for facilitating the price prediction task of e-commerce categorical products. The main novelty of the proposed service relies on two unique data transformations aiming at increasing the between-group variance and decreasing the within-group variance to improve the task of regression analysis on categorical data. The proposed data transformations are tested on four different e-commerce datasets over a set of linear, non-linear, and neural network-based regression models. Comparing the best existing regression models without applying the proposed transformation, the proposed transformation results show improvements in the range of 1.98% to 8.91% for the four evaluation metrics scores, namely, (Formula presented.), MAE, RMSE, and MAPE. However, the best metrics improvement on each dataset has average values of 16.8%, 8.0%, 6.0%, and 25.0% for (Formula presented.), MAE, RMSE, and MAPE, respectively.
AB - Most e-commerce data include items that belong to different categories, e.g., product types on Amazon and eBay. The accurate prediction of an item’s price on an e-commerce platform will facilitate the maximization of economic benefits for the seller and buyer. Consequently, the task of price prediction of e-commerce items can be seen as a multiple regression on categorical data. Performing multiple regression tasks with categorical independent variables is tricky since the observations of each product type might have different distribution shapes, whereas the distribution shape of all the data might not be representative of each group. In this vein, we propose a service for facilitating the price prediction task of e-commerce categorical products. The main novelty of the proposed service relies on two unique data transformations aiming at increasing the between-group variance and decreasing the within-group variance to improve the task of regression analysis on categorical data. The proposed data transformations are tested on four different e-commerce datasets over a set of linear, non-linear, and neural network-based regression models. Comparing the best existing regression models without applying the proposed transformation, the proposed transformation results show improvements in the range of 1.98% to 8.91% for the four evaluation metrics scores, namely, (Formula presented.), MAE, RMSE, and MAPE. However, the best metrics improvement on each dataset has average values of 16.8%, 8.0%, 6.0%, and 25.0% for (Formula presented.), MAE, RMSE, and MAPE, respectively.
KW - E-commerce
KW - between-group variance
KW - categorical data
KW - data transformation
KW - price prediction
KW - within-group variance
UR - https://www.scopus.com/pages/publications/85153763461
U2 - 10.3390/math11081938
DO - 10.3390/math11081938
M3 - Article
AN - SCOPUS:85153763461
SN - 2227-7390
VL - 11
JO - Mathematics
JF - Mathematics
IS - 8
M1 - 1938
ER -