TY - JOUR
T1 - A new generalized rayleigh distribution with analysis to big data of an online community
AU - Shen, Zhongjie
AU - Alrumayh, Amani
AU - Ahmad, Zubair
AU - Abu-Shanab, Reman
AU - Al - Mutairi, Maha
AU - Aldallal, Ramy
N1 - Publisher Copyright:
© 2022 THE AUTHORS
PY - 2022/12
Y1 - 2022/12
N2 - Big data is a collection of complex and large volumes of data that is not easily handled by the traditional process. The top ten big data science communities include Kaggle, IBM data community, Reddit, open data science, data science central, data community DC, stack overflow, data quest, the data science society, and driven data. Among the online communities, Reddit is a promising online platform that connects millions of people to each other. It is a fruitful online platform that offers business firms to reach the maximum audience. In this paper, we introduce a new extended form of the generalized Rayleigh distribution to model the Reddit advertising and breast cancer data sets. The proposed model is called a new generalized Rayleigh distribution and possesses heavy-tailed properties. The maximum likelihood estimators along with certain mathematical properties are obtained. Finally, the new generalized Rayleigh distribution is applied to the Reddit advertising and breast cancer data sets and its comparisons are done with the other generalized forms of the Rayleigh distribution.
AB - Big data is a collection of complex and large volumes of data that is not easily handled by the traditional process. The top ten big data science communities include Kaggle, IBM data community, Reddit, open data science, data science central, data community DC, stack overflow, data quest, the data science society, and driven data. Among the online communities, Reddit is a promising online platform that connects millions of people to each other. It is a fruitful online platform that offers business firms to reach the maximum audience. In this paper, we introduce a new extended form of the generalized Rayleigh distribution to model the Reddit advertising and breast cancer data sets. The proposed model is called a new generalized Rayleigh distribution and possesses heavy-tailed properties. The maximum likelihood estimators along with certain mathematical properties are obtained. Finally, the new generalized Rayleigh distribution is applied to the Reddit advertising and breast cancer data sets and its comparisons are done with the other generalized forms of the Rayleigh distribution.
KW - Big data
KW - Breast cancer data
KW - Online communities
KW - Rayleigh distribution
KW - Reddit data
KW - Statistical modeling
UR - http://www.scopus.com/inward/record.url?scp=85136300872&partnerID=8YFLogxK
U2 - 10.1016/j.aej.2022.05.010
DO - 10.1016/j.aej.2022.05.010
M3 - Article
AN - SCOPUS:85136300872
SN - 1110-0168
VL - 61
SP - 11523
EP - 11535
JO - Alexandria Engineering Journal
JF - Alexandria Engineering Journal
IS - 12
ER -