A new generalized rayleigh distribution with analysis to big data of an online community

Zhongjie Shen, Amani Alrumayh, Zubair Ahmad, Reman Abu-Shanab, Maha Al - Mutairi, Ramy Aldallal

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Big data is a collection of complex and large volumes of data that is not easily handled by the traditional process. The top ten big data science communities include Kaggle, IBM data community, Reddit, open data science, data science central, data community DC, stack overflow, data quest, the data science society, and driven data. Among the online communities, Reddit is a promising online platform that connects millions of people to each other. It is a fruitful online platform that offers business firms to reach the maximum audience. In this paper, we introduce a new extended form of the generalized Rayleigh distribution to model the Reddit advertising and breast cancer data sets. The proposed model is called a new generalized Rayleigh distribution and possesses heavy-tailed properties. The maximum likelihood estimators along with certain mathematical properties are obtained. Finally, the new generalized Rayleigh distribution is applied to the Reddit advertising and breast cancer data sets and its comparisons are done with the other generalized forms of the Rayleigh distribution.

Original languageEnglish
Pages (from-to)11523-11535
Number of pages13
JournalAlexandria Engineering Journal
Volume61
Issue number12
DOIs
StatePublished - Dec 2022

Keywords

  • Big data
  • Breast cancer data
  • Online communities
  • Rayleigh distribution
  • Reddit data
  • Statistical modeling

Fingerprint

Dive into the research topics of 'A new generalized rayleigh distribution with analysis to big data of an online community'. Together they form a unique fingerprint.

Cite this