TY - JOUR
T1 - ElStream
T2 - An Ensemble Learning Approach for Concept Drift Detection in Dynamic Social Big Data Stream Learning
AU - Abbasi, Ahmad
AU - Javed, Abdul Rehman
AU - Chakraborty, Chinmay
AU - Nebhen, Jamel
AU - Zehra, Wisha
AU - Jalil, Zunera
N1 - Publisher Copyright: © 2013 IEEE.
PY - 2021
Y1 - 2021
N2 - With the rapid growth of communication technologies and smart devices, an enormous surge in data traffic has been observed. A huge amount of data is generated every second by different applications, users, and devices. This rapid generation of data has created the need for solutions that can analyze data changing over time in unforeseen ways, despite resource constraints. These unforeseeable changes in the underlying distribution of streaming data over time are known as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques on both real and artificial data. ElStream uses majority voting, allowing only the optimum classifiers to vote on the decision. Experiments were conducted to evaluate the performance of the proposed approach. According to the experimental analysis, the ensemble learning approach provides consistent performance on both artificial and real-world data sets. The experiments show that ElStream improves accuracy by 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% on the PokerHand, LED, Random RBF, Electricity, and SEA datasets, respectively, compared with previous state-of-the-art studies and conventional machine learning algorithms.
AB - With the rapid growth of communication technologies and smart devices, an enormous surge in data traffic has been observed. A huge amount of data is generated every second by different applications, users, and devices. This rapid generation of data has created the need for solutions that can analyze data changing over time in unforeseen ways, despite resource constraints. These unforeseeable changes in the underlying distribution of streaming data over time are known as concept drifts. This paper presents a novel approach named ElStream that detects concept drift using ensemble and conventional machine learning techniques on both real and artificial data. ElStream uses majority voting, allowing only the optimum classifiers to vote on the decision. Experiments were conducted to evaluate the performance of the proposed approach. According to the experimental analysis, the ensemble learning approach provides consistent performance on both artificial and real-world data sets. The experiments show that ElStream improves accuracy by 12.49%, 11.98%, 10.06%, 1.2%, and 0.33% on the PokerHand, LED, Random RBF, Electricity, and SEA datasets, respectively, compared with previous state-of-the-art studies and conventional machine learning algorithms.
KW - big data
KW - ensemble learning
KW - Internet of Things
KW - online learning
KW - smart concept drift
KW - social data
UR - https://www.scopus.com/pages/publications/85105113305
U2 - 10.1109/ACCESS.2021.3076264
DO - 10.1109/ACCESS.2021.3076264
M3 - Article
AN - SCOPUS:85105113305
SN - 2169-3536
VL - 9
SP - 66408
EP - 66419
JO - IEEE Access
JF - IEEE Access
M1 - 9417164
ER -