TY - JOUR
T1 - Prediction of flood severity level via processing IoT sensor data using a data science approach
AU - Khan, Wasiq
AU - Hussain, Abir Jaafar
AU - Alaskar, Haya
AU - Baker, Thar
AU - Ghali, Fawaz
AU - A-Jumeily, Dhiya
AU - Al-Shamma'a, Ahmed
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2020/12
Y1 - 2020/12
N2 - River flooding is deemed a catastrophic phenomenon caused by extreme climate change and other ecological factors (e.g., amount of sunlight), which are difficult to predict and monitor. However, the use of the Internet of Things (IoT), various types of sensing including social sensing, 5G wireless communication, and big data analysis has devised advanced tools for early prediction and management of distrust events. To this end, this article amalgamates machine learning models and data analytics approaches along-with IoT sensor data to investigate attribute importance for the prediction of risk levels in flooding. The article presents three river levels: normal, medium, and high-risk river levels for machine learning models. Performance is evaluated with varying configurations and evaluations set up including training and testing of support vector machine and random forest using a principal-components-analysis-based dimension reduced dataset. In addition, we investigate the use of synthetic minority oversampling technique to balance the class representations within a dataset. As expected, the results indicate that a 'balanced' representation of data samples achieved high accuracy (nearly 93 percent) when benchmarked with 'imbalanced' data samples using random forest classifier 10-fold cross-validations.
AB - River flooding is deemed a catastrophic phenomenon caused by extreme climate change and other ecological factors (e.g., amount of sunlight), which are difficult to predict and monitor. However, the use of the Internet of Things (IoT), various types of sensing including social sensing, 5G wireless communication, and big data analysis has devised advanced tools for early prediction and management of distrust events. To this end, this article amalgamates machine learning models and data analytics approaches along-with IoT sensor data to investigate attribute importance for the prediction of risk levels in flooding. The article presents three river levels: normal, medium, and high-risk river levels for machine learning models. Performance is evaluated with varying configurations and evaluations set up including training and testing of support vector machine and random forest using a principal-components-analysis-based dimension reduced dataset. In addition, we investigate the use of synthetic minority oversampling technique to balance the class representations within a dataset. As expected, the results indicate that a 'balanced' representation of data samples achieved high accuracy (nearly 93 percent) when benchmarked with 'imbalanced' data samples using random forest classifier 10-fold cross-validations.
UR - https://www.scopus.com/pages/publications/85183182030
U2 - 10.1109/IOTM.0001.1900110
DO - 10.1109/IOTM.0001.1900110
M3 - Article
AN - SCOPUS:85183182030
SN - 2576-3180
VL - 3
SP - 10
EP - 15
JO - IEEE Internet of Things Magazine
JF - IEEE Internet of Things Magazine
IS - 4
M1 - 9319623
ER -