TY - GEN
T1 - Errors Detection Mechanism in Big Data
AU - Eljialy, A. E.M.
AU - Ahmad, Sultan
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - The rapid growth of information technologies has not been exempt from errors in the transmission and storage of data, which has affected the services, offered to customers on big data platforms. In these platforms, it has been particularly difficult to identify the causes given their complex nature, the large size and the growth of the data set. The detection and correction of errors in the processing of information in big data platforms are essential to guarantee the integrity of the stored data and thus give users confidence that the information is correctly stored without alterations. However, the options that exist to detect errors of this type lack efficiency in the methods used or use third party software that does not always allow determining the cause of the errors. In this research, big data analysis models are used to find bottlenecks of data processing management. Further, it is developed an alternative error detection mechanism that is based on analyzing error-log file with a variation of Fuzzy clustering mechanism. This approach, based on text mining, helps to look for error frequency and entity that had caused it. Moreover, it is shown how the errors that are presented are distributed according to the source.
AB - The rapid growth of information technologies has not been exempt from errors in the transmission and storage of data, which has affected the services, offered to customers on big data platforms. In these platforms, it has been particularly difficult to identify the causes given their complex nature, the large size and the growth of the data set. The detection and correction of errors in the processing of information in big data platforms are essential to guarantee the integrity of the stored data and thus give users confidence that the information is correctly stored without alterations. However, the options that exist to detect errors of this type lack efficiency in the methods used or use third party software that does not always allow determining the cause of the errors. In this research, big data analysis models are used to find bottlenecks of data processing management. Further, it is developed an alternative error detection mechanism that is based on analyzing error-log file with a variation of Fuzzy clustering mechanism. This approach, based on text mining, helps to look for error frequency and entity that had caused it. Moreover, it is shown how the errors that are presented are distributed according to the source.
KW - Big Data
KW - Errors Detection
KW - Fuzzy clustering
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85080069690&partnerID=8YFLogxK
U2 - 10.1109/ICSSIT46314.2019.8987890
DO - 10.1109/ICSSIT46314.2019.8987890
M3 - Conference contribution
AN - SCOPUS:85080069690
T3 - Proceedings of the 2nd International Conference on Smart Systems and Inventive Technology, ICSSIT 2019
SP - 323
EP - 328
BT - Proceedings of the 2nd International Conference on Smart Systems and Inventive Technology, ICSSIT 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Smart Systems and Inventive Technology, ICSSIT 2019
Y2 - 27 November 2019 through 29 November 2019
ER -