An Efficient Distributed Algorithm for Big Data Processing

Mohammed S. Al-kahtani; Lutful Karim

doi:10.1007/s13369-016-2405-y

An Efficient Distributed Algorithm for Big Data Processing

Mohammed S. Al-kahtani
, Lutful Karim

Computer Engineering

Seneca College

Research output: Contribution to journal › Article › peer-review

11 Scopus citations

Abstract

This paper introduces an efficient distributed data analysis framework for big data which comprises data processing at the data collecting nodes and the central server end as opposed to the existing framework that only comprises data processing at the central server end. As data are being processed at the data collecting end in the proposed framework, the amount of data is reduced to be processed at the server side by the commodity computers. The proposed distributed algorithm works both in low-powered nodes such as sensors and high-speed commodity computers and also performs sequential and parallel processing based on the amount of data received at the central server. Simulation results demonstrate that the proposed distributed algorithm outperforms traditional distributed algorithms in terms of the size of data to be processed at the central server and data processing time.

Original language	English
Pages (from-to)	3149-3157
Number of pages	9
Journal	Arabian Journal for Science and Engineering
Volume	42
Issue number	8
DOIs	https://doi.org/10.1007/s13369-016-2405-y
State	Published - 1 Aug 2017

Keywords

Big data
Commodity hardware
DBMS
Distributed algorithms
MapReduce
Sensor

Access to Document

10.1007/s13369-016-2405-y

Cite this

@article{5e1f59ffa8fd44e1bf78845f68657649,

title = "An Efficient Distributed Algorithm for Big Data Processing",

abstract = "This paper introduces an efficient distributed data analysis framework for big data which comprises data processing at the data collecting nodes and the central server end as opposed to the existing framework that only comprises data processing at the central server end. As data are being processed at the data collecting end in the proposed framework, the amount of data is reduced to be processed at the server side by the commodity computers. The proposed distributed algorithm works both in low-powered nodes such as sensors and high-speed commodity computers and also performs sequential and parallel processing based on the amount of data received at the central server. Simulation results demonstrate that the proposed distributed algorithm outperforms traditional distributed algorithms in terms of the size of data to be processed at the central server and data processing time.",

keywords = "Big data, Commodity hardware, DBMS, Distributed algorithms, MapReduce, Sensor",

author = "Al-kahtani, \{Mohammed S.\} and Lutful Karim",

note = "Publisher Copyright: {\textcopyright} 2017, King Fahd University of Petroleum \& Minerals.",

year = "2017",

month = aug,

day = "1",

doi = "10.1007/s13369-016-2405-y",

language = "English",

volume = "42",

pages = "3149--3157",

journal = "Arabian Journal for Science and Engineering",

issn = "2193-567X",

publisher = "Springer Nature",

number = "8",

}

TY - JOUR

T1 - An Efficient Distributed Algorithm for Big Data Processing

AU - Al-kahtani, Mohammed S.

AU - Karim, Lutful

PY - 2017/8/1

Y1 - 2017/8/1

N2 - This paper introduces an efficient distributed data analysis framework for big data which comprises data processing at the data collecting nodes and the central server end as opposed to the existing framework that only comprises data processing at the central server end. As data are being processed at the data collecting end in the proposed framework, the amount of data is reduced to be processed at the server side by the commodity computers. The proposed distributed algorithm works both in low-powered nodes such as sensors and high-speed commodity computers and also performs sequential and parallel processing based on the amount of data received at the central server. Simulation results demonstrate that the proposed distributed algorithm outperforms traditional distributed algorithms in terms of the size of data to be processed at the central server and data processing time.

AB - This paper introduces an efficient distributed data analysis framework for big data which comprises data processing at the data collecting nodes and the central server end as opposed to the existing framework that only comprises data processing at the central server end. As data are being processed at the data collecting end in the proposed framework, the amount of data is reduced to be processed at the server side by the commodity computers. The proposed distributed algorithm works both in low-powered nodes such as sensors and high-speed commodity computers and also performs sequential and parallel processing based on the amount of data received at the central server. Simulation results demonstrate that the proposed distributed algorithm outperforms traditional distributed algorithms in terms of the size of data to be processed at the central server and data processing time.

KW - Big data

KW - Commodity hardware

KW - DBMS

KW - Distributed algorithms

KW - MapReduce

KW - Sensor

UR - https://www.scopus.com/pages/publications/85024931747

U2 - 10.1007/s13369-016-2405-y

DO - 10.1007/s13369-016-2405-y

M3 - Article

AN - SCOPUS:85024931747

SN - 2193-567X

VL - 42

SP - 3149

EP - 3157

JO - Arabian Journal for Science and Engineering

JF - Arabian Journal for Science and Engineering

IS - 8

ER -

An Efficient Distributed Algorithm for Big Data Processing

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this