Efficient enhanced k-means clustering algorithm

A. M. Fahim; A. M. Salem; F. A. Torkey; M. A. Ramadan

doi:10.1631/jzus.2006.A1626

Research output: Contribution to journal › Article › peer-review

266 Scopus citations

Abstract

In k-means clustering, we are given a set of n data points in d-dimensional space ℝ^d and an integer k, and the problem is to determine a set of k points in ℝ^d, called centers, that minimizes the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call the enhanced k-means algorithm. The algorithm is easy to implement: it requires only a simple data structure that keeps some information from each iteration for use in the next. Our experimental results demonstrate that the scheme improves the computational speed of the k-means algorithm, in terms of both the total number of distance calculations and the overall computation time.
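
The abstract does not spell out the data structure it mentions. As a minimal sketch of one plausible reading, the Python below assumes the stored information is two per-point arrays: the current cluster label and the squared distance to that cluster's center. After each center update, a point whose distance to its own (moved) center has not grown keeps its label at the cost of one distance calculation; only otherwise are all k centers searched. The names `enhanced_kmeans`, `labels`, `cached` and `n_dist` are illustrative, not taken from the paper, and the shortcut is a heuristic, so its result may differ from that of standard k-means.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest(point, centers):
    """Return (index, squared distance) of the center closest to point."""
    dists = [sq_dist(point, c) for c in centers]
    j = min(range(len(centers)), key=dists.__getitem__)
    return j, dists[j]


def enhanced_kmeans(points, init_centers, max_iter=100):
    """Sketch of k-means with per-point cached labels and distances.

    Returns (centers, labels, n_dist), where n_dist counts the
    point-to-center distance evaluations that were performed.
    """
    centers = [list(c) for c in init_centers]
    k = len(centers)

    # Assumed data structure: one label and one cached squared distance
    # per point, carried over from each iteration to the next.
    labels, cached = [], []
    for p in points:
        j, d = nearest(p, centers)
        labels.append(j)
        cached.append(d)
    n_dist = len(points) * k

    for _ in range(max_iter):
        # Update step: each center becomes the mean of its members.
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    [sum(col) / len(members) for col in zip(*members)]
                )
            else:
                new_centers.append(centers[j])  # empty cluster: keep center
        if new_centers == centers:
            break  # centers stopped moving
        centers = new_centers

        # Assignment step with the cached-distance shortcut.
        for i, p in enumerate(points):
            d = sq_dist(p, centers[labels[i]])
            n_dist += 1
            if d <= cached[i]:
                cached[i] = d  # own center is no farther: keep the label
            else:
                labels[i], cached[i] = nearest(p, centers)  # full search
                n_dist += k
    return centers, labels, n_dist
```

For example, `enhanced_kmeans(points, [points[0], points[-1]])` seeds the run with the first and last data points as initial centers. A standard k-means pass would evaluate n·k distances in every assignment step, whereas this sketch spends a single evaluation on each point that stays in place.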

Original language	English
Pages (from-to)	1626-1633
Number of pages	8
Journal	Journal of Zhejiang University: Science
Volume	7
Issue number	10
DOIs	https://doi.org/10.1631/jzus.2006.A1626
State	Published - Oct 2006
Externally published	Yes

Keywords

Cluster analysis
Clustering algorithms
Data analysis
k-means algorithm

Cite this

@article{3100aa706bc54c49895c4d6adcb71e9f,

title = "Efficient enhanced k-means clustering algorithm",

abstract = "In k-means clustering, we are given a set of n data points in d-dimensional space $\mathbb{R}^d$ and an integer k, and the problem is to determine a set of k points in $\mathbb{R}^d$, called centers, that minimizes the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call the enhanced k-means algorithm. The algorithm is easy to implement: it requires only a simple data structure that keeps some information from each iteration for use in the next. Our experimental results demonstrate that the scheme improves the computational speed of the k-means algorithm, in terms of both the total number of distance calculations and the overall computation time.",

keywords = "Cluster analysis, Clustering algorithms, Data analysis, k-means algorithm",

author = "Fahim, {A. M.} and Salem, {A. M.} and Torkey, {F. A.} and Ramadan, {M. A.}",

year = "2006",

month = oct,

doi = "10.1631/jzus.2006.A1626",

language = "English",

volume = "7",

pages = "1626--1633",

journal = "Journal of Zhejiang University: Science",

issn = "1009-3095",

publisher = "Zhejiang University",

number = "10",

}

TY - JOUR

T1 - Efficient enhanced k-means clustering algorithm

AU - Fahim, A. M.

AU - Salem, A. M.

AU - Torkey, F. A.

AU - Ramadan, M. A.

PY - 2006/10

Y1 - 2006/10

N2 - In k-means clustering, we are given a set of n data points in d-dimensional space ℝ^d and an integer k, and the problem is to determine a set of k points in ℝ^d, called centers, that minimizes the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call the enhanced k-means algorithm. The algorithm is easy to implement: it requires only a simple data structure that keeps some information from each iteration for use in the next. Our experimental results demonstrate that the scheme improves the computational speed of the k-means algorithm, in terms of both the total number of distance calculations and the overall computation time.

AB - In k-means clustering, we are given a set of n data points in d-dimensional space ℝ^d and an integer k, and the problem is to determine a set of k points in ℝ^d, called centers, that minimizes the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call the enhanced k-means algorithm. The algorithm is easy to implement: it requires only a simple data structure that keeps some information from each iteration for use in the next. Our experimental results demonstrate that the scheme improves the computational speed of the k-means algorithm, in terms of both the total number of distance calculations and the overall computation time.

KW - Cluster analysis

KW - Clustering algorithms

KW - Data analysis

KW - k-means algorithm

UR - https://www.scopus.com/pages/publications/33750596732

U2 - 10.1631/jzus.2006.A1626

DO - 10.1631/jzus.2006.A1626

M3 - Article

AN - SCOPUS:33750596732

SN - 1009-3095

VL - 7

SP - 1626

EP - 1633

JO - Journal of Zhejiang University: Science

JF - Journal of Zhejiang University: Science

IS - 10

ER -
