Distributed Data Clustering : A Comparative Analysis
Keywords:
Distributed cluster, Centroid, k-Means, k-Medoid, CLARAAbstract
Distributed computing plays an important role in the Data Mining process. Cluster analysis is one of the most common techniques in data mining. Clustering is a task of grouping a set of objects in such a way that objects is in the same group. Data mining is a function that assigns items in a collection to target categories or classes. There are many different techniques and algorithms are available for distributed data clustering. Cluster analysis itself is not one specific algorithm, but the general task to be solved. Many researchers have proposed clustering algorithms, which work efficiently in the distributed mining. This paper compares the performance of distributed clustering algorithms namely, Distributed k-means algorithm and partition algorithm. In this research paper we have to discuss, the comparative analysis of some of these distributed clustering.
References
- Jiawey Han, MichelineKamber," Data Mining Concepts And Techniques" Morgan Kaufmann Publishers, NewDelhi, 2001.
- Chauhan R, Kaur H, Alam M A, "Data Clustering Method for Discovering Clusters in Spatial Cancer Databases", International Journal of Computer Applications, (0975 – 8887) November2010Vol.10– No.6. .
- DataClusteringAlgorithmsAvailable https://sites.google.com/site/dataclusteringalgorithms/density-based-clustering-algorithm.3(2).
- K.Kameshwaran and K. Malarvizhi, "Survey on Clustering Techniques in Data Mining", International Journal of Computer Science and Information Technologies (0975-9646), Vol. 5(2), 2014.
- G. Olive, R. Setola, and C. Hadjicostis, "Distributed
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT
This work is licensed under a Creative Commons Attribution 4.0 International License.