Distributed Data Clustering : A Comparative Analysis

Authors

  • V. Maria Antoniate Martin  Research Scholar, Department of Computer Science, Research and Development Centre, Bharathiar University, Coimbatore, Tamil Nadu, India
  • Dr. K. David  Assistant Professor, Department of Computer Science, The Rajah’s College, Pudukkottai, Tamil Nadu, India
  • B. Merlinsuganthi  Student, Department of Information Technology, St. Joseph’s College, Trichy, Tamil Nadu, India

Keywords:

Distributed cluster, Centroid, k-Means, k-Medoid, CLARA

Abstract

Distributed computing plays an important role in the Data Mining process. Cluster analysis is one of the most common techniques in data mining. Clustering is a task of grouping a set of objects in such a way that objects is in the same group. Data mining is a function that assigns items in a collection to target categories or classes. There are many different techniques and algorithms are available for distributed data clustering. Cluster analysis itself is not one specific algorithm, but the general task to be solved. Many researchers have proposed clustering algorithms, which work efficiently in the distributed mining. This paper compares the performance of distributed clustering algorithms namely, Distributed k-means algorithm and partition algorithm. In this research paper we have to discuss, the comparative analysis of some of these distributed clustering.

References

  1. Jiawey Han, MichelineKamber," Data Mining Concepts And Techniques" Morgan Kaufmann Publishers, NewDelhi, 2001.
  2. Chauhan R, Kaur H, Alam M A, "Data Clustering Method for Discovering Clusters in Spatial Cancer Databases", International Journal of Computer Applications, (0975 – 8887) November2010Vol.10– No.6. .
  3. DataClusteringAlgorithmsAvailable https://sites.google.com/site/dataclusteringalgorithms/density-based-clustering-algorithm.3(2).
  4. K.Kameshwaran and K. Malarvizhi, "Survey on Clustering Techniques in Data Mining", International Journal of Computer Science and Information Technologies (0975-9646), Vol. 5(2), 2014.
  5. G. Olive, R. Setola, and C. Hadjicostis, "Distributed

Downloads

Published

2018-04-30

Issue

Section

Research Articles

How to Cite

[1]
V. Maria Antoniate Martin, Dr. K. David, B. Merlinsuganthi, " Distributed Data Clustering : A Comparative Analysis, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 3, pp.860-865, March-April-2018.