Case Study on Static k-Means Clustering Algorithm

Authors

  • Dr. Chatti Subba Lakshmi  Department of CSE, Guru Nanak Institutions, Hyderabad, India

Keywords:

Data Clustering, Partitioning Clustering, K-Means Clustering Algorithm, Static

Abstract

Data clustering is frequent research problem in many data mining applications. In this view, many clustering methods proposed in literature. One type of clustering is partitioning method which is centroid based technique. In this paper we are presenting the case study on conventional or static k-means partition clustering algorithm. Here we used static means the basic input parameter given to k-means is number of cluster (k), which constant for complete execution of data set. We need to decide the k values before algorithm starts and It does not changes, when there is a change in data set. We considered the some cases like distance measures, what is right number of clusters and relations between the algorithm parameters. We executed k-means algorithm on small data set and large data set and we presented the detailed steps for each case by showing the results

References

  1. Bailey, Ken (1994). "Numerical Taxonomy and Cluster Analysis". Typologies and Taxonomies. p. 34. ISBN 9780803952591.
  2. Tryon, Robert C. (1939). Cluster Analysis: Correlation Profile and Orthometric (factor) Analysis for the Isolation of Unities in Mind and Personality. Edwards Brothers.
  3. Cattell, R. B. (1943). "The description of personality: Basic traits resolved into clusters". Journal of Abnormal and Social Psychology 38: 476-506.
  4. Estivill-Castro, Vladimir (20 June 2002). "Why so many clustering algorithms — A Position Paper". ACM SIGKDD Explorations Newsletter 4 (1): 65-75.doi:10.1145/568574.568575
  5. Lloyd, S. (1982). "Least squares quantization in PCM". IEEE Transactions on Information Theory 28 (2): 129-137. doi:10.1109/TIT.1982.1056489M. Young, The Technical Writer's Handbook. Mill Valley, CA: University Science, 1989.
  6. Elena Deza & Michel Marie Deza (2009) Encyclopedia of Distances, page 94, Springer.
  7. Eugene F. Krause (1987). Taxicab Geometry. Dover. ISBN 0-486-25202-7
  8. Yuli Zhang,Huaiyu Wu,Lei Cheng (June 2012). "Some new deformation formulas about variance and covariance". Proceedings of 4th International Conference on Modelling, Identification and Control(ICMIC2012). pp. 987-992.
  9. Goodman, Leo A., "The variance of the product of K random variables," Journal of the American Statistical Association, March 1962, 54ff.

Downloads

Published

2018-02-28

Issue

Section

Research Articles

How to Cite

[1]
Dr. Chatti Subba Lakshmi, " Case Study on Static k-Means Clustering Algorithm, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 3, Issue 1, pp.1160-1167, January-February-2018.