Development of Text Clustering Method with K-Means for Analysis of Text Data

Authors

  • R. J. Wadnare  Sant Gadge Baba Amravati University, Amravati, Maharashtra, India
  • Dr. S. S. Sherekar  Sant Gadge Baba Amravati University, Amravati, Maharashtra, India
  • Dr. V. M. Thakare  Sant Gadge Baba Amravati University, Amravati, Maharashtra, India

DOI:

https://doi.org/10.32628/CSEIT217237

Keywords:

K-means, tag clustering algorithm, K-means, latent semanticanalysis (LSA), min-max similarity (MMS), Latent Dirichlet Allocation (LDA).

Abstract

Clustering is a widely used unsupervised data mining technique. In clustering, the main aim is to put similar data objects in one cluster and dissimilar in another cluster. The k-implies is the most famous clustering algorithm because of its effortlessness. But the performance of the k-means clustering algorithm depends upon the parameter selection. Parameter selection like number of cluster and initial cluster center are key of k-means algorithm. Distance augmentation method, density method quadratic clustering methods are utilized to initial cluster selection. This paper examination five unique methods, for example, improved k-means text clustering algorithm, revisiting k-means, LMMK algorithm, SELF-DATA architecture, Clustering Approach for Relation e.t.c. But these techniques have some limitations. To improve these approach, this paper has proposed the development of text clustering method with k-means for analysis of text data.

References

  1. Caiquan Xiong Zhen Hua KeLv Xuan Li“An Improved K-means text clustering algorithm By Optimizing initial cluster centers” International Conference on Cloud Computing and Big Data2016
  2. M.Alhawarat And M. Hegazi“Revisiting K-Means and Topic Modeling, a Comparison Study to Cluster Arabic Documents” IEEE Access2017
  3. Jing Yang and Jun Wang “Tag clustering algorithm LMMSK: improved K-means algorithm based on latent semantic “Journal of Systems Engineering and ElectronicsApril2017
  4. Tania Cerquitelli Evelina Di Corso Francesco Ventura Silvia Chiusano“Data miners’ little helper: Data transformation activity cues for cluster analysis on document collections”ACM Reference format June 2017
  5. P. DAS,A. K. DAS, J. NAYAK, D. PELUSI, W. DING. “A Graph based Clustering Approach for Relation Extraction from Crime Data” IEEE Access.2019

Downloads

Published

2021-04-30

Issue

Section

Research Articles

How to Cite

[1]
R. J. Wadnare, Dr. S. S. Sherekar, Dr. V. M. Thakare, " Development of Text Clustering Method with K-Means for Analysis of Text Data" International Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 2, pp.143-151, March-April-2021. Available at doi : https://doi.org/10.32628/CSEIT217237