Development of Text Clustering Method with K-Means for Analysis of Text Data
DOI:
https://doi.org/10.32628/CSEIT217237Keywords:
K-means, tag clustering algorithm, K-means, latent semanticanalysis (LSA), min-max similarity (MMS), Latent Dirichlet Allocation (LDA).Abstract
Clustering is a widely used unsupervised data mining technique. In clustering, the main aim is to put similar data objects in one cluster and dissimilar in another cluster. The k-implies is the most famous clustering algorithm because of its effortlessness. But the performance of the k-means clustering algorithm depends upon the parameter selection. Parameter selection like number of cluster and initial cluster center are key of k-means algorithm. Distance augmentation method, density method quadratic clustering methods are utilized to initial cluster selection. This paper examination five unique methods, for example, improved k-means text clustering algorithm, revisiting k-means, LMMK algorithm, SELF-DATA architecture, Clustering Approach for Relation e.t.c. But these techniques have some limitations. To improve these approach, this paper has proposed the development of text clustering method with k-means for analysis of text data.
References
- Caiquan Xiong Zhen Hua KeLv Xuan Li“An Improved K-means text clustering algorithm By Optimizing initial cluster centers” International Conference on Cloud Computing and Big Data2016
- M.Alhawarat And M. Hegazi“Revisiting K-Means and Topic Modeling, a Comparison Study to Cluster Arabic Documents” IEEE Access2017
- Jing Yang and Jun Wang “Tag clustering algorithm LMMSK: improved K-means algorithm based on latent semantic “Journal of Systems Engineering and ElectronicsApril2017
- Tania Cerquitelli Evelina Di Corso Francesco Ventura Silvia Chiusano“Data miners’ little helper: Data transformation activity cues for cluster analysis on document collections”ACM Reference format June 2017
- P. DAS,A. K. DAS, J. NAYAK, D. PELUSI, W. DING. “A Graph based Clustering Approach for Relation Extraction from Crime Data” IEEE Access.2019
Downloads
Published
Issue
Section
License
Copyright (c) IJSRCSEIT

This work is licensed under a Creative Commons Attribution 4.0 International License.