Development of Text Clustering Method with K-Means for Analysis of Text Data

R. J. Wadnare; Dr. S. S. Sherekar; Dr. V. M. Thakare

doi:10.32628/CSEIT217237

Authors

R. J. Wadnare Sant Gadge Baba Amravati University, Amravati, Maharashtra, India
Dr. S. S. Sherekar Sant Gadge Baba Amravati University, Amravati, Maharashtra, India
Dr. V. M. Thakare Sant Gadge Baba Amravati University, Amravati, Maharashtra, India

DOI:

https://doi.org/10.32628/CSEIT217237

Keywords:

K-means, tag clustering algorithm, K-means, latent semanticanalysis (LSA), min-max similarity (MMS), Latent Dirichlet Allocation (LDA).

Abstract

Clustering is a widely used unsupervised data mining technique. In clustering, the main aim is to put similar data objects in one cluster and dissimilar in another cluster. The k-implies is the most famous clustering algorithm because of its effortlessness. But the performance of the k-means clustering algorithm depends upon the parameter selection. Parameter selection like number of cluster and initial cluster center are key of k-means algorithm. Distance augmentation method, density method quadratic clustering methods are utilized to initial cluster selection. This paper examination five unique methods, for example, improved k-means text clustering algorithm, revisiting k-means, LMMK algorithm, SELF-DATA architecture, Clustering Approach for Relation e.t.c. But these techniques have some limitations. To improve these approach, this paper has proposed the development of text clustering method with k-means for analysis of text data.

References

Caiquan Xiong Zhen Hua KeLv Xuan Li“An Improved K-means text clustering algorithm By Optimizing initial cluster centers” International Conference on Cloud Computing and Big Data2016
M.Alhawarat And M. Hegazi“Revisiting K-Means and Topic Modeling, a Comparison Study to Cluster Arabic Documents” IEEE Access2017
Jing Yang and Jun Wang “Tag clustering algorithm LMMSK: improved K-means algorithm based on latent semantic “Journal of Systems Engineering and ElectronicsApril2017
Tania Cerquitelli Evelina Di Corso Francesco Ventura Silvia Chiusano“Data miners’ little helper: Data transformation activity cues for cluster analysis on document collections”ACM Reference format June 2017
P. DAS,A. K. DAS, J. NAYAK, D. PELUSI, W. DING. “A Graph based Clustering Approach for Relation Extraction from Crime Data” IEEE Access.2019

Development of Text Clustering Method with K-Means for Analysis of Text Data

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite