Clustering Analysis using an Unsupervised Machine Learning Method

Authors

  • Tashfin Ansari  Computer Science and Engineering, P.E.S. College of Engineering, Aurangabad, Maharashtra, India
  • Dr. Almas Siddiqui  Assistant Professor, Vivekanand College, Aurangabad, Maharashtra, India
  • Awasthi G. K  Assistant Professor, Vivekanand College, Aurangabad, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT12173174

Keywords:

Machine Learning (ML), Artificial Intelligence (AI), K- Means Clustering, Classification, Unsupervised Learning.

Abstract

Artificial Intelligence (AI) and Machine Learning (ML), which are becoming a part of interest rapidly for various researchers. ML is the field of Computer Science study, which gives capability to learn without being absolutely programmed. This work focuses on the standard k-means clustering algorithm and analysis the shortcomings of the standard k-means algorithm. The k-means clustering algorithm calculates the distance between each data object and not all cluster centres in every iteration, which makes the efficiency of clustering is high. In this work, we have to try to improve the k-means algorithm to solve simple data to store some information in every iteration, which is to be used in the next interaction. This method avoids computing distance of data object to the cluster centre repeatedly, saving the running time. An experimental result shows the enhanced speed of clustering, accuracy, reducing the computational complexity of the k-means. In this, we have work on iris dataset extracted from Kaggle.

References

  1. Bhattacharya, Sambit & Czejdo, Bogdan & Agrawal, Rajeev & Erdemir, Erdem & Gokaraju, Balakrishna. (2018). 1-4. 10.1109/SECON.2018.8479098. Sambit Bhattacharya,
  2. Mohamed Alloghani, Dhiya Al-Jumeily, Jamila Mustafina “A Systematic Review on Supervised and Unsupervised Machine Learning Algorithms for Data Science” January 2020 DOI: 10.1007/978-3-030-22475-2_1 In book: Supervised and Unsupervised Learning for Data Science (pp.3-21)
  3. https://www.ibm.com/cloud/learn/unsupervised-learning
  4. L. B. Goncalves, M. M. B. R. Vellasco, M. A. C. Pacheco and Flavio Joaquim de Souza, "Inverted hierarchical neuro-fuzzy BSP system: a novel neuro-fuzzy model for pattern classification and rule extraction in databases," in IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 36, no. 2, pp. 236-248, March 2006.
  5. P. H. Ahmad and S. Dang, "Performance evaluation of clustering algorithm using different datasets", Int. J. Adv. Res. Comput. Sci. Manag. Stud., vol. 3, no. 1, pp. 167-173, 2015.
  6. Panahi N, Shayesteh MG, Mihandoost S, Zali Varghahan B, "Recognition of different datasets using PCA, LDA, and various classifiers", In 5th International Conference on Application of Information and Communication Technologies (AICT), Baku, Azerbaijan, 2011; 1– 5.
  7. U. Tiankaew, P. Chunpongthong and V. Mettanant, "A Food Photography App with Image Recognition for Thai Food," 2018 Seventh ICT International Student Project Conference (ICT-ISPC), Nakhonpathom, 2018, pp. 1-6.
  8. Dang, Shilpa. (2015). Performance Evaluation of Clustering Algorithm Using Different Datasets. IJARCSMS. 3. 167-173. R. Nicole, “Title of paper with only first word capitalized,” J. Name Stand. Abbrev., in press.
  9. https://www.kaggle.com/arshid/iris-flower-dataset
  10. JAIN A K, DUBES R C. Algorithms for clustering data[M].New Jersey:Prentice-Hall,1988.
  11. ZhangYufang etc. A kind of improved K-means algorithm [J]. Computer Application,p31Ěš33, 2003, (8).
  12. Yu Yang “A study of pattern recognition of Iris flower based on Machine Learning” Degree Program: Information Technology | Specialization: Internet Technology 2013
  13. Dataset Tanvi Gupta, Supriya P. Panda2 “A Comparison of K-Means Clustering Algorithm and CLARA Clustering Algorithm on Iris” International Journal of Engineering & Technology, 7 (4) (2018) 4766-4768 International Journal of Engineering & Technology Website: www.sciencepubco.com/index.php/IJET doi: 10.14419/ijet. v7i4.21472
  14. K. Maheswari, “Finding Best Possible Number of Clusters using K-Means Algorithm”, International Journal of Engineering and Advanced Technology (IJEAT), ISSN: 2249 – 8958, Volume-9, Issue-1S4, December-2019.

Downloads

Published

2021-06-30

Issue

Section

Research Articles

How to Cite

[1]
Tashfin Ansari, Dr. Almas Siddiqui, Awasthi G. K, " Clustering Analysis using an Unsupervised Machine Learning Method, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 7, Issue 3, pp.602-609, May-June-2021. Available at doi : https://doi.org/10.32628/CSEIT12173174