Breast Cancer Prediction using SVM with PCA Feature Selection Method

Authors

  • Akshya Yadav  Computer Engineering Department, MPSTME, NMIMS, Shirpur, District: Dhule, Maharashtra, India
  • Imlikumla Jamir  Computer Engineering Department, MPSTME, NMIMS, Shirpur, District: Dhule, Maharashtra, India
  • Raj Rajeshwari Jain  Computer Engineering Department, MPSTME, NMIMS, Shirpur, District: Dhule, Maharashtra, India
  • Mayank Sohani  Assistant Professor, Computer Engineering Department, MPSTME, NMIMS, Shirpur, Dis-trict: Dhule, Maharashtra, India

DOI:

https://doi.org//10.32628/CSEIT1952277

Keywords:

Breast Cancer, Support Vector Machine, Principle Component Analysis, Min-Max Scaling

Abstract

Cancer has been characterized as one of the leading diseases that cause death in humans. Breast cancer, being a subtype of cancer, causes death in one out of every eight women worldwide. The solution to counter this is by conducting early and accurate diagnosis for faster treatment. To achieve such accuracy in a short span of time proves difficult with existing techniques. Also, the medical tests conducted in hospitals for detecting cancer is expensive and is difficult for any common man to afford. To counter these problems, in this paper, we use the concept of applying Support Vector machine a Machine Learning algorithm to predict whether a person is prone to breast cancer. We evaluate the performance of this algorithm by calculating its accuracy and apply a min-max scaling method so as to counter and overcome the problem of overfitting and outliers. After scaling of the dataset, we apply a feature selection method called Principle component analysis to improve the algorithms accuracy by decreasing the number of parameters. The final algorithm has improved accuracy with the absence of overfitting and outliers, thus this algorithm can be used to develop and build systems that can be deployed in clinics, hospitals and medical centers for early and quick diagnosis of breast cancer. The training dataset is from the University of Wisconsin (UCI) Machine Learning Repository which is used to evaluate the performance of the Support vector machine by calculating its accuracy.

References

  1. Comparative Study of Machine Learn-ing Algorithms for Breast Cancer Detec-tion and Diagnosis Dana Bazazeh1 and Raed Shubair 1,2 1Electrical &amp.
  2. D. Parkin, “Epidemiology of cancer: global patterns and trends” Toxicology Letters. vol. 5, pp. 102-103, 1998.
  3. Meriem Amrane, Saliha Oukid, Breat Cancer Clasification,Using Machine Learn-ing, Proceedings of 2010 IEEE Student Conference on Research and Development (SCOReD 2010), 13 - 14 Dec 2010,Malaysia.
  4. R. Setiono, “Generating concise and accurate classification rules for breast can-cer diagnosis” Artificial Intelligence in Medicine. vol. 18, pp. 205-219,2000
  5. Subhagata Chattopadhyay,”A neuro-fuzzy approach for the diagnosis of de-pression”,Applied Computing and Infor-matics Volume 13, Issue 1, January 2017
  6. https://skymind.ai/wiki/eigenvector
  7. K. Kourou, T. P. Exarchos, K. P. Exarchos, M. V. Karamouzis, and D. I. Fotiadis,“Machine learning applications in cancer prognosis and prediction,” Comput. Struct.,Biotechnol. J., vol. 13, pp. 8-17, 2015.
  8. Noushin Jafarpisheh, Nahid Nafisi “Breast Cancer Relapse Prognosis by Clas-sic and Modern Structures of Machine Learning Algorithms” 2018 6th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS)
  9. Rohit Arora and Suman "Comparative Analysis of Classification Algorithms on Different Datasets using WEKA," 2012 International Journal of Computer Applica-tions (0975 - 8887) Volume 54- No.13, September 2012.
  10. Yu-Len Huang, Kao-Lun Wang “Di-agnosis of breast tumors with ultrasonic texture analysis using support vector ma-chine” Neural Comput & Applic (2006) 15: 164–169 DOI 10.1007/s00521-005-0019-5
  11. A. Soltani Sarvestani, A. A. Safavi “Predicting Breast Cancer Survivability Using Data Mining Techniques” 2010 2nd International Conference on Software Technology and Engineering(ICSTE)
  12. Runjie Shen, Yuanyuan Yan, “Intelli-gent Breast Cancer Prediction model using data mining techniques”, 2014, 6th Interna-tional Conference on Intelliegent Human machine system & Cybernetics, Tongji University Shanghai, China.
  13. Subhagata Chattopadhyay “A neuro-fuzzy approach for the diagnosis of de-pression” Department of Computer Sci-ence and Engineering, National Institute of Science and Technology, Berhampur 761008, Odisha, India.
  14. Liton Chandra Paul, Abdulla Al Sumam, “Face Recognition Using Principal Component Analysis Method” Interna-tional Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Volume 1, Issue 9, November 2012.
  15. M. Lichman, UCI Machine Learning Repositry, 2013. Online]. Availa-ble:https://archive.ics.uci.edu/.
  16. Boulehmi Hela, Mahersia Hela, Ham-rouni Kamel, Breast Cancer Detection ,AReview On Mammograms Analysis Techniques, 2013 10th International Multi-Conference on Systems, Signals & Devices (SSD) Hammamet, Tunisia.
  17. 2014 IEEE 10th International Collo-quium on Signal Processing & its Ap-plications,(CSPA2014), 7 - 9 Mac. 2014, Kuala Lumpur, Malaysia
  18. G. Williams, “Descriptive and Predic-tive Analytics”, Data Min. with Ratt. R Art,Excav. Data Knowl. Discov. Use R, pp. 193-203, 2011.
  19. Muhammad Sufyian Bin Mohd Azmi,Zaihisma Che Cob,”Breast Cancer Prediction Based On Backpropagation Al-gorithm ”,Proceedings of 2010 IEEE Stu-dent Conference on Research and Devel-opment (SCOReD 2010), 13 - 14 Dec 2010,Putrajaya, Malaysia.
  20. Mandeep Kaur, Rajeev Vashisht “Recognition of Facial Expressions with Principal Component Analysis and Singu-lar Value Decomposition” International Journal of Computer Applications (0975 – 8887) Volume 9– No.12, November 2010

Downloads

Published

2019-04-30

Issue

Section

Research Articles

How to Cite

[1]
Akshya Yadav, Imlikumla Jamir, Raj Rajeshwari Jain, Mayank Sohani, " Breast Cancer Prediction using SVM with PCA Feature Selection Method, IInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology(IJSRCSEIT), ISSN : 2456-3307, Volume 5, Issue 2, pp.969-978, March-April-2019. Available at doi : https://doi.org/10.32628/CSEIT1952277