Breast Cancer Classification Using Machine Learning

Authors

  • Ankit Assistant Professor, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author
  • Harsh Bansal B.Tech Scholar, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author
  • Dhruva Arora B.Tech Scholar, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author
  • Kanak Soni B.Tech Scholar, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author
  • Rishita Chugh B.Tech Scholar, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author
  • Swarna Jaya Vardhan B.Tech Scholar, Department of CSE, Lovely Professional University, Phagwara, Punjab, India Author

DOI:

https://doi.org/10.32628/CSEIT2410274

Keywords:

Breast Cancer Classification, Convolutional Neural Networks, Naïve Bayesian Classifier, k-Nearest Neighbors

Abstract

In the pursuit of precise forecasts in machine learning-based breast cancer categorization, a plethora of algorithms and optimizers have been explored. Convolutional Neural Networks (CNNs) have emerged as a prominent choice, excelling in discerning hierarchical representations in image data. This attribute renders them apt for tasks such as detecting malignant lesions in mammograms. Furthermore, the adaptability of CNN architectures enables customization tailored to specific datasets and objectives, enhancing early detection and treatment strategies. Despite the efficacy of screening mammography, the persistence of false positives and negatives poses challenges. Computer-Aided Design (CAD) software has shown promise, albeit early systems exhibited limited improvements. Recent strides in deep learning offer optimism for heightened accuracy, with studies demonstrating comparable performance to radiologists. Nonetheless, the detection of sub-clinical cancer remains arduous, primarily due to small tumor sizes. The amalgamation of fully annotated datasets with larger ones lacking Region of Interest (ROI) annotations is pivotal for training robust deep learning models. This review delves into recent high-throughput analyses of breast cancers, elucidating their implications for refining classification methodologies through deep learning. Furthermore, this research facilitates the prediction of whether cancer is benign or malignant, fostering advancements in diagnostic accuracy and patient care.

Downloads

Download data is not yet available.

References

American Cancer Society, Cancer Facts & Figures 2020, no. 4, American Cancer Society, Atlanta, 2020.

A.Sennoga, “Ultrasound imaging,” in Bioengineering Innovative Solutions for Cancer, pp. 123–161, Academic Press, 2020. DOI: https://doi.org/10.1016/B978-0-12-813886-1.00007-3

R. L. Siegel, K. D. Miller, and A. Jemal, "Cancer statistics, 2019," CA: a cancer journal for clinicians, vol. 69, no. 1, pp. 7-34, 2019. DOI: https://doi.org/10.3322/caac.21551

Aisha Patel, MBBS, MRCP . Howard (Jack) West, MD.London North West University Healthcare NHS Trust, London, United Kingdom.

U.S. Cancer Statistics Working Group. United States Cancer Statistics: 1999–2008 Incidence and Mortality Web-based Report. Atlanta (GA): Department of Health and Human Services, Centers for Disease Control and Prevention, and National Cancer Institute; 2012.

Siegel RL, Miller KD, Jemal A. Cancer Statistics , 2016. 2016;00(00):1-24. doi:10.3322/caac.21332. DOI: https://doi.org/10.3322/caac.21332

“Globocan 2012 - Home.” [Online]. Available: http://globocan.iarc.fr/Default.aspx. [Accessed: 28-Dec-2015].

Asri H, Mousannif H, Al Moatassime H, Noel T. Big data in healthcare: Challenges and opportunities. 2015 Int Conf Cloud Technol Appl. 2015:1-7. doi:10.1109/CloudTech.2015.7337020. DOI: https://doi.org/10.1109/CloudTech.2015.7337020

L. Adi Tarca, V.J.C., X. Chen, R. Romero, S. Drăghici, "Machine Learning and Its Applications to Biology", PLoS Comput Biol,, Vol. 3, pp. 116- 122, 2007. DOI: https://doi.org/10.1371/journal.pcbi.0030116

JF McCarthy, M.K., PE Hoffman, "Applications of machine learning and high-dimensional visualization in cancer detection, diagnosis, and management", Ann N Y Acad Sci, Vol.62, pp. 10201259, 2004.

AC. Tan, D. Gilbert, "Ensemble machine learning on gene expression data for cancer classification", Appl. Bioinform, Vol. 2, pp. 75-83, 2003.

S. Kanta Sarkar, A.N., "Identifying patients at risk of breast cancer through decision trees", International Journal of Advanced Research in Computer Science. Vol. 08, pp. 88-96, 2017. DOI: https://doi.org/10.26483/ijarcs.v8i8.4602

JA. Cruz, W.D, "Applications of Machine Learning in Cancer Prediction and Prognosis". Cancer Inform, Vol. 2, pp. 56-77, 2006. DOI: https://doi.org/10.1177/117693510600200030

M. Sugiyama, "Introduction to Statistical Machine Learning "1ed, ed. T. Green: Morgan Kaufmann, 2006.

L. Breiman, "Random Forests," Machine Learning, vol. 45, p. 5–32, 2001. DOI: https://doi.org/10.1023/A:1010933404324

E. Alickovic and A. Subasi, "Medical Decision Support System for Diagnosis of Heart Arrhythmia using DWT and Random Forests Classifier," Journal of Medical Systems, vol. 40, no. 108, 2016. DOI: https://doi.org/10.1007/s10916-016-0467-8

E. Alickovic and A. Subasi, "Breast cancer diagnosis using GA feature selection and Rotation Forest," Neural Computing and Applications, pp. 1-11, 2015. DOI: https://doi.org/10.1007/s00521-015-2103-9

L. Rokach and O. Maimon, Data Mining and Knowledge Discovery Handbook, 2nd ed., M. Oded and R. Lior, Eds., New York: Springer, 2010.

A. Lavecchia, "machine-learning approaches in the context of ligand-based virtual screening for addressing complex compound classification problems and predicting n

https://books.google.co.in/books?hl=en&lr=&id=c-xzkDMDev0C&oi=fnd&pg=PR2&dq=numpy&ots=Z8PKCWqeuh&sig=Gb7ti9uUKPFNrhYh3mCKJWHv51c&redir_esc=y#v=onepage&q=numpy&f=false

https://pandas.pydata.org/pandas-docs/version/0.7.3/pandas.pdf

https://www.tutorialspoint.com/scikit_learn/scikit_learn_introduction.htm

https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html

https://scikit-learn.org/stable/modules/model_evaluation.html#accuracy-score

https://keras.io/about/

P. Baldi, S.R.B., Bioinformatics: The machine learning approach. 2 ed, ed. S.r.B. Pierre Baldi, 2001.

N. Bhatia, "Survey of Nearest Neighbor Techniques", International Journal of Computer Science and Information Security, Vol. 8, No. 2, 2010. DOI: https://doi.org/10.14569/IJACSA.2011.021110

A. Francillon, P.R., "Smart Card Research and Advanced Applications": 12th International Conference, CARDIS 2013, Berlin, Germany, 2013.Revised Selected Papers. 1 ed. Lecture Notes in Computer Science 8419 Security and Cryptology2014: Springer International Publishing, November 27-29.

A. Alarabeyyat, A.M., "Breast Cancer Detection Using K-Nearest Neighbor Machine Learning Algorithm", in 9th International Conference on. IEEE, v.i.e.E. (DeSE), pp. 35-39, 2016.

MF. Akay. "Support vector machines combined with feature selection for breast cancer diagnosis". Expert Syst Appl Vol. 36, Issue. 2, Part. 2, pp. 3240-3247, March 2009. DOI: https://doi.org/10.1016/j.eswa.2008.01.009

S.K. Prabhakar, H. Rajaguru, "Performance Analysis of Breast Cancer Classification with Softmax Discriminant Classifier and Linear Discriminant Analysis", In: Maglaveras N., Chouvarda I., de Carvalho P. (eds) Precision Medicine Powered by pHealth and Connected Health. IFMBE Proceedings, vol 66. Springer, Singapore, 2018. DOI: https://doi.org/10.1007/978-981-10-7419-6_33

J. S. Snchez, R.A.M., J. M. Sotoca. "An analysis of how training data complexity affects the nearest neighbor classifiers", Pattern Analysis and Applications, Vol. 10, Issue 3, pp 189–201, August 2007. DOI: https://doi.org/10.1007/s10044-007-0061-2

M. Raniszewski, "Sequential reduction algorithm for nearest neighbor rule", Computer Vision and Graphics, 2010. DOI: https://doi.org/10.1007/978-3-642-15907-7_27

P.BhuvaneswariaA, B. Therese, "Detection of Cancer in Lung with K-NN Classification Using Genetic Algorithm", Procedia Materials Science, Vol. 10, pp. 433-440, 2015. DOI: https://doi.org/10.1016/j.mspro.2015.06.077

Z. Zhou, Y.J., Y. Yang, S.F. Chen, "Lung Cancer Cell Identification Based on Artificial Neural Network Ensembles Artificial Intelligence", Medicine Elsevier, Vol. 24, pp. 25-36, 2002. DOI: https://doi.org/10.1016/S0933-3657(01)00094-X

A. Pradesh, A.o.F.S.w.C.B.C.D.,” Indian J. Comput. Sci. Eng., vol. 2, no. 5, pp. 756–763, 2011.

Mook S, Schmidt MK, Rutgers EJ, et al. Calibration and discriminatory accuracy of prognosis calculation for breast cancer with the online Adjuvant! program: a hospital-based retrospective cohort study. Lancet Oncol. 2009;11:1070–1076 DOI: https://doi.org/10.1016/S1470-2045(09)70254-2

Goldhirsch A, Ingle JN, Gelber RD, et al. Thresholds for therapies: highlights of the St. Gallen International Expert Consensus on the primary therapy of early breast cancer 2009. Ann Oncol 2009;8:1319–1329. DOI: https://doi.org/10.1093/annonc/mdp322

Hayes DF, Bast RC, Desch CE, et al. Tumor marker utility grading system: a framework to evaluate clinical utility of tumor markers. J Natl Cancer Inst. 1996;20:1456–1466. DOI: https://doi.org/10.1093/jnci/88.20.1456

Rakha EA, El-Sayed ME, Powe DG, et al. Invasive lobular carcinoma of the breast: response to hormonal therapy and outcomes. Eur J Cancer. 2008;1:73–83. DOI: https://doi.org/10.1016/j.ejca.2007.10.009

Westenend PJ, Meurs CJ, Damhuis RA, Tumour size and vascular invasion predict distant metastasis in stage I breast cancer: grade distinguishes early and late metastasis. J Clin Pathol. 2005;2:196–201. DOI: https://doi.org/10.1136/jcp.2004.018515

Ichikura T, Tomimatsu S, Okusa Y, et al. Comparison of the prognostic significance between the number of metastatic lymph nodes and nodal stage based on their location in patients with gastric cancer. J Clin Oncol. 1993;10:1894–1900. DOI: https://doi.org/10.1200/JCO.1993.11.10.1894

Allred DC, Carlson RW, Berry DA, et al. NCCN Task Force Report: Estrogen Receptor and Progesterone Receptor Testing in Breast Cancer by Immunohistochemistry. J Natl Compr Canc Netw. 2009:S1–S21; quiz S22-S23 DOI: https://doi.org/10.6004/jnccn.2009.0079

Pathology reporting of breast disease. A Joint Document Incorporating the Third Edition of the NHS Breast Screening Programme’s Guidelines for Pathology Reporting in Breast Cancer Screening and the Second Edition of The Royal College of Pathologists’ Minimum Dataset for Breast Cancer Histopathology. January 2005. NHSBSP Pub. No 58.

Harris L, Fritsche H, Mennel R, et al. American Society of Clinical Oncology 2007 update of recommendations for the use of tumor markers in breast cancer. J Clin Oncol. 2007;33:5287–5312. DOI: https://doi.org/10.1200/JCO.2007.14.2364

Ma XJ, Salunga R, Tuggle JT, et al. Gene expression profiles of human breast cancer progression. Proc Natl Acad Sci U S A. 2003;10:5974–5975979. DOI: https://doi.org/10.1073/pnas.0931261100

Warwick J, Tabar L, Vitak B, et al. Time-dependent effects on survival in breast carcinoma: results of 20 years of follow-up from the Swedish Two-County Study. Cancer. 2004;7:1331–1336. DOI: https://doi.org/10.1002/cncr.20140

Balslev I, Axelsson CK, Zedeler K, et al. The Nottingham Prognostic Index applied to 9,149 patients from the studies of the Danish Breast Cancer Cooperative Group (DBCG). Breast Cancer Res Treat. 1994;3:281–290 DOI: https://doi.org/10.1007/BF00666005

Smith JA III, Gamez-Araujo JJ, Gallager HS, et al. Carcinoma of the breast: analysis of total lymph node involvement versus level of metastasis. Cancer. 1977;2: 527–532. DOI: https://doi.org/10.1002/1097-0142(197702)39:2<527::AID-CNCR2820390221>3.0.CO;2-N

Reed J, Rosman M, Verbanac KM, et al. Prognostic implications of isolated tumor cells and micrometastases in sentinel nodes of patients with invasive breast cancer: 10-year analysis of patients enrolled in the prospective East Carolina University/Anne Arundel Medical Center Sentinel Node Multicenter Study. J Am Coll Surg. 2009;3:333–340. DOI: https://doi.org/10.1016/j.jamcollsurg.2008.10.036

Putti TC, El-Rehim DM, Rakha EA, et al. Estrogen receptornegative breast carcinomas: a review of morphology and immunophenotypical analysis. Mod Pathol. 2005;1:26–35. DOI: https://doi.org/10.1038/modpathol.3800255

Wirapati P, Sotiriou C, Kunkel S, et al. Meta-analysis of gene expression profiles in breast cancer: toward a unified understanding of breast cancer subtyping and prognosis signatures. Breast Cancer Res. 2008;4:R65. DOI: https://doi.org/10.1186/bcr2124

Mohammed RA, Martin SG, Mahmmod AM, et al. Objective assessment of lymphatic and blood vascular invasion in lymph node-negative breast carcinoma: findings from a large case series with long-term follow-up. J Pathol. 2011;3:358–365. DOI: https://doi.org/10.1002/path.2810

Downloads

Published

19-04-2024

Issue

Section

Research Articles

How to Cite

[1]
Ankit, Harsh Bansal, Dhruva Arora, Kanak Soni, Rishita Chugh, and Swarna Jaya Vardhan, “Breast Cancer Classification Using Machine Learning”, Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol, vol. 10, no. 2, pp. 575–588, Apr. 2024, doi: 10.32628/CSEIT2410274.

Similar Articles

1-10 of 46

You may also start an advanced similarity search for this article.