Machine Learning based Predictive Analysis and Algorithm for Analysis Severity of Breast Cancer

Pp: 83-97 (15)

* (Excluding Mailing and Handling)

Abstract

Breast cancer is the 2nd frequent occurrence of cancer among women, after skin cancer, according to the American Cancer Society. By using mammography, it is possible to detect breast cancer before it has spread to other parts of the body. It primarily affects females, though males can be affected as well. Early identification of breast cancer improves survival chances significantly, however, the detection procedure remains difficult in clinical studies. To solve this problem, a Machine Learning (ML) algorithm is used to detect breast cancer in mammogram images. In this study, 100 images from the mini-MIAS mammogram database were used, 50 of which were malignant and 50 of which were benign breast cancer mammograms. Before training the model, the sample image datasets are pre-processed using numerous techniques. The required features are then extracted from the sample images using Feature Extraction (FE) techniques, such as Daubechies (DB4) and HAAR. Finally, the extracted features are fed into ML classifiers such as Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), and Random Forest (RF) to create a model. Several performance metrics are used to evaluate FE and classification. According to the results of the analysis, the HAAR FE with the RF model is the ideal combination, with an accuracy level of 91%.

Keywords: Accuracy, Breast Cancer, Confusion Matrix, Diagnosis, Feature, Metrics.

Cite as