
Detection of Breast Cancer with Python

Neha Singh


Global cancer data indicate that more than 2 million women are diagnosed with breast cancer each year, making it one of the leading sources of new cancer cases and cancer-related deaths among women and a significant public health concern. Fortunately, it is also curable when detected at an early stage. Early diagnosis, combined with timely and effective treatment, improves the prognosis and survival of patients. Tumor classification, however, carries a significant risk of error and false diagnosis, which needs to be reduced. Accurate classification can spare patients unnecessary treatment, so it is important to correctly classify patients' tumors into malignant and benign groups. This study, based on machine learning (ML) algorithms, reviews Python techniques and their application to breast cancer diagnosis and prognosis by building a simple machine learning model. Machine learning has a distinct advantage in that it can detect critical features in complex breast cancer datasets, and it is widely used for pattern classification and forecast modelling. The primary data for this study are taken from the Wisconsin Breast Cancer Database (WBCD), a benchmark database commonly used to compare the results of different algorithms.
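As a rough illustration of the kind of simple model the abstract describes, the sketch below trains a baseline classifier that separates malignant from benign tumors. It is not the study's own pipeline. It assumes scikit-learn is installed and uses the copy of the Wisconsin Diagnostic data bundled with that library, which may differ from the exact WBCD file used in the study. The choice of logistic regression, the 25% test split, and the helper name `train_and_evaluate` are all assumptions made for illustration.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def train_and_evaluate(test_size=0.25, seed=0):
    """Fit a baseline classifier and return (model, confusion matrix)."""
    # Assumption: scikit-learn's bundled Wisconsin Diagnostic data,
    # where target 0 means malignant and target 1 means benign.
    data = load_breast_cancer()
    X_train, X_test, y_train, y_test = train_test_split(
        data.data,
        data.target,
        test_size=test_size,
        stratify=data.target,  # keep the class ratio equal in both splits
        random_state=seed,
    )
    # Assumption: standardized logistic regression as a simple baseline.
    model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    model.fit(X_train, y_train)
    # Rows are true classes, columns are predicted classes, in order [0, 1].
    cm = confusion_matrix(y_test, model.predict(X_test), labels=[0, 1])
    return model, cm


if __name__ == "__main__":
    _, cm = train_and_evaluate()
    (mal_hit, mal_missed), (ben_flagged, ben_hit) = cm
    print("malignant correctly flagged:", mal_hit)
    print("malignant missed (predicted benign):", mal_missed)
    print("benign flagged as malignant:", ben_flagged)
    print("benign correctly cleared:", ben_hit)
    print("accuracy:", (mal_hit + ben_hit) / cm.sum())
```

The confusion matrix is reported alongside accuracy because the two kinds of error have different costs: a missed malignant tumor delays treatment, while a benign tumor flagged as malignant can lead to the unnecessary treatment the abstract warns about.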


Breast cancer, predictive algorithm, machine learning, Python, classification models, analysis




