Threshold-Optimized and Calibrated Logistic Regression for Breast Cancer Classification
DOI:
https://doi.org/10.56294/saludcyt20252241Keywords:
Breast cancer classification, Logistic regression, Clinical decision support, Probability calibration, Fine needle aspirate cytologyAbstract
Breast cancer affects over 2.3 million individuals annually worldwide. Traditional diagnostic methods face limitations in consistency and objectivity, particularly in resource-constrained settings. This study developed a logistic regression-based clinical decision support system for breast cancer classification. We analyzed the Wisconsin Diagnostic Breast Cancer dataset containing 569 samples with 30 quantitative morphological features from fine needle aspirate cytology. The dataset comprised 357 benign and 212 malignant cases. Data underwent standardization via StandardScaler, followed by 75-25 train-test partitioning (426 training, 143 testing samples). We evaluated the logistic regression model through confusion matrix analysis, ROC curve assessment, threshold optimization via Youden's Index, and probability calibration using Expected Calibration Error (ECE). The model achieved 95.8% accuracy, 96.2% sensitivity, and 95.6% specificity on independent testing data, with AUC-ROC of 0.993. Threshold optimization identified 0.560 as the optimal decision boundary, yielding 3.77% false negative rate and 4.44% false positive rate. Probability calibration demonstrated reliable predictions with ECE of 0.0390, improved to 0.0328 through isotonic regression. The model correctly classified 137 of 143 test samples (86 true negatives, 51 true positives, 4 false positives, 2 false negatives). The logistic regression model demonstrated strong discriminative performance for breast cancer classification. However, single train-test validation and dataset-specific characteristics require cautious interpretation. Cross-validation and external validation remain necessary for clinical translation.
References
1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6).
2. DeSantis CE, Ma J, Gaudet MM, Newman LA, Miller KD, Goding Sauer A, et al. Breast cancer statistics, 2019. CA Cancer J Clin. 2019;69(6).
3. Al Muhaisen S, Safi O, Ulayan A, Aljawamis S, Fakhoury M, Baydoun H, et al. Artificial Intelligence-Powered Mammography: Navigating the Landscape of Deep Learning for Breast Cancer Detection. Cureus. 2024 Mar 26;
4. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3).
5. Madabhushi A, Lee G. Image analysis and machine learning in digital pathology: Challenges and opportunities. Vol. 33, Medical Image Analysis. 2016.
6. Evans AJ, Brown RW, Bui MM, Chlipala EA, Lacchetti C, Milner DA, et al. Validating Whole Slide Imaging Systems for Diagnostic Purposes in Pathology. Arch Pathol Lab Med. 2022;146(4).
7. Krane JF. Koss’ Diagnostic Cytology and Its Histopathologic Bases, Fifth Edition. International Journal of Gynecological Pathology. 2007;26(3).
8. Ramos-Vara JA, Miller MA. When Tissue Antigens and Antibodies Get Along: Revisiting the Technical Aspects of Immunohistochemistry-The Red, Brown, and Blue Technique. Vol. 51, Veterinary Pathology. 2014.
9. Rajpurkar P, Chen E, Banerjee O, Topol EJ. AI in health and medicine. Vol. 28, Nature Medicine. 2022.
10. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A guide to deep learning in healthcare. Vol. 25, Nature Medicine. 2019.
11. McKinney SM, Sieniek M, Godbole V, Godwin J, Antropova N, Ashrafian H, et al. International evaluation of an AI system for breast cancer screening. Nature. 2020;577(7788).
12. Liu Y, Jain A, Eng C, Way DH, Lee K, Bui P, et al. A deep learning system for differential diagnosis of skin diseases. Nat Med. 2020;26(6).
13. Samsir S, Sitorus JHP, Zulkifli, Ritonga Z, Nasution FA, Watrianthos R. Comparison of machine learning algorithms for chest X-ray image COVID-19 classification. J Phys Conf Ser. 2021;1933(1):012040.
14. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Vol. 25, Nature Medicine. 2019.
15. Jimma BL. Artificial intelligence in healthcare: A bibliometric analysis. Vol. 9, Telematics and Informatics Reports. 2023.
16. Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: Past, present and future. Vol. 2, Stroke and Vascular Neurology. 2017.
17. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Vol. 1, Nature Machine Intelligence. 2019.
18. Ghassemi M, Oakden-Rayner L, Beam AL. The false hope of current approaches to explainable artificial intelligence in health care. Vol. 3, The Lancet Digital Health. 2021.
19. Beam AL, Kohane IS. Big data and machine learning in health care. Vol. 319, JAMA - Journal of the American Medical Association. 2018.
20. Char DS, Shah NH, Magnus D. Implementing Machine Learning in Health Care — Addressing Ethical Challenges. New England Journal of Medicine. 2018;378(11).
21. Kelly CJ, Karthikesalingam A, Suleyman M, Corrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. Vol. 17, BMC Medicine. 2019.
22. Naylor CD. On the prospects for a (Deep) learning health care system. Vol. 320, JAMA - Journal of the American Medical Association. 2018.
23. Scott AJ, Hosmer DW, Lemeshow S. Applied Logistic Regression. Biometrics. 1991;47(4).
24. Anderson KM, Odell PM, Wilson PWF, Kannel WB. Cardiovascular disease risk profiles. Am Heart J. 1991;121(1 PART 2).
25. Guo C, Pleiss G, Sun Y, Weinberger KQ. On calibration of modern neural networks. In: 34th International Conference on Machine Learning, ICML 2017. 2017.
26. Minderer M, Djolonga J, Romijnders R, Hubis F, Zhai X, Houlsby N, et al. Revisiting the Calibration of Modern Neural Networks. In: Advances in Neural Information Processing Systems. 2021.
27. Vickers AJ, Elkin EB. Decision curve analysis: A novel method for evaluating prediction models. Medical Decision Making. 2006;26(6).
28. Al Kuwaiti A, Nazer K, Al-Reedy A, Al-Shehri S, Al-Muhanna A, Subbarayalu AV, et al. A Review of the Role of Artificial Intelligence in Healthcare. Vol. 13, Journal of Personalized Medicine. 2023.
29. Ueda D, Kakinuma T, Fujita S, Kamagata K, Fushimi Y, Ito R, et al. Fairness of artificial intelligence in healthcare: review and recommendations. Vol. 42, Japanese Journal of Radiology. 2024.
30. Vickers AJ, Van Calster B, Steyerberg EW. Net benefit approaches to the evaluation of prediction models, molecular markers, and diagnostic tests. BMJ (Online). 2016;352.
31. Sendak MP, Gao M, Brajer N, Balu S. Presenting machine learning model information to clinical end users with model facts labels. Vol. 3, npj Digital Medicine. 2020.
32. López-Ratón M, Rodríguez-Álvarez MX, Cadarso-Suárez C, Gude-Sampedro F. Optimalcutpoints: An R package for selecting optimal cutpoints in diagnostic tests. J Stat Softw. 2014;61(8).
33. Perkins NJ, Schisterman EF. The inconsistency of “optimal” cutpoints obtained using two criteria based on the receiver operating characteristic curve. Am J Epidemiol. 2006;163(7).
34. Wu N, Phang J, Park J, Shen Y, Kim SG, Heacock L, et al. Breast Cancer Wisconsin (Diagnostic) Data Set | Kaggle. Kaggle. 2019;4(November).
35. W.N. S, W.H. W, O.L. M. Nuclear feature extraction for breast tumor diagnosis. IS&T/SPIE 1993 International Symposium on Electronic Imaging: Science and Technology. 1993;1905(870).
36. Alshayeji MH, Ellethy H, Abed S, Gupta R. Computer-aided detection of breast cancer on the Wisconsin dataset: An artificial neural networks approach. Biomed Signal Process Control. 2022;71.
37. Panda NR, Pati JK, Mohanty JN, Bhuyan R. A Review on Logistic Regression in Medical Research. Vol. 13, National Journal of Community Medicine. 2022.
38. Prasad R, Anjali P, Adil S, Deepa N. Heart disease prediction using logistic regression algorithm using machine learning. Int J Eng Adv Technol. 2019;8(3 Special Issue).
39. Lasko TA, Bhagwat JG, Zou KH, Ohno-Machado L. The use of receiver operating characteristic curves in biomedical informatics. Vol. 38, Journal of Biomedical Informatics. 2005.
40. Zweig MH, Campbell G. Receiver-operating characteristic (ROC) plots: A fundamental evaluation tool in clinical medicine. Clin Chem. 1993;39(4).
41. Posocco N, Bonnefoy A. Estimating Expected Calibration Errors. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2021.
42. Böken B. On the appropriateness of Platt scaling in classifier calibration. Inf Syst. 2021;95.
43. Huang L, Zhao J, Zhu B, Chen H, Broucke S Vanden. An Experimental Investigation of Calibration Techniques for Imbalanced Data. IEEE Access. 2020;8.
44. Hosmer DW, Hosmer T, Le Cessie S, Lemeshow S. A comparison of goodness-of-fit tests for the logistic regression model. Stat Med. 1997;16(9).
45. Cox DR. Two Further Applications of a Model for Binary Regression. Biometrika. 1958;45(3/4).
46. Dhanya R, Paul IR, Sindhu Akula S, Sivakumar M, Nair JJ. A comparative study for breast cancer prediction using machine learning and feature selection. In: 2019 International Conference on Intelligent Computing and Control Systems, ICCS 2019. 2019.
47. LG A, AT E. Using Three Machine Learning Techniques for Predicting Breast Cancer Recurrence. J Health Med Inform. 2013;04(02).
48. Aličković E, Subasi A. Breast cancer diagnosis using GA feature selection and Rotation Forest. Neural Comput Appl. 2017;28(4).
49. Salama Gouda I, .Abdelhalim M.B., Zeid Magdy Abd-elghany. Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers. International Journal of Computer and Information Technology. 2012;01(01).
50. Bella A, Ferri C, Hernández-Orallo J, Ramírez-Quintana MJ. Quantification via probability estimators. In: Proceedings - IEEE International Conference on Data Mining, ICDM. 2010.
51. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Vol. 110, Journal of Clinical Epidemiology. 2019.
52. Asri H, Mousannif H, Al Moatassime H, Noel T. Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis. In: Procedia Computer Science. 2016.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Ronal Watrianthos, Rayendra, Ervan Asri, Yuhefizar, Humaira (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.