Data Science Capstone: Breast Cancer Prediction

Classification of benign and malignant breast tumors by ML algorithms

This was my capstone project for the HarvardX Data Science Professional Certificate. It demonstrates competency in R programming, statistical modeling and basic machine learning.

The goal of this study was to train a machine learning classifier to predict whether a breast tumor is malignant (cancerous) or benign (non-cancerous) based on cell nucleus features extracted from digitized images of breast biopsy slides. It uses the Breast Cancer Wisconsin (Diagnostic) Data Set, a classic machine learning benchmark dataset. Results and implications are analyzed from the perspective of a cancer biologist.
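For readers who want a feel for the task, here is a minimal sketch of a benign-versus-malignant classifier. It is not the code from this project, which was written in R. It is a Python illustration that assumes scikit-learn is installed and uses the copy of the Wisconsin diagnostic data that ships with it. The choice of logistic regression, the 80/20 split and the random seed are all assumptions made for the example.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# scikit-learn's bundled copy of the Breast Cancer Wisconsin (Diagnostic) data:
# 569 samples, 30 numeric cell-nucleus features; target 0 = malignant, 1 = benign.
X, y = load_breast_cancer(return_X_y=True)

# Hold out 20% of the samples for evaluation, preserving the class balance.
# (Split size and seed are illustrative assumptions.)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Standardize the features, then fit a logistic regression classifier.
# (Model choice is an assumption, not the method used in the project.)
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X_train, y_train)

# Fraction of held-out tumors classified correctly.
accuracy = model.score(X_test, y_test)
print(f"Held-out accuracy: {accuracy:.3f}")
```

Accuracy alone can hide the error that matters most clinically, a malignant tumor predicted as benign, so a fuller analysis would also look at sensitivity and specificity.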

Amy Gill
Lead Content Developer, Data Science

Biomedical researcher, data scientist and educator.