Applying Machine Learning to Ovarian Cancer Predicting Biomarkers

Identifying biomarkers that predict patient’s risk for Ovarian Cancer is a key factor in the fight to improve survival rates. Ovarian Cancer is a group of diseases that originate in the ovaries, fallopian tubes or peritoneum. Ovarian Cancer is best treated at its earliest stages when it is most treatable. Therefore, early screening and diagnosis is key to successfully treating or curing the disease. This study will use heatmap visualization, pearson correlation coefficient method, scatterplot visualizations, logistic regression, and existing literature to determine the best biomarkers of importance in comparison with elevated CA125 levels importance identified include Age, Menopause, Human Epididymis Protein 4 (HE4), Alkaline Phosphatase (ALP), and Calcium. Preliminary analysis shows variables of interest, except HE4, correspond with elevated CA125 levels and would be biomarkers to play closer attention to in predicting ovarian cancer with machine learning models. To optimize performance of the prediction model, removal of non-biomarkers, Age and Menopause, is necessary. Menopause is a nominal category that could still decrease performance even if its cleaned and converted to numeric form.

Table of Contents
    Add a header to begin generating the table of contents

    From this Series

    LaPorchia Davis photo

    “Meharry trained me how to learn, think critically, and problem-solve with data.” Somewhere in a CT scan, there is a pattern too subtle for the...

    Graphic

    Meharry SACS is pleased to announce that David Lockett, Ph.D. student in data science, and Courtney Quarterman, Ph.D. candidate in data science, have been accepted...

    SACS Studendy at SXSW
    This specialized graduate program bridges the gap between technical AI development and commercialization skills....