Applying Machine Learning to Ovarian Cancer Predicting Biomarkers

Identifying biomarkers that predict patient’s risk for Ovarian Cancer is a key factor in the fight to improve survival rates. Ovarian Cancer is a group of diseases that originate in the ovaries, fallopian tubes or peritoneum. Ovarian Cancer is best treated at its earliest stages when it is most treatable. Therefore, early screening and diagnosis is key to successfully treating or curing the disease. This study will use heatmap visualization, pearson correlation coefficient method, scatterplot visualizations, logistic regression, and existing literature to determine the best biomarkers of importance in comparison with elevated CA125 levels importance identified include Age, Menopause, Human Epididymis Protein 4 (HE4), Alkaline Phosphatase (ALP), and Calcium. Preliminary analysis shows variables of interest, except HE4, correspond with elevated CA125 levels and would be biomarkers to play closer attention to in predicting ovarian cancer with machine learning models. To optimize performance of the prediction model, removal of non-biomarkers, Age and Menopause, is necessary. Menopause is a nominal category that could still decrease performance even if its cleaned and converted to numeric form.

Table of Contents
    Add a header to begin generating the table of contents

    From this Series

    Photo of Dyani Peterson feature

    M.S. Biomedical Data Science student Dyani Peterson has received an AIM-AHEAD Research Fellowship. She is the first Meharry student and the only master’s student from...

    SACS Staff

    Nashville, TN — Meharry Medical College announced today that the School of Applied Computational Sciences has received a $1 million grant from the Robert Wood...

    Brian Strong Headshot

    Brian C. Strong, a master’s student in data science at Meharry Medical College’s School of Applied Computational Sciences, spent summer 2025 as an National Institute...