Student Capstone Projects

The capstone is the culminating project for each student in a SACS Master of Science program. The comprehensive, real-life industry-type projects are oriented toward the student’s domain of interest.

Each project includes: formulation of a question to be answered by the data; collection, cleaning and processing of data; choosing and applying a suitable model and/or analytic method to the problem; and communicating the results to a non-technical audience.

Two photos of a smiling man in a blue shirt, standing in front of a wall with a wooden bench.

Julian Broughton

M.S. Biomedical Data Science

Learning Unbiased Risk Prediction Based Algorithms in Healthcare

The rapid advancement of Artificial Intelligence (AI) has significantly transformed healthcare, enhancing traditional methods of diagnosis and treatment. These innovations have enabled quicker disease detection, improved management, and more personalized care. However, many AI tools currently used in clinical settings suffer from algorithmic and data-driven biases, often due to inadequate representation of certain racial, gender, and age groups. These gaps can result in misdiagnoses, health disparities, and inequitable outcomes. Therefore, addressing these biases is essential.

This project investigates the presence and impact of such biases by examining both pre-processing and post-processing stages of AI model development, using a widely adopted real-world healthcare dataset from primary care patients. It uncovers previously overlooked biases and offers practical strategies to reduce disparities related to race, gender, and age. By applying machine learning algorithms and utilizing the Fairlearn toolkit, the study identifies and quantifies the biases, evaluates their effects on predictive performance, and presents methods to mitigate them. The findings provide strong evidence of systemic bias in healthcare AI systems, particularly in how they influence resource distribution and decision-making. As a result, it is imperative to incorporate bias detection and mitigation techniques to ensure that AI technologies in healthcare are fair, dependable, and ethically sound.

A man in a suit smiling at the camera.

Chris Brown

M.S. Data Science

Antiphospholipid Syndrome: Unraveling Adverse Outcomes in Pregnancy

Antiphospholipid syndrome (APS) refers to the clinical association between antiphospholipid antibodies and a hypercoagulable state, which increases the risk of blood clot formation within blood vessels. APS is more prevalent in women than in men. Research shows that women with APS face an elevated risk of adverse pregnancy outcomes, particularly during the fetal period (ten or more weeks of gestation). These outcomes include preeclampsia, characterized by high blood pressure and proteinuria (excess protein in urine), recurrent early pregnancy loss, fetal demise, and intrauterine growth restriction. APS-related pregnancy losses tend to occur later in pregnancy compared to sporadic or recurrent miscarriages, which typically happen earlier in the pre-embryonic or embryonic period. Factors such as placental insufficiency, hypertensive disorders of pregnancy, thrombophilia, and underlying autoimmune conditions play a role. This research aims to study the complex interplay of these factors to improve outcomes for affected women. Notably, APS is more prevalent among underserved communities.

Woman in black shirt and long hair, smiling, posing for portrait.

Brittany City

M.S. Data Science

A Technology Career Recommendation System Based on Personality, Skills, and Interests

With the rapid expansion of technology, there is an increased demand for individuals with technology skills, leading to an interest in technology careers. Despite the abundance of technology job opportunities, many individuals struggle to identify which technology career field is best suited for their skills, interests, and personality. This lack of clarity can lead to high job turnover rates, low job satisfaction, and lack of productivity in the workplace. The research proposal aims to develop a career recommendation system based on key personalities, skills, and interests using machine learning algorithms to suggest viable technology career decisions. The research will analyze and develop a predictive model and recommendation system based on the personalities, technical skills, and interests collected through a survey.

Laporchia Davis

Laporchia Davis

MS. Biomedical Data Science

Integrating Clinical Predictive Modeling and Image-Based Analysis to Identify Bone Health Complications in Burn Patients

Severe burn injuries often trigger musculoskeletal complications that extend far beyond skin damage. Many survivors develop “hidden” bone-related conditions such as fractures, bone mineral density loss, and osteomyelitis, which may not be easily detected through routine clinical assessment. Because these complications can progress silently and lead to long-term skeletal deterioration, early prediction and targeted monitoring are essential for preserving mobility and quality of life. This project presents a multimodal framework that combines clinical predictive modeling with advanced imaging analysis to evaluate bone health complications in burn patients. The clinical component analyzes an AI-generated synthetic dataset containing 1,538 burn patient records using machine learning and principal component analysis (PCA) to identify key factors associated with fractures and osteomyelitis. The imaging component applies deep learning–based feature extraction using ResNet-50, followed by Uniform Manifold Approximation and Projection (UMAP), to explore structural variation across a 72-slice Computed Tomography (CT) scan derived from a larger dataset of over 5,000 CT images of fractured limbs.

Preliminary results indicate that burn severity index, bone damage score, duration of hospital stay, comorbidity score, and treatment material properties are the strongest predictors of fracture risk and osteomyelitis development. Additionally, the CT slice embeddings form distinct anatomical clusters, demonstrating that the model captures meaningful structural variation within bone regions. A bone segmentation step further enhances this analysis by isolating bone structures and reducing soft-tissue variability in imaging features.

Overall, this multimodal approach highlights the complementary value of clinical and imaging data in understanding burn-related skeletal complications and is hoped to advance early detection strategies for these conditions. While the imaging analysis remains exploratory, the findings suggest that deep-learning based CT features may support the future development of integrated models for improved bone health assessment in burn patients.

Two images of a woman with curly hair, smiling and posing.

Gabrielle Dawkins

M.S. Biomedical Data Science

Voice Analysis to Differentiate between Neurological, Respiratory, Cardiovascular Conditions

The human voice, a biomarker of complex movement of communication has been known to change for a person as they age or have changes in health.  Neurological, cardiovascular, and respiratory disorders are disease that can alter the acoustic features of voice. Fluid accumulation in vocal fold, fatigue can lead to change the features of voice such as pitch, formant, jitter, shimmer and higher order features such as MFCCs. 

In this study, we aim to distinguish the feature change between patients with Neurological, cardiovascular, and respiratory disorders and have multiple comorbidities. We have used Bridge2AI voice dataset, which consists of 302 subjects with the disorders. We have analyzed 140 acoustic features to investigate significant difference between diseases and found several features were different.

We have further implemented machine learning algorithms to identify the patients (for both subject dependent and independent cases) from their voice features and achieved F1 score above 0.60. We aim to enhance this study in future to increase the accuracy level.

Judith Dike

Judith Dike

M.S. Data Science

The Impact of Stress and Anxiety on Cardiovascular Outcomes in Thyroid Dysfunction

Background: Thyroid dysfunction affects cardiovascular health through metabolic alterations. However, the effects of psychological stress and anxiety remain understudied. This study examines how chronic stress and anxiety influence cardiovascular disease (CVD) in patients with thyroid disorders.

Methods: We analyzed electronic health records (EHRs) from patients aged 18-80 years diagnosed with thyroid dysfunction between January 2018 and July 2022 in the All of Us research program. Predictors included stress and anxiety (ICD-10 codes), while outcomes were CVD diagnoses (CHF, COPD, arrhythmias). Using logistic regression adjusted for age and sex, we assessed associations between mental health conditions and CVD, ensuring thyroid diagnosis preceded cardiovascular events. Models were stratified by sex to examine effect modification.

Results: The cohort of 27,441 people was 78.9% female and 19.1% male, with a mean age of 58.6 years (SD 12.4). Racial/ethnic composition included: 65.5% White, 15.5% Hispanic/Latino, 11.3% Black, 2.5% Asian, 1.2% Other, 1.2% multiracial, and 2.9% unknown. CVD prevalence was 24.5% overall, with arrhythmias (20.3%, p<0.001), CHF (5.3%, p<0.001), and COPD (4.8%, p<0.001) being most common. Patients with mental health conditions had 2.06 times higher odds of CVD (95% CI: 1.93-2.18) than those without. The association was stronger in women (OR=2.14, CI:2.00-2.29) than men (OR=1.80, CI:1.62-2.00), with a significant interaction (p=0.01). Each additional year of age was associated with a 2.1% increase in CVD odds (OR=1.021, CI:1.018-1.022). Conclusion: Thyroid patients with stress or anxiety had higher CVD risks, especially women (2.1 times vs 1.8 times higher odds). Since 1 in 4 thyroid patients developed CVD, mental health screening may help with reducing risks.

Two men standing side by side, one smiling, both dressed in suits.

Jaylin Dyson

M.S. Data Science

Developing a Knowledge Graph-Driven AI Agent for Protein Function Prediction

Knowledge graphs (KGs) connect nodes and edges for knowledge representation, particularly in biology, to illustrate findings and link biological connections. The most accurate KGs are created from journals, abstracts, and lab experiments, prioritizing precision over recall for high accuracy. However, this manual process is slow and resource-intensive, resulting in gaps and missing connections.

To address this challenge, computational approaches for knowledge graph completion (KGC) have emerged to help accelerate the discovery of new links. Many of these computational methods were originally developed for non-biological link prediction tasks within computer science and have been successfully adapted for biological KGs. Current models primarily employ embedding techniques (TransE, ComplEx, and RotatE) and Graph Neural Networks (GNN).

However, other techniques, like generative adversarial networks (GANs) and reinforcement learning (RL), remain largely unexplored in biological link prediction. In this context, we propose a novel framework that integrates GANs and RL to generate and rank plausible, new links for protein function prediction within biological knowledge graphs.

Woman with long hair, wearing a necklace, in a purple shirt.

Ariel Edwards

M.S. Biomedical Data Science

Exploring the Connection Between Anxiety and Lung Cancer Diagnosis Through Data Analysis

This capstone project explores the relationship between anxiety and lung cancer diagnosis, addressing a gap in research where psychological factors have often been overlooked. Using survey data from Kaggle and the UCI Machine Learning Repository, I built a logistic regression model to see if people who report experiencing anxiety are more likely to also report a lung cancer diagnosis. After accounting for factors like age, gender, smoking habits, and chronic disease, the results demonstrated that individuals reporting anxiety were over twice as likely to report having lung cancer. Overall, the model demonstrated solid performance, with consistent results observed across key demographic and behavioral subgroups, including smokers and individuals over 60. These findings suggest that mental health, particularly anxiety, may play a larger role in physical health outcomes than we often consider. By bringing psychological factors into the conversation, this research encourages a more holistic approach to how we think about cancer risk. 

Invest in Knowledge

With Your Support We Can Change the World.