Ph.D. Data Science

Data Science Ph.D. programs

Ph.D

Degree Granted

4 Year Minimum

Length of Program

Fall

Term

Online

Format

Ph.D. Data Science

Data Science Ph.D. programs

Ph.D

Degree Granted

4 Year Minimum

Length of Program

Fall

Term

Online

Format

The online Data Science Ph.D. from Meharry SACS prepares you to tackle complex big data challenges using AI, machine learning, deep learning, advanced analytics and database design. You’ll gain technical expertise and learn to communicate insights to drive strategic decisions. Through hands-on projects, you’ll apply what you learn to real-world problems while considering ethical data use. As a graduate, you’ll be ready to lead and collaborate on innovative data science solutions. Courses are live and online, with interactive lectures, office hours, and hands-on projects guided by engaged faculty.

Students are eligible for scholarships up to 50% of tuition.

Learn more about SACS Scholarships

  • Curriculum
  • Degree Requirements
  • Admission Requirements
  • Contact
  • Courses
  • Careers in Data Science

The Data Science Ph.D. program is designed for students with undergraduate or graduate backgrounds in data science, computer science, mathematics, statistics, engineering, information technology, or related areas who wish to contribute to data and computational sciences. The curriculum combines mathematics and statistics, AI and machine learning, advanced data analytics and database design, as well as related topics in research and discovery. 

We offer live, online courses and faculty will engage with you through interactive class lectures, office hours, in-class exercises and projects. Ph.D. students will be asked to attend some in-person sessions during the program. 

Highlights include: 

  • Mathematical and statistical theory 
  • Advanced scientific computing 
  • Advanced database design 
  • Advanced Statistics 
  • Big data management and analytics 
  • Artificial Intelligence and machine learning 
  • Computational software engineering 
  • Predictive modeling and analytics 
  • Visualization and unstructured data analysis 
  • Big data privacy and security 
  • Ethical, Legal and Societal Issues in big data analytics 

You will have access to first-class resources including Meharry’s high-performance, supercomputing network, cloud computing environment, and robust data resources. 

Graduation Requirements for the Data Science Ph.D. Program

Completion of the program requires 75 graduate credits.  This includes:

  • 15 hours of foundational coursework drawn from computer science and mathematics.
  • 36 hours of “core” courses drawn from data science, computer science and advanced statistics.
  • 1 hour of Candidacy Exam (or qualifying) study and preparation in the following 6 courses:
    • MSDS 550 Computational Machine Learning;
    • MSDS 565 Predictive Modeling and Analytics;
    • MSDS 700 Fundamentals of Database Management Systems;
    • MSDS 710 Mathematical and Statistical Theory;
    • MSDS 715 Data Modeling for Big Data;
    • MSBD 720 Advanced Statistics. The candidacy exam is normally taken at the end of the second year.
      The PhD student must pass all 6 courses and will be allowed to retake the exam(s) from any of the courses no more than once.
  • 5 hours of research seminar.
  • 6 hours of special topics and electives.
  • 12 hours devoted to dissertation and defense.

Applications for the Data Science Ph.D. program and the Biomedical Data Science Ph.D. program are accepted for the Fall Semester, according to the deadlines below. All admitted candidates will be expected to demonstrate an aptitude for quantitative and computational sciences, which may encompass mathematics, statistics, information systems/technology, fundamental programming skills, and related subjects.

Admission decisions will be based on all aspects of the application, including (1) prior academic performance of the applicant in a baccalaureate or master’s program at a regionally-accredited institution, including coursework and independent research projects, (2) relevant work experience, (3) the applicant’s statement of purpose, (4) letters of support, and test scores.

Priority Application Deadline

Fall 2026

March 15, 2026: Priority deadline for application. All materials outlined in Step 2 of the applicant process are due May 1, 2026.

Application Process

Only completed applications will be considered. Applicants may check the status of their application by checking their application portal.

Step 1

  • Submit the online Ph.D. application.
  • A separate statement of purpose that establishes (1) the applicant’s preparation for graduate school, (2) reasons for pursing a graduate degree, (3) prior relevant work or research experience, and (4) ultimate career objectives. The statement of purpose should not exceed 1000 words and can be submitted via the application portal

 

Step 2

  • Resume or curriculum vita.
  • Three letters of recommendation from academic or professional sources are required. Among these three letters at least one must be from an academic source. Letters should not come from family members or close personal friends. All letters should:
    (1) describe the recommender’s relationship with the applicant,
    (2) address the applicant’s likelihood to succeed in graduate school, and
    (3) speak to the applicant’s verbal and written communication skills, collegiality, and predisposition for quantitative analysis and investigation.
    Recommenders must submit letters directly to Meharry Medical College.
  • Official transcripts
  • Applicants must submit official transcripts of coursework attempted and completed at all previous colleges and universities whether or not a degree was earned at the institution. Please ask your institution to send official transcripts directly to the Office of Enrollment Management to sacsenrollment@mmc.edu.

If the institution prefers to mail transcripts, please use this address:

Office of Enrollment Management
School of Applied Computational SciencesMeharry Medical College,
3401 West End Avenue
Suite 260
Nashville, TN 37203

All submitted transcripts become the property of the Meharry Medical College and will not be returned.

Required test scores

The GRE, GMAT, or MCAT test score requirement is waived for applicants for the Fall of 2023 and 2024. We do ask that any applicants submit any previous test scores. All scores must be sent directly to Meharry Medical College, via sacsenrollment@mmc.edu, in electronic form.

Interview

We will invite select applicants who have completed an application via email for a virtual interview with the faculty admission committee. Interviews will be conducted with cameras on and may be recorded for internal use only. Applicants must complete the interview process to be considered for admission. Final admissions decisions will be made from the pool of interviewed applicants.

Acceptance

Applicants will be notified of their admission decision via the applicant portal. Those offered admissions will be asked to communicate their decision.

Admission Requirements

Applications for admission are only accepted for the Fall Term each year. The requirements for admission are:

Prior Degree

All applicants must have the educational equivalent of at least a bachelor’s degree in computer science; information technology/systems; mathematics; statistics; engineering; finance; biomedical, health, or life sciences; or a related discipline from a regionally accredited university in the U.S.

Minimum Prior Coursework

The prior degree(s) must encompass the following minimum coursework requirements, in which applicants must have earned a grade of B or better:

  • Two semesters of college calculus, and one semester of linear algebra.
  • One semester of calculus-based statistics and/or probability at the college level
  • One semester of college-level computer programming fundamentals or one year of demonstrated, verifiable programming experience in a work setting.

 

Biomedical Data Science applicants should also have at least one semester of college-level biology or related coursework in the biological/health/life sciences.

Additional Recommended Prior Coursework and Experience

Additional coursework and/or experience is recommended, as outlined below:

Applicants for Biomedical Data Science Ph.D. program

  • Competency in a second focus area that complements the applicant’s previous degree(s) in one of the preferred fields identified above, as demonstrated by completion of a major, minor, or certificate in one of these areas. For example, applicants whose prior degree(s) are in one of the quantitative fields should ideally possess competency in a biological/biomedical/health sciences field, and applicants whose prior degree(s) is in a biological/biomedical/health sciences field should ideally possess competency in a quantitative field.

 

Applicants for both Ph.D. programs

  • Competency in a second focus area that complements the applicant’s previous degree(s) in one of the preferred fields identified above, as demonstrated by completion of a major, minor, or certificate in one of these areas.
  • More advanced courses in mathematics and statistics, such as multivariable calculus, differential equations, linear programming, mathematical statistics, biostatistics, biomathematics/biophysics, and bioinformatics.
  • More advanced courses in computer science and/or software engineering beyond fundamentals that encompass the ideas of abstraction, modularity, object-oriented programming, data structures, and algorithm design.
  • Evidence of relevant research or work experience, including, but not limited to, demonstrated participation in data science/analytics projects, leadership of data, computing, or information systems/technology groups or teams, and artifacts such as authored research reports and journal publications that demonstrate the applicant’s research aptitude and verbal communication ability.

 

Minimum Grade Point Average (GPA)

  • All applicants possessing a baccalaureate degree only must have earned a minimum cumulative GPA of 3.00 (on a 4-point scale) equivalent of the last 60 semester hours (approximately two years of work).
  • Applicants possessing a master’s degree must have a minimum cumulative GPA of 3.0 (on a 4-point scale).
  • Applicants possessing a baccalaureate degree plus some graduate work, but not a graduate degree, must have a minimum cumulative GPA of 3.0 (on a 4-point scale) individually in both sets of coursework. GPA should be based on scale of 4.0.

 

Other Qualifications

We may also consider other qualifications presented by applicants, such as strong oral, verbal, and interpersonal communication skills, as well as overall goodness-of-fit for the Ph.D. program.

Additional Requirements for International Applicants

International applicants must hold a degree comparable to a regionally accredited US baccalaureate or master’s degree. Applicants submitting transcripts from international colleges and universities are required to have them verified for US degree program equivalency before being considered for admission. Verification from the following organizations is acceptable:

  • International Education Services (IES)
  • World Education Services (WES)
  • Global Credential Evaluators (GCE)
  • Educational Credential Evaluators (ECE)

The decision of the verifying organization must be transmitted directly to Meharry Medical College in electronic form.

English proficiency

Adequate proficiency of spoken and written English is essential to success in graduate study, and medical residency training at Meharry Medical College.

Please review Meharry’s  F-1 English Language Policy and the College’s English proficiency requirements policy.

Transfer Students

Applicants who are enrolled in a Ph.D. program in biomedical data science or related field outside Meharry Medical College may apply for Meharry’s BDS Ph.D. program as a transfer student. Transfer students are subject to the admissions requirements stated above; and therefore, the transfer track is only recommended for first- or second-year graduate students. Applicants transferring from another institution and who are in good standing at that institution are exempt from the graduate admission exam and TOEFL requirements. A recommendation letter from the previous advisor or program director is required. A maximum of six (6) credit hours can be transferred. For more information, please contact The Office of Enrollment Management at sacsenrollment@mmc.edu.

** Please redact sensitive information prior to sending FERPA protected data via email or contact us for other security options.

JeanLuc Nshimiyimana
sacsadmissions@mmc.edu

Data Science Ph.D. students will enroll in three concurrent courses during the Fall, Spring, and Summer Semesters.

The Pathway to a Data Science Ph.D. degree provides an outline of all degree requirements and a curriculum map, organized by semester, for completing them.

Foundation Courses (15 hours)

MSDS 510 Computer Programming Foundations for Data Science

3 credit hours.

Introduction to the basic foundations of computer programming for data science, using Python, R, and SAS as problem solving tools. 1) Introduction to Python. Python syntax to write basic computer programs; Using the interpreter; Built-in and user-defined functions; Introduction to object-oriented programming in Python. 2) Introduction to R. Simple graphing; R Basics: variables, strings, vectors; Data Structures: arrays, matrices, lists, data-frames; Programming Fundamentals: conditions and loops, functions, objects and classes, debugging. 3) Introduction to SAS Programming. The SAS Operating Environment; Understanding Data and the quality characteristics it exhibits; SAS Programming Essentials: SAS Program Structure, SAS Program Syntax; Getting Data In and Out of SAS; Printing and Displaying Data; Introduction to SAS Graphics.

MSDS 525 Data Management Foundations for Data Science

3 credit hours.

The concepts and structures used to store, analyze, manage, and present (visualize) information and navigation using Python, SQL, SAS, and QGIS. Topics will include information analysis and organizational methods, and metadata concepts and applications. Students will be assisted to identify disparate data sources needed to perform analysis for a given real-world problem. Typically, data from a single source will not be adequate to perform the required analysis. Students will pull data from the disparate data sources and import it into SAS, and use several SAS procedures to detect invalid data; format, validate, clean the data; and impute the data if it is missing. This will prepare the data for statistical analysis and decision modeling in SAS.

MSDS 535 Further Mainstream Programming Languages for Data Science

3 credit hours.

This course covers other useful mainstream programming languages for data science, beyond Python, R, SQL, and SAS. These “other” potential programming languages supplement the ability to crunch numbers, and equip the data scientist with good all-round programming skills. Programming languages covered will vary depending on industry popularity. While some of the programming languages may not be covered in detail, examples include: Java, Scala, Julia, TensorFlow, Go, Spark.

MSDS 540 Introduction to Artificial Intelligence

3 credit hours.

Deep dive into recent advances in AI, focusing on deep learning approaches. Foundations of neural networks. Cutting-edge deep learning models including image, text, multimodal and time-series data. Advanced topics on open challenges of integrating AI in a societal application including interpretability, robustness, privacy and fairness.

MSDS 710 Mathematical and Statistical Theory

3 credit hours.

This course will cover fundamental mathematical background for statistical theories. Probability spaces as models for phenomena with statistical regularity. Discrete spaces (binomial, hypergeometric, Poisson). Continuous spaces (normal, exponential) and densities. Random variables, expectation, independence, conditional probability. The course will cover probabilities, multivariate distribution and special distribution, statistical inference, maximum likelihood methods, sufficiency, test of hypotheses, inference about normal methods, nonparametric statistics, Bayesian statistics.

Core Courses (36 hours)

MSDS 550 Computational Machine Learning

3 credit hours.

Introduction to machine learning with business applications. Survey of machine learning techniques, including traditional statistical methods, resampling techniques, model selection and regularization, tree-based methods, principal components analysis, cluster analysis, artificial neural networks, and deep learning. Students implement machine learning models with open-source software for data science. They explore data and learn from data, finding underlying patterns useful for data reduction, feature analysis, prediction, and classification.

MSDS 555 Big Data Management and Analytics

3 credit hours.

An overview of modern data science: the practice of obtaining, storing, modeling, manipulating, analyzing, and interpreting data. Emerging Big data processing frameworks. NoSQL storage solutions. Memory resident databases and graph databases. Ability to initiate and design highly scalable systems that can accept, store, and analyze large volumes of unstructured data in batch mode and/or real time. Organization, administration and governance of large volumes of both structured and unstructured data.

MSDS 565 Predictive Modeling and Analytics

3 credit hours.

Tools and techniques for building statistical or machine learning models to make predictions based on data. NLP and Text Analytics, Time Series, Experimentation and Optimization.

MSDS 570 Visual Analytics

3 credit hours.

Data visualization tools and technologies essential to analyze massive disparate amounts of information and make data driven decisions. Information and geographic visualization of health data. Hands-on experience in planning, creating and using compelling multimedia visualizations such as online maps, responsive graphs, interactive animations and GIS dashboards. Use of different visualizations to support various research activities including hypothesis formulation, data synthesis, analysis and exploration as well as communicate and share health information. Application of usability and user experience (UX) principles to evaluate the extents to which various visualizations meet expectations.

MSDS 580 Research Methods

3 credit hours.

The research process investigating information needs, creation, organization, flow, retrieval, and use. Stages include: research definition, question, objectives, data collection and management, data analysis and data interpretation. Techniques include: observation, interviews, questionnaires, and transaction-log analysis.

MSDS 700 Fundamentals of Database Management Systems

3 credit hours.

Introduction to database concepts and the relational database model. Topics include ER Model, Relational Model, Relational Algebra, SQL, normalization, Indexing, Normal Forms, design methodology, DBMS functions, Security, Transaction Management, data-base administration, and other database management approaches such as client/server databases, object-oriented databases, and data warehouses. Strong emphasis on database system design and application development.

MSDS 715 Data Modeling for Big Data

3 credit hours.

Principles, practices, and techniques for effective data modeling in the age of Big data.

MSDS 720 Advanced Statistics

3 credit hours.

Utilize current statistical techniques to assess and analyze biomedical and public health related data. Read and critique the use of such techniques in published research. Review of linear models, matrix algebra, and multiple analysis of variance. Introduction to random effects models, understanding and computing power for the GLM, GLM assumption diagnostics, transformations, polynomial regression, coding schemes for regression, multicollinearity. Determine what analytical approaches are appropriate under different research scenarios.

MSDS 725 Advanced Scientific Computing: Stochastic Methods for Data Analysis, Inference & Optimization

3 credit hours.

Study of Monte Carlo methods, a diverse class of algorithms that rely on repeated random sampling to compute the solution to problems whose solution space is too large to explore systematically or whose systemic behavior is too complex to model. Introduction to important principles of Monte Carlo techniques and their power. Bayesian analysis and Markov chain Monte Carlo samplers, slice sampling, multigrid Monte Carlo, Hamiltonian Monte Carlo, parallel tempering and multi-nested methods, and streaming methods such as particle filters/sequential Monte Carlo. Related topics in stochastic optimization and inference such as genetic algorithms, simulated annealing, probabilistic Gaussian models, and Gaussian processes. Applications to Bayesian inference and machine learning. Python or R for all programming assignments and projects.

MSDS 730 Deep Learning

3 credit hours.

Deep learning is a sub-field of machine learning that focuses on learning complex, hierarchical feature representations from raw data. The dominant method for achieving this, artificial neural networks, has revolutionized the processing of data (e.g. images, videos, text, and audio) as well as decisionmaking tasks (e.g. game-playing). Its success has enabled a tremendous amount of practical commercial applications and has had a significant impact on society. In this course, students will learn the fundamental principles, underlying mathematics, and implementation details of deep learning. This includes the concepts and methods used to optimize these highly parameterized models (gradient descent and backpropagation, and more generally computation graphs), the modules that make them up (linear, convolution, and pooling layers, activation functions, etc.), and common neural network architectures (convolutional neural networks, recurrent neural networks, etc.). Applications ranging from computer vision to natural language processing and decision-making (reinforcement learning) will be demonstrated. Through in-depth programming assignments, students will learn how to implement these fundamental building blocks as well as how to put them together using a popular deep learning library, PyTorch.

MSDS 736 Ethical, Legal & Societal Issues in Healthcare

3 credit hours.

Examination of case studies. Introduction to healthcare law and ethics, making ethical decisions, contracts, medical records and informed consent, privacy law and HIPAA.

MSDS 740 Big Data Privacy and Security

3 credit hours.

Security issues related to the safeguarding of sensitive personal and corporate information against inadvertent disclosure; Policy and societal questions concerning the value of security and privacy regulations, the real world effects of data breaches on individuals and businesses, and the balancing of interests among individuals, government, and enterprises; Current and proposed laws and regulations that govern information security and privacy; Private sector regulatory efforts and self-help measures; Emerging technologies that may affect security and privacy concerns; and Issues related to the development of enterprise data security programs, policies, and procedures that take into account the requirements of all relevant constituencies; e.g., technical, business, and legal.

Candidacy Exam (1 hour)

MSDS 800 Candidacy Exam

1 credit hour.

Preparation for the Candidacy Exam intended to demonstrate advanced knowledge of content and materials of the six required classes.

Special Topics and Electives (6 hours)

MSDS 560 Natural Language Processing

3 credit hours.

A comprehensive review of text analytics and natural language processing with a focus on recent developments in computational linguistics and machine learning. Students work with unstructured and semi-structured text from online sources, document collections, and databases. Using methods of artificial intelligence and machine learning, students learn how to parse text into numeric vectors and to convert higher dimensional vectors into lower dimensional vectors for subsequent analysis and modeling. Applications include speech recognition, semantic processing, text classification, relevant search, recommendation systems, sentiment analysis, and topic modeling. This is a project-based course with extensive programming assignments.

MSDS 610 Network and Graph Theory for Data Science

3 credit hours.

Networks are discrete mathematical objects that describe systems of entities with pairwise relationship. Over the past several decades, technological advances in data collection and extraction have fueled an explosion of data in the form of networks from seemingly all corners of science. This course aims at providing the mathematical foundations of networks with a particular emphasis on their applications in modern data science, using tools from algorithmic graph theory and linear algebra. The topics include basic graph theory, network statistics, search algorithms, community detection, duality theorems and applications. The course will utilize python (e.g., Networks and Jupyter Notebook) to implement and test the techniques in graph theory and network science in synthetic and real data. Students are strongly encouraged to have some familiarity in Python prior to taking this course.

MSDS 620 Signal Processing for Big Data

3 credit hours.

This course introduces fundamentals of signal processing along with its applications in wearable sensor devices. The course includes topics on signal acquisition, techniques on processing the signals captured, including time domain approaches for event detection, time-varying signal processing for understanding the dynamical aspects of complex systems, and finally the application of machine learning algorithms to build predictive models for early insights.

MSDS 655 AI in Cyber Security

3 credit hours.

What is artificial intelligence (AI)? What does it mean for cybersecurity? And how AI can be integrated to achieve the goals of cybersecurity? This course designed to answer the above questions. In this course, a mix of key AI technologies will be introduced to support the understanding of the decision-making process when cybersecurity is concerned. The course will address key AI technologies in an attempt to help in understanding their role in cybersecurity. AI deficiently will complement and strengthen the cybersecurity practices and will improve their applications in enhancing our security.

MSDS 727 Digital Image Processing and Understanding

3 credit hours.

This course presents fundamental concepts and techniques in digital image processing and understanding. Both theoretical material and computing techniques are introduced. The analytical tools and methods which are currently used in digital image processing are introduced and applied to practical scenarios. Basic digital computing knowledge and programming skills are reinforced by solving real world problems. Computational studies may be performed in R or Python.

Research Seminar (5 hours total; at least one (1) hour in each)

MSDS 750V Individual Studies

Variable hours per semester may be offered (1–3 hours).

This course provides students an opportunity to delve into a special study of interest related to data science selected by the student under the guidance of a faculty member. The student and faculty member meet weekly to discuss the studies; the student will be required to write a comprehensive review paper on the semester’s studies.

MSDS 870V Literature Review

Variable hours per semester may be offered (1–3 hours).

This course provides doctoral students with advanced research skills and strategies for conducting a literature review leading to a dissertation. Through this course, students will produce an extensive and integrative literature review related to their dissertation topic. Students will search, retrieve, summarize, and synthesize relevant studies to produce a comprehensive literature review.

MSDS 880V Seminar

Variable hours per semester may be offered (1–3 hours).

This course provides the student with the opportunity to concisely describe a data science research problem and methodology. Preparation and defense of the dissertation proposal which clearly articulates the problem to be investigated in the field of data science, literature review, and what would need to be done to complete the dissertation. Student must successfully defend the proposal before a Dissertation Proposal Committee which will determine whether the student proceeds to complete the dissertation.

Dissertation and Defense (12 hours)

MSBD 890 Dissertation and Defense

12 credit hours.

Variable hours may be offered.

The completion of PhD dissertation is the culmination of the doctoral degree in this graduate program. The research topic of the dissertation must be related to the PhD in Data Science program.

Data science is an in-demand career, No. 3 on Glassdoor’s Best Jobs in America list. That demand reflects the great need for professionals who understand how to apply data science to uncover valuable insights from big data. The data science profession, according to Glassdoor, has salaries ranging from $99,000 to $238,000, with compensation increasing with years of experience and expertise.

At Meharry, you will have opportunities to present your research and apply your education to tacking real-world problems. You will also discover new solutions to complex issues. We believe this experience will prepare you for a rewarding career in academia, industry or government.