Data Sciences and Applications MPS

The Master of Professional Studies in Data Sciences and Applications program trains students in analytics, including standard methods in data mining and machine learning, so they will possess the expertise to obtain insights from large and heterogeneous data sets.

Created in consultation with major companies like IBM, HP, Sentient Science, Calspan, M&T, and Moog, our Data Sciences and Applications MPS program equips graduates with the skills industries need but don't often see in current hiring pools.

The McKinsey Global Institute estimates that the job market will need an additional 140,000–190,000 trained personnel for "deep analytical talent positions" and 1.5 million more "data-savvy managers" to fully take advantage of big data in the United States. A recent New York Times article writes, "Universities can hardly turn out data scientists fast enough." It is estimated that the national shortage of such talent is at least 60%.


About the Program

Students in the Data Sciences and Applications MPS program learn data manipulation, database management, distributed and big data management, and cloud-based methodologies.

Graduates of the program are able to:

Execute mathematical and computing techniques commonly used for analyzing data
Identify and perform statistical techniques commonly used for analyzing data
Manage and organize complex and large data sets and databases for analysis
Use high-level and customized programming languages to manage data analysis
Recognize best practices for data security and ethical use of data
Integrate data science methodologies and their adaptation into diverse fields

Program Director

Rachael Hageman Blair
709 Kimball Tower
hageman@buffalo.edu
(716) 829-2814

This program is STEM approved, allowing international students the opportunity to apply for the 24-month STEM OPT extension.

Admissions Requirements

The program is skills-oriented and provides training in data science, computing and analysis. Students will need some prior knowledge of mathematics, statistics, and computing, and bridge classes are available to prepare them for success in the program. In particular, we are interested in students from non-traditional backgrounds who have an interest in or need for data science skills.

Because our program is highly interdisciplinary, students from all majors interested in data sciences and applications skills are encouraged to apply. Applicants must also provide the following application materials:

Online Application Form

Students can apply online. New applicants must create an account. If you have applied in the past, you will be asked to log in to your existing account to start a new application.

The application includes sections that will ask you to provide us with personal/biographical information, contact information, citizenship details, your past or current college/university, and degree details. You will be able to enter the names and contact information of your recommenders and upload supporting materials to your application. You may upload supporting documents while completing the application form or after you have submitted your application in your Application Status Portal.

You do not need official transcripts or test scores to apply. Unofficial copies are sufficient for application review, but students are encouraged to provide official copies if they have them.

Application Fee ($100)

A non-refundable fee of $100 is required to apply for Spring 2025 or later terms. The application fee for Fall 2024 is $85. After you submit your application form, you can pay the application fee online in your Application Status Portal with a credit card or e-check.

College Transcripts

Copies of transcript(s) for all post-secondary schoolwork must be uploaded with your application for initial review. Upon an offer of admission, accepted applicants will be required to submit official transcripts and proof of degree(s).

Letters of Recommendation (2)

Two letters of recommendation are required to apply to this program. Letters are automatically requested when you enter recommenders' names and email addresses in your application. While we will accept letters from professional sources, we strongly prefer letters from professors acquainted with your academic interests, achievements, and abilities.

GRE Scores (optional)

GRE scores are optional for this program. If you would like to take the GRE, you can arrange to take it through the Educational Testing Service (ETS). Official scores should come directly from ETS.

Note: the University at Buffalo Institutional Code is 2925.

Resume/CV

Upload a resume or curriculum vitae (CV) to your application. Your resume or CV should include details about your education, employment, and internship history. Pertinent research experience should be included as well.

Statement of Purpose

Please upload a brief personal statement describing your educational objectives, your academic experience, and why you're interested in the program. There is no required length or word limit for the statement.

English Proficiency Scores (for international students only)

International applicants are required to provide proof of English proficiency. The chart below outlines acceptable test types and the university's minimum score required for admission. All applicants whose native language is not English will be required to provide proof of English proficiency.

Copies may be submitted for initial review. Official scores must be sent directly to UB from the testing agency.

Test Type	University Minimum Score	To Order...
TOEFL iBT (including MyBest scores), TOEFL Home Edition	79	Use Institution Code 2925
TOEFL Essentials	8.5	Use Institution Code 2925
TOEFL ITP Plus	550	Use Institution Code 2925
IELTS and IELTS Indicator	6.5	Select "University at Buffalo, State University of New York (SUNY)"
PTE Academic, PTE Academic Online	55	Select "University at Buffalo, State University of New York (SUNY)"
Cambridge English Proficiency (CPE)	185	Select "University at Buffalo, State University of New York (SUNY)"
Cambridge English Advanced (CAE)	185	Select "University at Buffalo, State University of New York (SUNY)"
Duolingo English Test (DET)	120	ADA Department of Testing will report your official scores to central application services

Exam results must be dated within two years of your proposed date of admission and remain valid upon entering the term for which you applied.

We strongly recommend making test arrangements early in the year to allow sufficient time for results to be reported before application deadlines.

Exemptions to English Language Proficiency requirements can be found on The Graduate School website.


Application Deadlines

We accept applications on a rolling basis throughout the year, but encourage all prospective students to submit their applications by the deadlines noted below.

Spring enrollment:
Apply by October 1

Fall enrollment:
Apply by February 15

Curriculum Overview

This program is currently taught in a cohort-based model and offers both Fall and Spring admission.
Students will take a combination of core courses (24 credits), a data science applications course (3 credits) and a data science project (3 credits) for a total of 30 credits.
Students have the opportunity to complete an internship in industry for their data science project requirements. Alternatively, students can opt to complete a research project with a faculty member, or complete an elective in place of a project.
The program can be completed in 1 to 1.5 years on average, depending on entry term.

Course plan for full-time students:

First semester – 4 core courses (Math and Stats Basics)
Second semester – 3 core courses + 1 elective
Third semester – 1 Data Science Survey course + 1 Project/Capstone

Course Requirements

Core Courses

All courses are 3 credit hours for a total of 30 credits.

EAS 503 Programming Database Fundamentals

This course introduces students to computer science fundamentals for building basic data science applications. The course has two components. The first part introduces students to algorithm design and implementation in a modern, high-level programming language (currently, Python). It emphasizes problem-solving by abstraction. Topics include data types, variables, and expressions; basic imperative programming techniques, including assignment, input/output, subprograms, parameters, selection, iteration, and Boolean types and expressions; and the use of aggregate data structures, including arrays. Students will also be introduced to the basics of abstract data types and object-oriented design. The second part covers regression analysis and an introduction to linear models. Topics include multiple regression, analysis of covariance, least squares means, logistic regression, and nonlinear regression. Students learn to implement the regression models as a computer program and use the developed application to analyze synthetic and real-world data sets.

CDA 502/MGS 613 Database Management Systems

This course provides a basic understanding of relational databases, including normalization, database schemas, and relational algebra. Students learn to create, update, query, and delete tables using standard SQL statements; understand workflows such as ETL (extract, transform, and load) that aggregate data from multiple sources and integrate it into databases and data warehouses; use, manage, and customize NoSQL databases, including key-value, wide-column, document, and graph stores, and apply them to non-tabular data; and use, manage, and customize graph databases and apply them to multi-dimensional datasets.

CDA 511 Introduction to Numerical Analysis

A first course on the design and implementation of numerical methods to solve the most common types of problems arising in science and engineering. Most such problems cannot be solved in terms of a closed analytical formula, but many can be handled with numerical methods learned in this course. Topics for the two semesters include: how a computer does arithmetic, solving systems of simultaneous linear or nonlinear equations, finding eigenvalues and eigenvectors of (large) matrices, minimizing a function of many variables, fitting smooth functions to data points (interpolation and regression), computing integrals, solving ordinary differential equations (initial and boundary value problems), and solving partial differential equations of elliptic, parabolic, and hyperbolic types. We study how and why numerical methods work, and also their errors and limitations. Students gain practical experience through course projects that entail writing computer programs.

CDA 531/MTH 511 Probability and Data Analysis

Topics include: review of probability, conditional probability, and Bayes' Theorem; random variables and distributions; expectation and its properties; covariance, correlation, and conditional expectation; special distributions; the Central Limit Theorem and applications; and estimation, including Bayes estimators and maximum likelihood estimators, and their properties. Also includes the use of sufficient statistics to 'improve' estimators, distribution of estimators, unbiasedness, hypothesis testing, linear statistical models, and statistical inference from the Bayesian point of view.

CDA 541/STA 545 Statistical Data Mining 1

This course presents statistical models for data mining, inference, and prediction. The focus will be on supervised learning, which concerns outcome prediction from input data. Students will be introduced to a number of methods for supervised learning, including: linear and logistic regression, shrinkage methods, lasso, partial least squares, tree-based methods, model assessment and selection, model inference and averaging, and neural networks. Computational applications will be presented using R and high-dimensional data to reinforce theoretical concepts.

CDA 532/STA 546 Statistical Data Mining 2

This course presents the topic of data mining from a statistical perspective, with attention directed towards both applied and theoretical considerations. An emphasis will be placed on unsupervised learning methods, especially those designed to discover and exploit hidden structures in high-dimensional data. Topics include: hierarchical and center-based clustering, principal component analysis, data visualization, random forests, directed and undirected graphical models, and special considerations when n>>p. Computational applications to high-dimensional data will be presented using MATLAB and R to illustrate methods and concepts.

CSE 574 Machine Learning

Humans have an uncanny ability to learn from their mistakes and adapt to new environments by relying on their past experience. Machine learning focuses on the question, "How can we write a computer program that can improve its performance through experience?" Machine learning has a huge number of practical applications, more so in the present era of Big Data, where staggering volumes of diverse data in almost every facet of society, science, engineering, and commerce are presenting opportunities for valuable discoveries. For example, machine learning is being used to understand financial markets, the impact of climate change on society, protein-protein interactions, and diseases. Its applications range from self-driving cars to never-ending language learning systems. This course will focus on understanding the mathematical and statistical foundations of machine learning. We will also cover the core set of techniques and algorithms needed to understand the practical applications of machine learning. The course will present an integrated view of machine learning, statistics (classical and Bayesian), data mining, and information theory. A basic understanding of probability, statistics, algorithms, and linear algebra is expected. Familiarity with Python is required for homework assignments and for understanding in-class demonstrations.

CDA 551/MGS 639 Cybersecurity Privacy and Ethics

This course covers the present-day terms, philosophies, technologies, and strategies that go into buttressing an organization’s cybersecurity posture. Managing the resources of a corporate information assurance program, while continually improving a risk footprint and response, underpins all topics covered. Students will critically examine concepts such as networking, system administration, and system security, and will identify and apply basic security hardening techniques. Students will gain practical experience through a virtualized lab environment where they will build and secure a small corporate network.

EAS 504 Applications of Data Science Industry Overview

This course will provide students with an overview of data-driven analytics in different industry sectors. The class features a series of visiting lecturers; the faculty member teaching the class provides overview and continuity and grades homework and term papers.

CDA 571/ Project Guidance

This course will provide students with a final integrative project experience. The class will require students to obtain an integrative project experience either in industry or at the university. In either case, students will use the skills acquired in their other classes to execute project goals. Students will provide short reports to supervising faculty to ensure that learning objectives are being met.

Have questions or want to learn more?

For degree-specific questions, please contact the graduate coordinator at cdsedept@buffalo.edu.

For admissions-related questions, please contact gradeng@buffalo.edu.


Course Requirements

Core Courses

EAS 503 Programming Database Fundamentals

CDA 502/MGS 613 Database Management Systems

CDA 511 Introduction to Numerical Analysis

CDA 531/MTH 511 Probability and Data Analysis

CDA 541/STA 545 Statistical Data Mining 1

CDA 532/STA 546 Statistical Data Mining 2

CSE 574 Machine Learning

CDA 551/MGS 639 Cybersecurity Privacy and Ethics

EAS 504 Applications of Data Science Industry Overview

CDA 571/ Project Guidance