M2 - Statistics and Data Science (SSD) - BIOSTATS

Training structure
Faculty of Science

Presentation

This M2 program is intended for students who have completed an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent with a strong specialization in probability and statistics.

This Master's degree is divided into two more specialized sub-courses, with some of the teaching remaining shared between them.

- The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data.

- The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.

The aim of the SSD-Biostat program is to meet the expectations of M1 SSD students who are interested in modeling biological or environmental data. The statistical aspects covered in this program range from modeling living organisms to the most theoretical issues in statistics and stochastic modeling. Numerical aspects feature heavily in this program and require a strong interest in computer programming.

The SSD-Biostat program is a demanding course of study because it focuses on concepts rather than techniques. In the field of data in the broad sense, digital technologies, with the advent of artificial intelligence, are evolving rapidly and becoming obsolete even more quickly. Future statistical engineers or researchers who will be required to process data will be able to train in new technologies throughout their professional lives, especially if they have had a solid initial conceptual training. The added value of the program is precisely that it provides a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to monitor technological developments effectively.

The SSD-Biostat program remains partially shared in the second year with the Information and Decision Management program (SSD-MIND). However, specialized courses in life or environmental data science, which are more focused on introducing students to research, are specific to the SSD-BIOSTAT program (two courses per semester in the second year of the master's program).

The aim of the SSD-MIND program is to meet the expectations of M1 SSD students who are interested in applying data science in business. Given the wide variety of companies and the issues they face, this M2 program provides training in general data science that can be applied in any field. In addition, it provides more specific training in the business context and its economic and managerial issues (economic information, financial risk management, customer relations, business strategy, etc.).

The SSD - MIND program is a mathematical engineering course that focuses on methodology and a thorough understanding of statistical concepts and models. Graduates of this program will be able to handle any type of data and problem, and design a comprehensive and often original methodology to address the issue, starting with data management and organization, continuing with exploration and targeted reduction, then modeling the phenomena of interest, and finally synthesizing the extracted information for decision-making purposes. They will also need to know how to convey to the company the knowledge synthesized from the information extracted from the data. Each new data set and each question asked about it often presents a new problem, and applying a standard method to this data is therefore inappropriate. Instead, a mathematical model adapted to this data (in the sense that it satisfactorily reflects its complexity) must be written and made compatible with a standard estimation method, or a more specific method must be designed and programmed. The emphasis placed by this program on conceptual and mathematical mastery of the tools ensures that graduates of this course have the high level of adaptability and self-training skills required by the rapidly evolving field of data science.

The SSD-MIND program remains partially shared in the second year with the biostatistics program (SSD-Biostat), which is more specialized in the analysis and modeling of biological and environmental data. The SSD-MIND program is a dual degree program in partnership with the IAE (which provides instruction in economics and management), leading to a double degree.

Objectives

The training program has several objectives.

Enable students to deal with all types of data and issues, and to design a comprehensive and often original methodology to address these issues.
Enable students to quickly integrate into any type of company by rapidly grasping the issues at stake.
Bring students who wish to do so to a theoretical level that enables them to write a doctoral thesis in statistics.
Train future researchers or teacher-researchers in the field of random mathematics: probability or theoretical or applied statistics. After completing their doctorate, they will be able to join laboratories at universities, engineering schools, or research organizations such as the CNRS, INRAE, Inria, CIRAD, INSERM, etc. It is also possible to join a company or industrial research laboratory directly after completing the M2.
Train specialists in high-level statistical data processing for research organizations or companies for which statistics are now an indispensable tool, such as pharmaceutical laboratories, epidemiological monitoring institutes, air and water quality monitoring institutes, agri-food companies, biotechnology companies, healthcare companies (diagnostic assistance, personalized medicine), etc.

Being able to extract relevant data
Perform data pre-processing (cleaning and formatting, if necessary)
Conduct exploratory data analysis using visualization and dimension reduction tools.
Modeling a problem: mastering the standard methods used in modern data science and knowing how to propose the appropriate method(s) for solving the problem at hand, writing one or more mathematical models suited to the problem, and putting them into a form suitable for processing using the standard methods of modern data science.
Implement the method from a computational perspective and be able to propose model selection strategies.
Program effectively in at least one language (Python, R)
Knowing how to analyze and interpret results, i.e., producing knowledge from the information extracted.
Linking knowledge to decision-making in order to inform and optimize the latter.
Be able to communicate results in writing and orally

	Specialty	M2 - Statistics and Data Science (SSD)
	Entry level	Bachelor's degree
	Required entry level	No
	Location(s)	Montpellier - Triolet

Place of instruction

Contacts

Elodie Brunel-Piccinini
Head of Education
- +33 4 67 14 41 64
- elodie.brunel-piccinini @ umontpellier.fr
Xavier Bry
Head of Education
- +33 4 67 14 35 78
- xavier.bry @ umontpellier.fr

Partner institution(s)

University of Sherbrooke (double degree)
IAE Montpellier (dual degree in M2)

Partner laboratory(ies)

Montpellier Alexander Grothendieck Institute IMAG

Program

Master's degree open to work-study programs with long periods of classroom instruction and in-company training.

First term: 7 weeks of teaching from September to the end of October.

Second period: 7 weeks in a company for work-study students or a long supervised project in a laboratory for non-work-study students from November to mid-January.

Third term: 7 weeks of classes from mid-January to mid-March.

Fourth period: work-study program from March to August or a 4- to 6-month internship for non-work-study students.

Internships and supervised projects: A 7-week supervised project for non-work-study students, subject to a report and oral defense.

4- to 6-month internship at the end of the M2 program.

Lifetime analysis
4 credits
Supplement 2
4 credits
Supplement 1
4 credits
Internship
14 credits
Latent variable models
4 credits

Admission

Registration procedures

Applications can be submitted on the following platforms:

French and European students must submit their application via the e-candidat application:https://candidature.umontpellier.fr/candidature

International students from outside the EU: follow the "Études en France" procedure:https://pastel.diplomatie.gouv.fr/etudesenfrance/dyn/public/authentification/login.html

Target audience

Target audience*: This course is intended for students who have completed an M1 in Mathematics - Statistics and Data Science (SSD) or any other equivalent M1 in mathematics with a strong specialization in probability and statistics.

Mandatory prerequisites

M1 Mathematics - Data Science Statistics (SSD)

Recommended prerequisites

M1 Mathematics - Data Science Statistics (SSD)

Continuing education

Doctorate possible after completing the M2.

Professional integration

Professions: statistician, biostatistician, data scientist, data analyst, all at the engineer level.

All sectors of activity: industry, research and development, healthcare, agronomy, banking and insurance, commerce, etc.

M2 - Statistics and Data Science (SSD) - BIOSTATS

Training structure

Presentation

Objectives

Know-how and skills

Program

Nonparametric estimation

Generalized linear models

English

Project or Work-Study Defense

Bayesian statistics

Multivariate analysis

Statistical learning

Lifetime analysis

Supplement 2

Supplement 1

Internship

Latent variable models

Admission

Registration procedures

Target audience

Mandatory prerequisites

Recommended prerequisites

And after

Continuing education

Professional integration

M2 - Statistics and Data Science (SSD) - BIOSTATS

Training structure

Presentation

Objectives

Know-how and skills

Program

Nonparametric estimation

ECTS

Training structure

Generalized linear models

ECTS

Training structure

English

ECTS

Training structure

Project or Work-Study Defense

ECTS

Training structure

Bayesian statistics

ECTS

Training structure

Multivariate analysis

ECTS

Training structure

Statistical learning

ECTS

Training structure

Lifetime analysis

ECTS

Training structure

Supplement 2

ECTS

Training structure

Supplement 1

ECTS

Training structure

Internship

ECTS

Training structure

Latent variable models

ECTS

Training structure

Admission

Registration procedures

Target audience

Mandatory prerequisites

Recommended prerequisites

And after

Continuing education

Professional integration