Training structure
Faculty of Science
Presentation
This M2 program is intended for students who have completed an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent with a strong specialization in probability and statistics.
This Master's degree is divided into two more specialized sub-courses, with some of the teaching remaining shared between them.
- The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data.
- The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.
- The aim of the SSD-Biostat program is to meet the expectations of M1 SSD students who are interested in modeling biological or environmental data. The statistical aspects covered in this program range from modeling living organisms to the most theoretical issues in statistics and stochastic modeling. Numerical aspects feature heavily in this program and require a strong interest in computer programming.
The SSD-Biostat program is a demanding course of study because it focuses on concepts rather than techniques. In the field of data in the broad sense, digital technologies, with the advent of artificial intelligence, are evolving rapidly and becoming obsolete even more quickly. Future statistical engineers or researchers who will be required to process data will be able to train in new technologies throughout their professional lives, especially if they have had a solid initial conceptual training. The added value of the program is precisely that it provides a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to monitor technological developments effectively.
The SSD-Biostat program remains partially shared in the second year with the Information and Decision Management program (SSD-MIND). However, specialized courses in life or environmental data science, which are more focused on introducing students to research, are specific to the SSD-BIOSTAT program (two courses per semester in the second year of the master's program).
- The aim of the SSD-MIND program is to meet the expectations of M1 SSD students who are interested in applying data science in business. Given the wide variety of companies and the issues they face, this M2 program provides training in general data science that can be applied in any field. In addition, it provides more specific training in the business context and its economic and managerial issues (economic information, financial risk management, customer relations, business strategy, etc.).
The SSD - MIND program is a mathematical engineering course that focuses on methodology and a thorough understanding of statistical concepts and models. Graduates of this program will be able to handle any type of data and problem, and design a comprehensive and often original methodology to address the issue, starting with data management and organization, continuing with exploration and targeted reduction, then modeling the phenomena of interest, and finally synthesizing the extracted information for decision-making purposes. They will also need to know how to convey to the company the knowledge synthesized from the information extracted from the data. Each new data set and each question asked about it often presents a new problem, and applying a standard method to this data is therefore inappropriate. Instead, a mathematical model adapted to this data (in the sense that it satisfactorily reflects its complexity) must be written and made compatible with a standard estimation method, or a more specific method must be designed and programmed. The emphasis placed by this program on conceptual and mathematical mastery of the tools ensures that graduates of this course have the high level of adaptability and self-training skills required by the rapidly evolving field of data science.
The SSD-MIND program remains partially shared in the second year with the biostatistics program (SSD-Biostat), which is more specialized in the analysis and modeling of biological and environmental data. The SSD-MIND program is a dual degree program in partnership with the IAE (which provides instruction in economics and management), leading to a double degree.
Objectives
The training program has several objectives.
- Enable students to deal with all types of data and issues, and to design a comprehensive and often original methodology to address these issues.
- Enable students to quickly integrate into any type of company by rapidly grasping the issues at stake.
- Bring students who wish to do so to a theoretical level that enables them to write a doctoral thesis in statistics.
- Train future researchers or teacher-researchers in the field of random mathematics: probability or theoretical or applied statistics. After completing their doctorate, they will be able to join laboratories at universities, engineering schools, or research organizations such as the CNRS, INRAE, Inria, CIRAD, INSERM, etc. It is also possible to join a company or industrial research laboratory directly after completing the M2.
- Train specialists in high-level statistical data processing for research organizations or companies for which statistics are now an indispensable tool, such as pharmaceutical laboratories, epidemiological monitoring institutes, air and water quality monitoring institutes, agri-food companies, biotechnology companies, healthcare companies (diagnostic assistance, personalized medicine), etc.
Know-how and skills
- Being able to extract relevant data
- Perform data pre-processing (cleaning and formatting, if necessary)
- Conduct exploratory data analysis using visualization and dimension reduction tools.
- Modeling a problem: mastering the standard methods used in modern data science and knowing how to propose the appropriate method(s) for solving the problem at hand, writing one or more mathematical models suited to the problem, and putting them into a form suitable for processing using the standard methods of modern data science.
- Implement the method from a computational perspective and be able to propose model selection strategies.
- Program effectively in at least one language (Python, R)
- Knowing how to analyze and interpret results, i.e., producing knowledge from the information extracted.
- Linking knowledge to decision-making in order to inform and optimize the latter.
- Be able to communicate results in writing and orally
Program
Master's degree open to work-study programs with long periods of classroom instruction and in-company training.
First term: 7 weeks of teaching from September to the end of October.
Second period: 7 weeks in a company for work-study students or a long supervised project in a laboratory for non-work-study students from November to mid-January.
Third term: 7 weeks of classes from mid-January to mid-March.
Fourth period: work-study program from March to August or a 4- to 6-month internship for non-work-study students.
Internships and supervised projects: A 7-week supervised project for non-work-study students, subject to a report and oral defense.
4- to 6-month internship at the end of the M2 program.
Nonparametric estimation
5 creditsGeneralized linear models
5 creditsEnglish
2 creditsProject or Work-Study Defense
3 creditsBayesian statistics
5 creditsMultivariate analysis
5 creditsStatistical learning
5 credits
Lifetime analysis
4 creditsSupplement 2
4 creditsSupplement 1
4 creditsInternship
14 creditsLatent variable models
4 credits
Admission
Registration procedures
Applications can be submitted on the following platforms:
- French and European students must submit their application via the e-candidat application:https://candidature.umontpellier.fr/candidature
- International students from outside the EU: follow the "Études en France" procedure:https://pastel.diplomatie.gouv.fr/etudesenfrance/dyn/public/authentification/login.html
Target audience
Target audience*: This course is intended for students who have completed an M1 in Mathematics - Statistics and Data Science (SSD) or any other equivalent M1 in mathematics with a strong specialization in probability and statistics.
Mandatory prerequisites
M1 Mathematics - Data Science Statistics (SSD)
Recommended prerequisites
M1 Mathematics - Data Science Statistics (SSD)
And after
Continuing education
Doctorate possible after completing the M2.
Professional integration
Professions: statistician, biostatistician, data scientist, data analyst, all at the engineer level.
All sectors of activity: industry, research and development, healthcare, agronomy, banking and insurance, commerce, etc.