Science

Statistics and Data Science (SSD)

  • Training structure

    Faculty of Science

Presentation

The SSD program is an applied mathematics course designed to provide high-level skills in statistics, random modeling, and data science.
It aims to provide solid knowledge and professional skills so that students can integrate into multidisciplinary teams in many sectors of activity: health, biology, ecology, environment, genomics, energy, agronomy, economics, banking, insurance, marketing, research, higher education, etc.

Read more

Organization

Open alternately

This course is available on a work-study basis.

Program

Select a program

M1 - Statistics and Data Science (SSD)

Powered by increasingly powerful collection methods, statistical data (aka data) is growing exponentially, and few areas escape increasingly extensive measurement. But while data collection is one thing, analyzing it is another. This is made difficult by two main phenomena: the size of the data and the complexity of the phenomena being measured. Contemporary statistics seeks to resolve these two problems. It is therefore evolving very rapidly, retaining the best of the tools of the past and adapting them to massive and large-scale data, while at the same time offering increasingly refined modeling methods that respect the complexity of the phenomena. Classical statistics has thus evolved into a more computational "data science," which incorporates automatic learning and diagnostic techniques that lie halfway between statistics and artificial intelligence.

The Statistics and Data Science program is a training course in all contemporary statistical analysis and modeling methodologies. While it leads to a career as a data scientist, it incorporates both methodological design aspects—through mastery of the underlying mathematics and their computer programming—and the rigorous application of methods and models to data of various types and domains.

This program is divided in the second year into two more specialized sub-programs, whose courses remain partially shared. The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data. The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.

 

What do you want to do?New mail Copy

See the full page for this route

  • Stochastic processes

  • Information systems and databases

    4 credits
  • Multidimensional data analysis

    5 credits
  • Optimization

    5 credits
  • Software development

    4 credits
  • Inferential statistics

  • Information and decision theories

    2 credits
  • Stochastic control

    2 credits
  • Time series

    4 credits
  • Nonparametric estimation and testing

    4 credits
  • Linear model

    5 credits
  • Project

    5 credits
  • English

    2 credits
  • CHOICE2

    2 credits
    • Choose 1 out of 4

      • Epidemiology tools

        2 credits
      • Microeconomics

        2 credits
      • Bioinformatics Learning Lab

        2 credits
      • Biological information

        2 credits
    • Choose 1 out of 4

      • Bioinformatics Learning Lab

        2 credits
      • Epidemiology tools

        2 credits
      • Microeconomics

        2 credits
      • Biological information Eco-EPI and SSD

        2 credits
  • CHOICE1

    4 credits
    • Choose one of two options:

      • Alignment and Phylogeny

        4 credits
      • General economics

        4 credits
    • Choose one of two options:

      • General economics

        4 credits
      • Alignment and phylogeny Eco-EPI and SSD

        4 credits
  • R programming

    2 credits

M2 - Statistics and Data Science (SSD) - BIOSTATS

This M2 program is intended for students who have completed an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent with a strong specialization in probability and statistics.

This Master's degree is divided into two more specialized sub-courses, with some of the teaching remaining shared between them.

- The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data.

- The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.

  • The aim of the SSD-Biostat program is to meet the expectations of M1 SSD students who are interested in modeling biological or environmental data. The statistical aspects covered in this program range from modeling living organisms to the most theoretical issues in statistics and stochastic modeling. Numerical aspects feature heavily in this program and require a strong interest in computer programming.

The SSD-Biostat program is a demanding course of study because it focuses on concepts rather than techniques. In the field of data in the broad sense, digital technologies, with the advent of artificial intelligence, are evolving rapidly and becoming obsolete even more quickly. Future statistical engineers or researchers who will be required to process data will be able to train in new technologies throughout their professional lives, especially if they have had a solid initial conceptual training. The added value of the program is precisely that it provides a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to monitor technological developments effectively.

The SSD-Biostat program remains partially shared in the second year with the Information and Decision Management program (SSD-MIND). However, specialized courses in life or environmental data science, which are more focused on introducing students to research, are specific to the SSD-BIOSTAT program (two courses per semester in the second year of the master's program).

  • The aim of the SSD-MIND program is to meet the expectations of M1 SSD students who are interested in applying data science in business. Given the wide variety of companies and the issues they face, this M2 program provides training in general data science that can be applied in any field. In addition, it provides more specific training in the business context and its economic and managerial issues (economic information, financial risk management, customer relations, business strategy, etc.).

The SSD - MIND program is a mathematical engineering course that focuses on methodology and a thorough understanding of statistical concepts and models. Graduates of this program will be able to handle any type of data and problem, and design a comprehensive and often original methodology to address the issue, starting with data management and organization, continuing with exploration and targeted reduction, then modeling the phenomena of interest, and finally synthesizing the extracted information for decision-making purposes. They will also need to know how to convey to the company the knowledge synthesized from the information extracted from the data. Each new data set and each question asked about it often presents a new problem, and applying a standard method to this data is therefore inappropriate. Instead, a mathematical model adapted to this data (in the sense that it satisfactorily reflects its complexity) must be written and made compatible with a standard estimation method, or a more specific method must be designed and programmed. The emphasis placed by this program on conceptual and mathematical mastery of the tools ensures that graduates of this course have the high level of adaptability and self-training skills required by the rapidly evolving field of data science.

The SSD-MIND program remains partially shared in the second year with the biostatistics program (SSD-Biostat), which is more specialized in the analysis and modeling of biological and environmental data. The SSD-MIND program is a dual degree program in partnership with the IAE (which provides instruction in economics and management), leading to a double degree.

See the full page for this route

  • Nonparametric estimation

    5 credits
  • Generalized linear models

    5 credits
  • English

    2 credits
  • Project or Work-Study Defense

    3 credits
  • Bayesian statistics

    5 credits
  • Multivariate analysis

    5 credits
  • Statistical learning

    5 credits
  • Lifetime analysis

    4 credits
  • Supplement 2

    4 credits
  • Supplement 1

    4 credits
  • Internship

    14 credits
  • Latent variable models

    4 credits

M2 - Statistics and Data Science (SSD) - MIND

This M2 program is intended for students who have completed an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent with a strong specialization in probability and statistics.

This Master's degree is divided into two more specialized sub-courses, with some of the teaching remaining shared between them.

- The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data.

- The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.

  • The aim of the SSD-Biostat program is to meet the expectations of M1 SSD students who are interested in modeling biological or environmental data. The statistical aspects covered in this program range from modeling living organisms to the most theoretical issues in statistics and stochastic modeling. Numerical aspects feature heavily in this program and require a strong interest in computer programming.

The SSD-Biostat program is a demanding course of study because it focuses on concepts rather than techniques. In the field of data in the broad sense, digital technologies, with the advent of artificial intelligence, are evolving rapidly and becoming obsolete even more quickly. Future statistical engineers or researchers who will be required to process data will be able to train in new technologies throughout their professional lives, especially if they have had a solid initial conceptual training. The added value of the program is precisely that it provides a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to monitor technological developments effectively.

The SSD-Biostat program remains partially shared in the second year with the Information and Decision Management program (SSD-MIND). However, specialized courses in life or environmental data science, which are more focused on introducing students to research, are specific to the SSD-BIOSTAT program (two courses per semester in the second year of the master's program).

  • The aim of the SSD-MIND program is to meet the expectations of M1 SSD students who are interested in applying data science in business. Given the wide variety of companies and the issues they face, this M2 program provides training in general data science that can be applied in any field. In addition, it provides more specific training in the business context and its economic and managerial issues (economic information, financial risk management, customer relations, business strategy, etc.).

The SSD - MIND program is a mathematical engineering course that focuses on methodology and a thorough understanding of statistical concepts and models. Graduates of this program will be able to handle any type of data and problem, and design a comprehensive and often original methodology to address the issue, starting with data management and organization, continuing with exploration and targeted reduction, then modeling the phenomena of interest, and finally synthesizing the extracted information for decision-making purposes. They will also need to know how to convey to the company the knowledge synthesized from the information extracted from the data. Each new data set and each question asked about it often presents a new problem, and applying a standard method to this data is therefore inappropriate. Instead, a mathematical model adapted to this data (in the sense that it satisfactorily reflects its complexity) must be written and made compatible with a standard estimation method, or a more specific method must be designed and programmed. The emphasis placed by this program on conceptual and mathematical mastery of the tools ensures that graduates of this course have the high level of adaptability and self-training skills required by the rapidly evolving field of data science.

The SSD-MIND program remains partially shared in the second year with the biostatistics program (SSD-Biostat), which is more specialized in the analysis and modeling of biological and environmental data. The SSD-MIND program is a dual degree program in partnership with the IAE (which provides instruction in economics and management), leading to a double degree.

See the full page for this route

  • Generalized linear models

    5 credits
  • English

    2 credits
  • Project or Work-Study Defense

    3 credits
  • Risk management

    10 credits84h
  • Multivariate analysis

    5 credits
  • Statistical learning

    5 credits
  • Lifetime analysis

    4 credits
  • Internship

    14 credits
  • Strategy and project management

    4 credits
  • Latent variable models

    4 credits
  • Data mining and missing data

    4 credits

Admission

Admission requirements

The Master's degree in Mathematics - SSD is accessible after a Bachelor's degree in Mathematics (fundamental or applied).

Read more

Registration procedures

Applications can be submitted on the following platforms: 

French & European students:

International students from outside the EU: follow the "Études en France" procedure:https://pastel.diplomatie.gouv.fr/etudesenfrance/dyn/public/authentification/login.html

Read more

And after

Continuing education

The Master's in Mathematics – SSD also leads to doctoral studies in academia or the professional world to train future teachers, researchers, or research engineers.

Read more

Professional integration

Research engineers, statisticians, data scientists, data managers, marketing research managers, customer relationship managers, risk management managers, biostatisticians, researchers in public research institutions, and R&D teams in companies.

Read more