Sciences

Statistics and Data Science (SSD)

  • Training structure

    Faculty of Science

Presentation

The SSD program is a course in applied mathematics that aims to provide high-level skills in statistics, random modeling and data science.
It is designed to provide solid knowledge and professional skills to enable students to join multidisciplinary teams in a wide range of sectors: health, biology, ecology, environment, genomics, energy, agronomy, economics, banking, insurance, marketing, research, higher education, etc.

Read more

Organization

Open on a sandwich basis

This course is offered on a sandwich basis.

Program

Select a program

M1 - Statistics and Data Science (SSD)

Fueled by increasingly powerful means of data collection, statistical data (aka data ) is growing exponentially, and few areas escape ever more extensive measurement. But while collecting data is one thing, analyzing it is quite another. The latter is made difficult by two main phenomena: the size of the data and the complexity of the phenomena being measured. Contemporary statistics aims to solve both these problems. As a result, it is evolving very rapidly, retaining the best of past tools and adapting them to massive, large-scale data, while at the same time proposing increasingly refined modelling approaches that respect the complexity of the phenomena involved. Classical statistics has thus evolved towards a more computational "data science", incorporating automatic learning and diagnostic techniques halfway between statistics and artificial intelligence.

The Statistics and Data Science program provides training in all contemporary statistical analysis and modeling methodologies. While it leads to a career as a "data scientist", it also integrates aspects of methodological design - thanks to mastery of the underlying mathematics and computer programming - as well as the rigorous application of methods and models to data of various types and domains.

In the second year, this course is split into two more specialized sub-courses, some of which are shared. The first of these specializations is Biostatistics, which focuses on the analysis and modeling of life data. The second is Information and Decision Management (MIND), specializing in the analysis and modeling of economic data, as well as the management of associated decisions and risks.

 

What do you want to do?New mailCopy

See the complete page of this course

  • Stochastic processes

  • Information systems and databases

    4 credits
  • Multi-dimensional data analysis

    5 credits
  • Optimization

    5 credits
  • Software development

    4 credits
  • Inferential statistics

  • Information and decision theories

    2 credits
  • Stochastic control

    2 credits
  • Time series

    4 credits
  • Estimation and non-parametric tests

    4 credits
  • Linear model

    5 credits
  • Project

    5 credits
  • English

    2 credits
  • CHOIX2

    2 credits
    • Your choice: 1 of 4

      • Epidemiology tools

        2 credits
      • Microeconomics

        2 credits
      • Bioinformatics Learning Lab

        2 credits
      • Biological information

        2 credits
  • CHOIX1

    4 credits
    • Your choice: 1 of 2

      • Alignment and Phylogeny

        4 credits
      • General economics

        4 credits
  • R programming

    2 credits

M2 - Statistics and Data Science (SSD) - BIOSTATS

This M2 program is aimed at students with an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent, with a strong specialization in probability and statistics.

This M2 course is divided into two more specialized sub-courses, some of which are shared.

- The first of these specializations is Biostatistics, which focuses on the analysis and modeling of living data.

- The second is Information and Decision Management (MIND), specializing in the analysis and modeling of economic data, as well as the management of associated decisions and risks.

  • The ambition of the SSD-Biostat pathway is to meet the expectations of M1 SSD students who are attracted by the modeling of data from the living world or the environment. The statistical aspects covered in this course range from life modeling to the most theoretical issues in statistics and stochastic modeling. Numerical aspects are extremely present in this pathway, and require a pronounced taste for computer programming.

The SSD-Biostat pathway is a demanding course because it focuses on concepts rather than techniques. Indeed, in the field of data in the broadest sense, digital technologies, with the advent of artificial intelligence, are evolving fast and becoming obsolete even faster. Future statistical engineers or researchers who have to deal with data will be able to train in new technologies throughout their working lives, all the more so if they have had solid initial conceptual training. The added value of training is precisely to provide a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to keep abreast of the latest technological developments.

In the second year, the SSD-Biostat pathway remains partly shared with the Information and Decision Management pathway (SSD-MIND). However, the SSD-BIOSTAT pathway has its own specialization courses (two per M2 semester) in life and environmental data science, with an emphasis on introduction to research.

  • The ambition of the SSD-MIND course is to meet the expectations of M1 SSD students who are attracted by the application of data science in business. Given the wide diversity of companies and their issues, this M2 program trains students in generalist, "all-terrain" data science. In addition, it provides training that is more specific to the corporate context and its economic and managerial issues (economic information, financial risk management, customers, corporate strategy, etc.).

The SSD - MIND program is a mathematical engineering course, with a focus on methodology and perfect mastery of statistical concepts and models. Graduates of this course will be able to deal with all types of data and problems, and design a complete and often original methodology to meet these problems, starting with the management and organization of data, continuing with its exploration and targeted reduction, then with the modeling of phenomena of interest, and finally synthesizing the information extracted for decision-making purposes. He or she must also be able to pass on to the business the knowledge synthesized from the information extracted from the data. Each new set of data and each question asked about it is often a new problem, and the application of a standard method to this data is therefore unsuitable. On the contrary, the aim is to write a mathematical model adapted to these data (in the sense that it expresses their complexity satisfactorily) and make it assimilable to a standard estimation method, or to design and program a more specific method. The emphasis placed by this course on conceptual and mathematical mastery of tools ensures that graduates are highly adaptable and self-training, as required by the rapid evolution of data science.

The SSD-MIND pathway remains partly shared in the second year with the biostatistics pathway (SSD-Biostat), more specialized in the analysis and modeling of data from the living world or the environment. The SSD-MIND pathway is a double degree program in partnership with IAE (which teaches economics and management).

See the complete page of this course

  • Non-parametric estimation

    5 credits
  • Generalized linear models

    5 credits
  • English

    2 credits
  • Work-study project or presentation

    3 credits
  • Bayesian statistics

    5 credits
  • Multivariate analysis

    5 credits
  • Statistical learning

    5 credits
  • Life cycle analysis

    4 credits
  • Addendum 2

    4 credits
  • Supplement 1

    4 credits
  • Internship

    14 credits
  • Latent variable models

    4 credits

M2 - Statistics and Data Science (SSD) - MIND

This M2 program is aimed at students with an M1 in Statistics and Data Science (SSD) or any other M1 in mathematics or equivalent, with a strong specialization in probability and statistics.

This M2 course is divided into two more specialized sub-courses, some of which are shared.

- The first of these specializations is Biostatistics, which focuses on the analysis and modeling of living data.

- The second is Information and Decision Management (MIND), specializing in the analysis and modeling of economic data, as well as the management of associated decisions and risks.

  • The ambition of the SSD-Biostat pathway is to meet the expectations of M1 SSD students who are attracted by the modeling of data from the living world or the environment. The statistical aspects covered in this course range from life modeling to the most theoretical issues in statistics and stochastic modeling. Numerical aspects are extremely present in this pathway, and require a pronounced taste for computer programming.

The SSD-Biostat pathway is a demanding course because it focuses on concepts rather than techniques. Indeed, in the field of data in the broadest sense, digital technologies, with the advent of artificial intelligence, are evolving fast and becoming obsolete even faster. Future statistical engineers or researchers who have to deal with data will be able to train in new technologies throughout their working lives, all the more so if they have had solid initial conceptual training. The added value of training is precisely to provide a theoretical understanding of the statistical concepts underlying automatic algorithms. Graduates must also be able to keep abreast of the latest technological developments.

In the second year, the SSD-Biostat pathway remains partly shared with the Information and Decision Management pathway (SSD-MIND). However, the SSD-BIOSTAT pathway has its own specialization courses (two per M2 semester) in life and environmental data science, with an emphasis on introduction to research.

  • The ambition of the SSD-MIND course is to meet the expectations of M1 SSD students who are attracted by the application of data science in business. Given the wide diversity of companies and their issues, this M2 program trains students in generalist, "all-terrain" data science. In addition, it provides training that is more specific to the corporate context and its economic and managerial issues (economic information, financial risk management, customers, corporate strategy, etc.).

The SSD - MIND program is a mathematical engineering course, with a focus on methodology and perfect mastery of statistical concepts and models. Graduates of this course will be able to deal with all types of data and problems, and design a complete and often original methodology to meet these problems, starting with the management and organization of data, continuing with its exploration and targeted reduction, then with the modeling of phenomena of interest, and finally synthesizing the information extracted for decision-making purposes. He or she must also be able to pass on to the business the knowledge synthesized from the information extracted from the data. Each new set of data and each question asked about it is often a new problem, and the application of a standard method to this data is therefore unsuitable. On the contrary, the aim is to write a mathematical model adapted to these data (in the sense that it expresses their complexity satisfactorily) and make it assimilable to a standard estimation method, or to design and program a more specific method. The emphasis placed by this course on conceptual and mathematical mastery of tools ensures that graduates are highly adaptable and self-training, as required by the rapid evolution of data science.

The SSD-MIND pathway remains partly shared in the second year with the biostatistics pathway (SSD-Biostat), more specialized in the analysis and modeling of data from the living world or the environment. The SSD-MIND pathway is a double degree program in partnership with IAE (which teaches economics and management).

See the complete page of this course

  • Generalized linear models

    5 credits
  • English

    2 credits
  • Work-study project or presentation

    3 credits
  • Risk management

    10 credits84h
  • Multivariate analysis

    5 credits
  • Statistical learning

    5 credits
  • Life cycle analysis

    4 credits
  • Internship

    14 credits
  • Strategy and project management

    4 credits
  • Latent variable models

    4 credits
  • Data mining and missing data

    4 credits

Admission

Access conditions

The Master's degree in Maths - SSD is accessible after a Bachelor's degree in Mathematics (fundamental or applied).

Read more

How to register

Applications can be submitted on the following platforms: 

French & European students :

International students from outside the EU: follow the "Études en France" procedure: https: //pastel.diplomatie.gouv.fr/etudesenfrance/dyn/public/authentification/login.html

Read more

And then

Further studies

The Master's degree in Maths - SSD also leads to a thesis in the academic or professional world, to train future teacher-researchers or research engineers.

Read more

Professional integration

Statistician, data scientist, data manager, marketing researcher, customer relations manager, risk management manager, biostatistician, researcher in public research establishments, in corporate R&D teams.

Read more