M1 - Statistics and Data Science (SSD)

  • Training structure

    Faculty of Science

Presentation

Powered by increasingly powerful collection methods, statistical data (aka data) is growing exponentially, and few areas escape increasingly extensive measurement. But while data collection is one thing, analyzing it is another. This is made difficult by two main phenomena: the size of the data and the complexity of the phenomena being measured. Contemporary statistics seeks to resolve these two problems. It is therefore evolving very rapidly, retaining the best of the tools of the past and adapting them to massive and large-scale data, while at the same time offering increasingly refined modeling methods that respect the complexity of the phenomena. Classical statistics has thus evolved into a more computational "data science," which incorporates automatic learning and diagnostic techniques that lie halfway between statistics and artificial intelligence.

The Statistics and Data Science program is a training course in all contemporary statistical analysis and modeling methodologies. While it leads to a career as a data scientist, it incorporates both methodological design aspects—through mastery of the underlying mathematics and their computer programming—and the rigorous application of methods and models to data of various types and domains.

This program is divided in the second year into two more specialized sub-programs, whose courses remain partially shared. The first of these specializations is Biostatistics, which delves deeper into the analysis and modeling of biological data. The second is Information and Decision Management (MIND), which specializes in the analysis and modeling of economic data as well as the management of decisions and associated risks.

 

What do you want to do?New mail Copy

Read more

Objectives

The first year of the program provides students with the conceptual, mathematical, and practical foundations of the major branches of statistical methodology: statistical information and decision-making, probabilistic modeling and statistical inference, multidimensional linear exploration and modeling. The program also includes two introductory courses to choose from among introductory courses in economics and life sciences data.

At the end of this year, students will have a solid foundation on which to build their specialization in the second year. They will also have a clear idea of which specialization is best suited to them: Biostatistics or MIND.

What do you want to do?New mail Copy

Read more

Know-how and skills

For each of the major families of techniques and models, students will master the ideas and mathematical formalism to the point where they can adapt and program them, and, of course, use them correctly, i.e., critically, consciously, and with respect for their limits of validity.

Students will also begin to be able to build a data processing chain consisting of the following steps, in order: defining a problem, managing and preprocessing data, designing and implementing the analytical chain:  exploratory analysis, modeling, estimation and model selection, analysis of results, and finally, writing a clear, rigorous, and educational report on the entire project.

Students will also have learned, through various projects and assignments, how to correctly and efficiently program the statistical calculations required for original problems.

Finally, he or she will have learned to present and defend their work in front of an audience in about twenty minutes.

Read more

Program

A supervised project in the second semester of the M1.

Read more
  • Stochastic processes

  • Information systems and databases

    4 credits
  • Multidimensional data analysis

    5 credits
  • Optimization

    5 credits
  • Software development

    4 credits
  • Inferential statistics

  • Information and decision theories

    2 credits
  • Stochastic control

    2 credits
  • Time series

    4 credits
  • Nonparametric estimation and testing

    4 credits
  • Linear model

    5 credits
  • Project

    5 credits
  • English

    2 credits
  • CHOICE2

    2 credits
    • Choose 1 out of 4

      • Epidemiology tools

        2 credits
      • Microeconomics

        2 credits
      • Bioinformatics Learning Lab

        2 credits
      • Biological information

        2 credits
    • Choose 1 out of 4

      • Bioinformatics Learning Lab

        2 credits
      • Epidemiology tools

        2 credits
      • Microeconomics

        2 credits
      • Biological information Eco-EPI and SSD

        2 credits
  • CHOICE1

    4 credits
    • Choose one of two options:

      • Alignment and Phylogeny

        4 credits
      • General economics

        4 credits
    • Choose one of two options:

      • General economics

        4 credits
      • Alignment and phylogeny Eco-EPI and SSD

        4 credits
  • R programming

    2 credits

Admission

Admission requirements

Registration procedures

Applications can be submitted on the following platforms: 

Read more

Target audience

Students with a bachelor's degree in general mathematics.

Read more

Mandatory prerequisites

Have a good level in analysis, linear and bilinear algebra, geometry, elementary statistics, and probability.

Read more

Recommended prerequisites

Have a good level in analysis, linear and bilinear algebra, geometry, elementary statistics, and probability.

Read more

And after

Continuing education

Master's degree in statistics in the broad sense, preparation for the competitive examination for university professorships in mathematics, and possible doctorate thereafter.

Read more

Professional integration

Professions: statistician, biostatistician, data scientist, data analyst, all at the engineer level.

All sectors of activity: industry, research and development, healthcare, agronomy, banking and insurance, commerce, etc.

Read more