ECTS
4 credits
Component
Faculty of Science
Description
Following on from module HAI109H, the aim of this module is to provide skills for the analysis of DNA sequences from genomic data, from reading the DNA sequences to comparative analysis of these sequences in order to extract relevant information, and to do so automatically using a dedicated Python library: BioPython.
Objectives
In addition to a presentation of the most commonly used algorithm in bioinformatics (BLAST), students will be introduced to programming management tools routinely used in the field, such as git and snakemake.
The program is as follows:
-
Introduction to the principles of sequencing/assembly or how biological data is generated, Sequence comparison (pattern matching, alignment, blast),
-
Python/BioPython programming: object alignment and blast, complex sequence analysis in BioPython/blast,
-
versioning and process automation in biopython: git & snakemake.
Students will be able to apply the skills they have acquired through a supervised project to develop a pipeline for automating sequence comparison. This project will be initiated during the practical sessions.