ONLINE; 26 February- 1 March 2024
To foster international participation, this course will be held online
General Topic: Understanding and Working with Next Generation Sequencing Data
This course will introduce participants to the field of Next Generation Sequencing biology, understanding both the concepts and handling of the data. We will cover a broad range of software and analyses from quality assessment of sequencing runs, through assembling and annotating small genomes, RNAseq and differential gene expression, and phylogenomics with NGS data. Primarily focussed on Illumina data, we will also look at the different requirements and opportunities when utilising long-read data (Nanopore/PacBio). This course will be accompanied by sessions on the use of the Linux command line, and Docker which is the preferred platform for most bioinformatic analyses, as well as software containers, through Docker or Singularity, with a particular focus on best practices for reproducibility.
The course is structured in modules over five days. Each session will include an introductory lecture with a class discussion of key concepts. The remainder of each day will consist of practical hands-on sessions. These sessions will involve a combination of mirroring exercises (delivered via live coding) with the instructors to demonstrate a skill, as well as applying these skills on your own to complete individual exercises. After and during each exercise, the interpretation of results will be discussed as a group.
The course is aimed at researchers with a biological background but no basic hands-on experience with NGS data. We will start by gaining experience with the Linux command line which is fundamental for running the analysis that the rest of the week will be based on. We will therefore dedicate one day to introduce basic and advanced Linux concepts for processing data on Amazon cloud (AWS), and then introduce concepts and background on each analysis step as we progress. Overall, we will begin with assessing raw sequencing data and move through genomic, transcriptomic, and phylogenetic/phylogenomic analysis.
Effectively handling NGS data comfortably and in a reliable and reproducible manner
Understanding the strengths and pitfalls of NGS and how to assess the quality of data generation and analysis
Hands-on experience with state-of-the-art methods to use NGS in experiments across a range of approaches (genomics, transcriptomics, phylogenomics)
Assessment of strengths and weaknesses of the different DNA sequencing technologies, both short read (Illumina), and long reads (Pacific Bioscience, Oxford Nanopore).
Familiarity with biological sequence analysis in an evolutionary context
- We will use example data from Illumina and Nanopore sequencing runs across a range of species and experimental designs.
- We encourage the participants to bring, analyze (if possible), and discuss their data
Monday - 2-8 pm Berlin time
Logging on to Your Personal Cloud Image
Introduction to Linux – an environment for Bioinformatics
Docker and Singularity for reproducible bioinformatics
Cancellation Policy:
> 30 days before the start date = 30% cancellation fee
< 30 days before the start date= No Refund.
Physalia-courses cannot be held responsible for any travel fees, accommodation or other expenses incurred to you as a result of the cancellation.