RNA-seq Analyses in non-model organisms

RNA-seq Analyses in non-model organisms

dates

18th-22nd November 2024

To foster international participation, this course will be held online

Course overview

RNA-Seq technology has been transformative in our ability to explore gene content and gene expression in all realms of biology, and de novo transcriptome assembly has enabled opportunities to expand transcriptome analysis to non-model organisms. This workshop provides an overview of modern applications of transcriptome sequencing and popular tools and algorithms for exploring transcript reconstruction and expression analysis in a genome-free manner, leveraging the Trinity software and analysis framework. Attendees will perform quality assessment of Illumina RNA-Seq data, assemble a transcriptome using, among others, Trinity, quantify transcript expression, leverage Bioconductor tools for differential expression analysis, and apply Trinotate to functionally annotate transcripts. In parallel to the short-read assembly, participants will use 3rd generation sequencing data (PacBio IsoSeq) to perform a hybrid long-read short-read assembly, then compare the assemblies together. Additional methods will then be explored for characterizing the assembled transcriptome and revealing biological findings.

Intended audience

This workshop is aimed primarily at biologist researchers that have basic bioinformatics skills (basic knowledge of the linux command line interface is a must, basic knowledge of the R environment and language is strongly recommended) and are pursuing RNA-Seq projects in non-model organisms. Attendees will gain skills needed to successfully approach transcriptome sequencing, de novo transcriptome assembly, expression analysis, and functional annotation as applied to organisms lacking a high quality reference genome sequence. Attendees are also invited to bring a subset of their own data. Familiarity with Linux and R is essential for benefitting from the workshop.

Teaching format

The workshop will be delivered over the course of four and a half days, with each session entailing lectures followed by practical hands-on sessions. Most computing will be done on the cloud and attendees will use their own laptop computers with the Google Chrome web browser providing all the necessary interfaces to the cloud computing environment, including the linux command terminal. Attendees can also use the native terminal emulator of their Operating System (and ssh). This works natively for Linux, MAC and Windows 10 and 11. For Windows 7 users, installing Visual Studio Code (alternatively MobaXterm) is recommended.

Assumed background for the participants

Basic experience with the linux command-line is essential. Experience with the execution of bioinformatics tools using the linux command line will be helpful. Experience with the R environment and running R code (not writing!) also. No programming or scripting knowledge is required (but some is helpful). As a matter of fact, being able to write short bash scripts is one of the learning objectives.

Session content

Monday – Classes from 2-8 pm Berlin time - High Throughput Sequencing, 2nd generation

   14:00 - 14:30 - Welcome to the course!
   Daily Learning Objectives
   14:30 - 15:10 - High Throughput Sequencing Lecture
   15:10 - 15:30 - Data Preprocessing Workflow Lecture
   15:30 - 16:30 - Data pre-processing discussion
   16:30 - 17:00 - Data description
   17:10 - 19:40 - Hands-on practical data pre-processing
   19:40 - 20:00 - Feedback and Daily Assessment

Tuesday – Classes from 2-8 pm Berlin time - NGS data de-novo transcriptome assembly

   Daily Learning Objectives
   14:00 - 14:30 - Revision session
   14:30 - 16:30 - Hands-on practical data pre-processing
   Input QC tutorial
   16:30 - 19:40 - De-novo transcriptome assembly lecture and tutorial
   Input Transcriptome Assembly
   19:40 - 20:00 - Feedback and Daily Assessment

Wednesday – Classes from 2-8 pm Berlin time - Third(!) generation sequencing, de-novo transcriptome assembly, assessment and evalutation

   14:00 - 14:40 - De-novo transcriptome assembly lecture and tutorial
   Input Transcriptome Assembly
   - Data description
   Daily Learning Objectives
   14:40 - 15:00 - Revision session
   15:00 - 15:20 - De-novo transcriptome assembly lecture - 3rd generation sequencing!
   15:20 - 16:00 - De-novo transcriptome assembly - revisited
   16:10 - 17:40 - Assembly evaluation
   Transcriptome evaluation
   17:40 - 19:40 - Functional annotation
   Transcriptome annotation
   19:40 - 20:00 - Feedback and Daily Assessment

Thursday – Classes from 2-8 pm Berlin time - Transcript quantification and Differential Expression

   14:00 - 14:30 Transcriptome annotation
   Transdecoder
   Daily Learning Objectives
   14:30 - 15:00 - Revision session
   15:00 - 15:30 - Pseudo-alignment
   15:30 - 16:30 - R refresher and data science teaser
   16:30 - 19:15 - Exploratory Data Analysis (a.k.a. Biological Quality Assessment)
   19:15 - 19:40 - Git crash course
   19:40 - 20:00 - Feedback and Daily Assessment

Friday – Classes from 2-8 pm Berlin time - Differential expression analysis

   Daily Learning Objectives
   14:10 - 15:00 - Exploratory Data Analysis (a.k.a. Biological Quality Assessment)
   15:00 - 15:45 - Git crash course
   15:45 - 19:00 - Differential Expression analysis
   19:00 - 20:00 - Final discussion session and Feedback

Instructors

Dr. Nicolas Delhomme

Dr. Kristina Benevides

Cost overview

Package 1

530 €

Register now

Cancellation Policy:

> 30 days before the start date = 30% cancellation fee

< 30 days before the start date= No Refund.

Physalia-courses cannot be held responsible for any travel fees, accommodation or other expenses incurred to you as a result of the cancellation.