26th-29th June 2023
To foster international participation, this course will be held online
This course will introduce methods and approaches to analyse longitudinal data, i.e. data which are repeated in time or space (or any other dimensions, for that matter!). Longitudinal data
present specific challenges in all aspects of processing and analysis, from visualization to exploratory data analysis, to modelling and validation. The course will outline the main challenges
related to dealing with longitudinal data, from classical statistical and machine learning perspectives. Specific areas that will be covered include forecasting (prediction of time-series
data), epidemiology and gene-expression experiments.
The course is structured in modules over four days. Each day will include lectures with class discussions of key concepts and practical hands-on sessions with collaborative exercises where
students will interact with the whole class and instructors to apply the acquired skills. After and during each exercise, results will be interpreted and discussed. At the end of the course, a
Kahoot quiz will be taken together to recap and highlight the most important concepts covered, and there will be room to discuss specific research problems and questions from participants.
The course is aimed at advanced students, researchers and professionals interested in learning what longitudinal data are and how to analyze them in the context of real life applications in
biology. It will include information useful for both absolute beginners and more advanced users willing to delve into some aspects of the implementation of longitudinal models and scripting code.
We will start by introducing the general concepts and approaches to deal with longitudinal data; we will then explore applications to specific scientific domains where longitudinal data are
common: forecasting, epidemiology, gene expression.
Attendees are expected to have a background in biology and the research problems involving prediction, inference, pattern discovery; previous exposure to inferential and predictive experiments
would be beneficial. There will be a mix of lectures and hands-on practical exercises using mainly R/Python, Markdown/Jupyter Notebooks and the Linux command line. Some basic understanding of R
programming and the Linux environment will be advantageous, but is not required.
At the end of the course the student will have an understanding of:
Monday– Classes from 2-8 PM Berlin time
- Longitudinal data: examples and challenges
- The classical statistical perspective
- Models to analyse data with repeated records over time (multiple time points) and space (multiple locations)
Tuesday– Classes from 2-8 PM Berlin time
- The machine-learning perspective
- Cross-validation with temporal and spatial data structures
- Forecasting
Wednesday– Classes from 2-8 PM Berlin time
The use of Longitudinal-based designs in Epidemiology:
1 - times series and outbreak detection
2- disease incidence
3- effects of environmental factors
Thursday– Classes from 2-8 PM Berlin time
RNA-seq Real Time Series:
1 - Factorial Time Course Experiments
2 - Single Transient Time Course Experiment
3 - Circadian rhythmic data and cell cycle data
Final recap quiz (kahoot)
Discussing your own research problems and wrap-up discussion
Should you have any further questions, please send an email to info@physalia-courses.org
Cancellation Policy:
> 30 days before the start date = 30% cancellation fee
< 30 days before the start date= No Refund.
Physalia-courses cannot be held responsible for any travel fees, accommodation or other expenses incurred to you as a result of the cancellation.