AI for Genomics: From CNNs and LSTMs to Transformers

Dates

7-9 April 2026

 

To foster international participation, this course will be held online

Course overview

This course explores the application of modern AI architectures—Convolutional Neural Networks (CNNs), Long Short-Term Memory networks (LSTMs), and Transformers—to genomic and metagenomic data. Students will gain practical experience through hands-on coding labs and interactive notebooks, learning how to model sequence data, extract biologically meaningful features, and interpret results. Emphasis is placed on real-world applications, including prediction of genomic functional elements, sequence classification and source tracking, as well as biological sequence generation.

Target audience and assumed background

•    Basic knowledge of molecular biology and genomics (e.g., genetic variation)
•    Familiarity with Python/R programming

Learning outcomes

By the end of this course, participants will be able to:


•    Understand and implement LSTM and CNN architectures for genomic sequence data
•    Apply attention mechanisms to improve genomic feature extraction and prediction
•    Train simple Transformer models for sequence classification or functional element prediction
•    Use notebooks to run and modify ML & DL workflows for genomics research
•    Interpret model outputs and assess performance using biological context

Program

Day 1:14:00–20:00 (Berlin time)

  • Introduction to artificial neural networks and concepts of convolution and recurrence.
  • Encoding genomics data for CNNs and LSTMs in Keras and TensorFlow.

Day 2: 14:00–20:00 (Berlin time)

  • Training CNNs and LSTMs for sequence classification and functional element prediction.

  • Applying NLP concepts to genomics data: bag-of-words and Word2Vec models.

     

Day 3: 14:00–20:00 (Berlin time)

  • Metagenomic source tracking with CNNs: microbial community sequence annotation.

  • Biological sequence generation with attention, transformer model for genomics applications.

     

Instructor

Nikolay Oskolkov, PhD 

Group Leader (PI) at LIOS, Riga, Latvia


Cost overview

Package 1

 

450 €

 


what people say about this course -2nd edition

"The course was concise, clear, and very well structured, with highly relevant insights. I especially appreciated the focus on building scripts from scratch, which really helped me understand the methods in depth. The real-world examples were extremely valuable, and I feel I am leaving with many new ideas and tools to explore further. Overall, an excellent learning experience. Thank you very much!"

 

 

related courses

1- Introduction to Python for biologists - ONLINE, 9-12 February

 

2- Machine Learning with R - ONLINE, 16-20 February

 

3 - Advanced Python for Data Science and Bioinformatics - ONLINE, 23-26 March

 

4 - Genomic Data Visualisation with Python - ONLINE, 22-24 June

 

5 - AI-Powered Python for Bioinformatics - ONLINE, 1-2 July

 

Should you have any further questions, please send an email to [email protected]

Cancellation Policy:

 

> 30  days before the start date = 30% cancellation fee

< 30 days before the start date= No Refund.

 

Physalia-courses cannot be held responsible for any travel fees, accommodation or other expenses incurred to you as a result of the cancellation.