Skip to content

Schedule

DAY 1 - Wednesday 29/05
09:00 CEST
10:00 EEST
Welcome and Introduction
Presenters: Jørn Dietze (LUST) and Christian Schou Oxvig (LUST and DeiC)
09:15 CEST
10:15 EEST
Introduction to LUMI
Presenter: Jørn Dietze (LUST)
09:45 CEST
10:45 EEST
Using the LUMI web interface
Presenters: Mats Sjöberg (CSC) and Lukas Prediger (CSC)
10:05 CEST
11:05 EEST
Hands-on: Run a simple PyTorch example notebook
10:35 CEST
11:35 EEST
Break (25 minutes)
10:50 CEST
11:50 EEST
Your first AI training job on LUMI
Presenters: Mats Sjöberg (CSC) and Lukas Prediger (CSC)
11:20 CEST
12:20 EEST
Hands-on: Run a simple single-GPU PyTorch AI training job
12:05 CEST
13:05 EEST
Lunch break (45 minutes)
12:50 CEST
13:50 EEST
Understanding GPU activity & checking jobs
Presenter: Samuel Añtao (AMD)
13:10 CEST
14:10 EEST
Hands-on: Checking GPU usage interactively using rocm-smi
13:30 CEST
14:30 EEST
Running containers on LUMI
Presenter: Christian Schou Oxvig (LUST & DeiC)
13:50 CEST
14:50 EEST
Hands-on: Pull and run a container
14:50 CEST
15:50 EEST
Break (15 minutes)
14:25 CEST
15:25 EEST
Building containers from conda/pip environments
Presenter: Christian Schou Oxvig (LUST & DeiC)
14:45 CEST
15:45 EEST
Hands-on: Creating a conda environment file and building a container using cotainr
15:05 CEST
16:05 EEST
Extending containers with virtual environments for faster testing
Presenter: Gregor Decristoforo (LUST)
15:25 CEST
16:25 EEST
Getting started with your own project
16:25 CEST
17:25 EEST
End of the course day
DAY 2 - Thursday 30/05
09:00 CEST
10:00 EEST
Scaling AI training to multiple GPUs
Presenters: Mats Sjöberg (CSC) and Lukas Prediger (CSC)
09:30 CEST
10:30 EEST
Hands-on: Converting the PyTorch single GPU AI training job to use all GPUs in a single node via DDP
10:00 CEST
11:00 EEST
Hyper-parameter tuning using Ray on LUMI
Presenter: Gregor Decristoforo (LUST)
10:20 CEST
11:20 EEST
Hands-on: Hyper-parameter tuning the PyTorch model using Ray
10:40 CEST
11:40 EEST
Break (15 minutes)
10:55 CEST
11:55 EEST
Extreme scale AI
Presenter: Samuel Añtao (AMD)
11:25 CEST
12:25 EEST
Demo/Hands-on: Using multiple nodes
11:45 CEST
12:45 EEST
Loading training data from Lustre and LUMI-O
Presenter: Harvey Richardson (HPE)
12:00 CEST
13:00 EEST
Lunch break (60 minutes)
13:00 CEST
14:00 EEST
Coupling machine learning with HPC simulation
Presenter: Harvey Richardson (HPE)
13:30 CEST
14:30 EEST
Advancing your own project
16:00 CEST
17:00 EEST
End of the course day