(in Polish) High-Performance Computing for Artificial Intelligence 2 4010-HPAI
Artificial Intelligence (AI) has become a widely used tool in scientific work. Not only does AI enable advanced data analysis and visualization but it can also replace traditional numerical simulations or speed them up by making accurate predictions of interactions in a system. Additionally, AI allows for quick searching of scientific publication databases and generating article summaries. It is also being used more frequently for the full writing of scientific papers. However, the development of more sophisticated AI models requires the processing of huge amounts of data and the costly, time-consuming process of training AI models. This is especially evident with large language models (LLMs), where the training process can take weeks on thousands of the most powerful GPUs. Working with such complex models requires the use of large computing power: computing clusters or a cloud, which is the domain of High-Performance Computing (HPC).
HPC is a field of computer science dedicated to solving the most demanding computational problems in science. Top well-known applications include: (i) predicting the properties of semiconductors using quantum mechanical calculations with Density Functional Theory (DFT), (ii) drug design through molecular dynamics simulations, and (iii) numerical weather forecasting. Such calculations require hundreds or even thousands of CPUs. The key element of HPC is parallel computing. It allows calculations to be sped up by using many CPUs or GPUs. It also helps reduce the memory needed for each single computing core. For many years, parallel computing has been essential for scientific calculations, and now it is a crucial technique for AI. This is why the techniques, methodologies, and algorithms developed for HPC tasks are now also being applied to the training of AI models.
This lecture will focus on three main aspects:
1. What HPC is, how to work with computing clusters, and how parallel computing can be used in scientific research?
2. How to speed up Python code execution, with particular focus on mathematical operations, and how to parallelize these operations or use computational accelerators (like GPUs).
3. How to improve the efficiency of AI model training and apply parallel programming techniques to AI model training.
The session will be conducted in a workshop format, allowing participants to gain both theoretical knowledge and practical skills. Moreover, practical sessions will reinforce the acquired theoretical knowledge and provide hands-on experience. The exercises will be conducted on an HPC computer used by leading research teams in Poland.
Prerequisites (description)
Course coordinators
Type of course
Mode
Learning outcomes
W1 - Knows the basic computer science and machine learning terminology and understands its sources and applications in related scientific disciplines [K_W02}
W2 - Understands the role of High-Performance Computing in Artificial Intelligence [K_W03]
W3 - Has basic knowledge of the place of time series analysis and anomaly detection in the system of sciences and of its subject-related and methodological connections to other scientific disciplines [K_W04]
W4 - Knows the HPC tools used in AI and understands the advantages and disadvantages of individual research methods [K_W05]
W5 - Knows what a programming language is and understands how it can be used [K_W07]
U1 - Is able to use basic terms in HPC and AI [K_U11]
U2 - Is able to prepare a short scientific report [K_U09]
U3 - IIs able to use telecommunications tools for effective collaboration and is able to plan and implement numerical simulations for a selected application field of time series analysis and anomaly detection, analyze their results, and draw conclusions [K_U13]
U4 - Knows the principles of health and safety of working with a computer and is able to perform basic economic analysis of the calculations carried out [K_U14]
U5 - Can write simple queue scripts for ML model training [K_U11]
U6 - Can evaluate the effectiveness of machine learning calculations [K_U10]
K1 - Independently plans systematically, and timely implements research goals, making difficult decisions when necessary [K_K08, K_K09]
K2 - Is able to participate in substantive discussions, is able to work out compromises, and define a common position [K_K03, K_K05}
K3 - Is able to behave politely during a discussion without offending other participants [KK10]
Assessment criteria
Learning outcomes K1-K3 are assessed continuously during classes.
Learning outcomes U1-U6 are assessed continuously during laboratory classes.
Learning outcomes W1-W6 are assessed in a written exam, where students also receive their final grade.
Practical placement
Not applicable.
Additional information
Additional information (registration calendar, class conductors, localization and schedules of classes), might be available in the USOSweb system: