High Performance Computing  I (WS14/15)

We will start with an introduction into basic principles and algorithms of parallel computing followed by transferring selected algorithms onto many-core architectures. We will focus first on NVIDIA-GPUs using the CUDA library, even on multiple GPUs. The students will compare the performance of the algorithms on the GPU with performance on multi-core CPUs using OpenMP. Recent Compilers for OpenACC supporting GPU programming will available for testing and benchmarking. A similar programming scheme can be used for the core accelerator card Xeon Phi by using Intel compilers and OpenMP 4.0.
A tech-report has to be produced until the end of the term.

Course Material : Follow the link. See templates.

Compilers (new):

Prof. Gundolf Haase, Heinrichstr. 36, Zi 506, Tel. 5178,

:  Monday 10:15 - 11:45 in Heinrichstr. 36, SR 11.33
Projects: WS 15/16, WS 14/15, WS13/14, WS12/13WS11/12, WS10/11
Hardware (login from outside KFU only via  VPN):
Material for CUDA:
Material for OpenACC (PGI):

for CUDA:
Material for OpenCL:
Further Links

Document made with KompoZer  06.04.2016