Programming Multi-Core and Many-Core Systems

Table of Contents

Exam review notes

These are the notes I wrote while revising, sort of a summary of everything.

Optimizing code on a single thread CPU:

In-core parallelism: SIMD vectorization

OpenMP (-fopenmp)

Pthreads (-pthread)

Common pitfalls

GPU programming

CUDA: programming GPUs

Execution model

Memory spaces

Shared variables

Consistency & synchronization

CUDA streams

CUDA events