Algorithm Acceleration Study
Step 1 : Introduction to GPGPU/CUDA
Step 1 : Introduction to GPGPU/CUDA
Overview of GPGPU/CUDA
Overview of GPGPU/CUDA
Thread and Execution Model
Thread and Execution Model
Step 2 : CUDA Memory Model
Step 2 : CUDA Memory Model
CUDA memory hierarchy
CUDA memory hierarchy
Memory model & Performance
Memory model & Performance
Using shared memory
Using shared memory
Step 3 : Maximizing Memory Throughput
Step 3 : Maximizing Memory Throughput
Global memory
Global memory
Shared memory
Shared memory
Step 4 : Synchronization & Concurrent Execution
Step 4 : Synchronization & Concurrent Execution
Synchronization
Synchronization
CUDA stream & Concurrent execution
CUDA stream & Concurrent execution
CUDA Event
CUDA Event
Step 5 : Algorithm Implementation and Acceleration
Step 5 : Algorithm Implementation and Acceleration
Algorithm Implementation with MATLAB
Algorithm Implementation with MATLAB
Algorithm Implementation with C/C++
Algorithm Implementation with C/C++
Algorithm Acceleration with GPGPU/CUDA
Algorithm Acceleration with GPGPU/CUDA