Date of Award

9-30-2014

Document Type

Thesis

Degree Name

Computer Science, MS

First Advisor

Hai Jiang

Committee Members

Hung-Chi Su; Jeff Jenness; Xiuzhen Huang

Call Number

LD 251 .A566t 2014 G74

Abstract

To achieve high performance parallel computing, the graphic processing unit (GPU) plays a critical role. NVIDIA invented CUDA as a parallel processing platform and programming model in the late 1990s. With CUDA, we can directly use GPU with C, C++, Fortran, Java or Python code by NVCC compiler. We introduced checkpoint/restart scheme and computation states migration strategy for fault tolerance. Checkpoint/Restart scheme is used to save all the computation state in run-time for later restoration if necessary. Migrating computation state is the process of moving computation states from one heavily loaded host to a lightly loaded host for load balancing and load sharing. This thesis focuses on the implementations of constructing computation states including local variables, execution counter and application-level stack structures in GPU, achieving GPU and CPU communication and migrating computation state from one machine to another through the support of a run-time module.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.