Exploiting CUDA acceleration and impact of data transfer overhead on heterogeneous system

dc.contributor.advisorLee, Byeong Kil
dc.contributor.authorAhmed, Fahian
dc.contributor.committeeMemberJohn, Eugene
dc.contributor.committeeMemberLin, Wei-Ming
dc.date.accessioned2024-01-25T19:32:09Z
dc.date.available2024-01-25T19:32:09Z
dc.date.issued2012
dc.descriptionThis item is available only to currently enrolled UTSA students, faculty or staff. To download, navigate to Log In in the top right-hand corner of this screen, then select Log in with my UTSA ID.
dc.description.abstractWith the introduction of many-core GPUs, there is widespread interest in using GPUs to accelerate non-graphics applications such as bioinformatics, energy, finance and several research areas. Even though the GPUs provide highly parallel processing capability, the interface between CPU and GPU could be a performance bottleneck due to heavy data transfer. To investigate this bottleneck and find out the solutions, hotspot analysis based partial GPU acceleration is studied and performance impact on data size variation is observed as well. Here, I architecturally characterize some basic kernels and genetic applications, and also investigate performance hotspot functions in HMMER 3.0 and LavaMD. HMMER is an application whose main use is searching sequence databases for homologs of protein sequences. LavaMD is a molecular dynamics application and it can be found in Rodinia benchmark suits. Individual hotspot acceleration and full GPU acceleration are compared in terms of performance (speedup). In some cases, if data transfer time is overwhelming the computation time on GPU, it would be better keep the computation on CPU instead of using GPUs. Thus, I aim to observe the borderline between CPU vs. GPU performance as well as the effects of using different types of memory. For this observation I use a security application name blowfish with different input size. The Security application includes several common algorithms for data encryption, decryption and hashing. Among others blowfish is one of the popular security applications. Based on workload characterization and bottleneck analysis, I provide optimization methodologies to remove the bottleneck.
dc.description.departmentElectrical and Computer Engineering
dc.format.extent69 pages
dc.format.mimetypeapplication/pdf
dc.identifier.isbn9781267343185
dc.identifier.urihttps://hdl.handle.net/20.500.12588/2368
dc.languageen
dc.subject.classificationComputer engineering
dc.titleExploiting CUDA acceleration and impact of data transfer overhead on heterogeneous system
dc.typeThesis
dc.type.dcmiText
dcterms.accessRightspq_closed
thesis.degree.departmentElectrical and Computer Engineering
thesis.degree.grantorUniversity of Texas at San Antonio
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Ahmed_utsa_1283M_10796.pdf
Size:
1.51 MB
Format:
Adobe Portable Document Format