Accelerating scientific applications on reconfigurable computing systems
dc.contributor.advisor | Lo, Chia-Tien Dan | |
dc.contributor.advisor | Psarris, Kleanthis | |
dc.contributor.author | Tai, Yi-Gang | |
dc.contributor.committeeMember | Robbins, Kay A. | |
dc.contributor.committeeMember | Tosun, Ali Saman | |
dc.contributor.committeeMember | Zhang, Weining | |
dc.date.accessioned | 2024-03-08T15:44:09Z | |
dc.date.available | 2024-03-08T15:44:09Z | |
dc.date.issued | 2011 | |
dc.description | This item is available only to currently enrolled UTSA students, faculty or staff. To download, navigate to Log In in the top right-hand corner of this screen, then select Log in with my UTSA ID. | |
dc.description.abstract | Advances in multi-core, many-core, and heterogeneous computing systems have created numerous possibilities of parallelization and hardware acceleration. With their exibility and abundant logic resources, reconfigurable computing systems, in particular systems based on field-programmable gate arrays (FPGAs), have become an attractive option as hardware accelerators. This dissertation studies acceleration of QR and LU matrix decompositions on FPGA-based reconfigurable computing systems, where there are few solutions for scalable floating-point matrix decompositions. First, exploring experiments are presented to reveal the characteristics regarding different embedded processor cores and system configurations on an FPGA-based system. Next, a vector reduction method termed delayed buffering is proposed. With its low latency and high operator pipeline utilization, the method accelerates matrix decomposition by improving composing vector reduction computation. Finally, with the delayed buffering reduction incorporated, using an enhanced tiled matrix decomposition algorithm to access off-chip memory and parallelizing the main decomposition loop for on-chip computation allow a single FPGA to perform better than two general-purpose processors plus a graphics processing unit (GPU) for matrix decomposition of size limited by the capacity of off-chip memory. | |
dc.description.department | Computer Science | |
dc.format.extent | 100 pages | |
dc.format.mimetype | application/pdf | |
dc.identifier.isbn | 9781124877112 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12588/5682 | |
dc.language | en | |
dc.subject | FPGA | |
dc.subject | hardware acceleration | |
dc.subject | LU decomposition | |
dc.subject | QR decomposition | |
dc.subject | reconfigurable computing system | |
dc.subject | vector reduction | |
dc.subject.classification | Computer science | |
dc.title | Accelerating scientific applications on reconfigurable computing systems | |
dc.type | Thesis | |
dc.type.dcmi | Text | |
dcterms.accessRights | pq_closed | |
thesis.degree.department | Computer Science | |
thesis.degree.grantor | University of Texas at San Antonio | |
thesis.degree.level | Doctoral | |
thesis.degree.name | Doctor of Philosophy |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Tai_utsa_1283D_10621.pdf
- Size:
- 745.7 KB
- Format:
- Adobe Portable Document Format