Achieving accurate and context-sensitive timing for code optimization

dc.contributor.author: Whaley, R. Clint
dc.contributor.author: Castaldo, Anthony M.
dc.date.accessioned: 2023-10-24T15:10:26Z
dc.date.available: 2023-10-24T15:10:26Z
dc.date.issued: 2008-01-18
dc.description.abstract: Key computational kernels must run near their peak efficiency for most high performance computing (HPC) applications. Getting this level of efficiency has always required extensive tuning of the kernel on a particular platform of interest. The success or failure of an optimization is usually measured by invoking a timer. Understanding how to build reliable and context-sensitive timers is one of the most neglected areas in HPC, and this results in a host of HPC software that looks good when reported in papers, but which delivers only a fraction of the reported performance when used by actual HPC applications. In this paper we motivate the importance of timer design, and then discuss the techniques and methodologies we have developed in order to accurately time HPC kernel routines for our well-known empirical tuning framework, ATLAS.
dc.description.department: Computer Science
dc.description.sponsorship: This work was supported in part by National Science Foundation CRI grant SNS-0551504.
dc.identifier.uri: https://hdl.handle.net/20.500.12588/2136
dc.language.iso: en_US
dc.publisher: UTSA Department of Computer Science
dc.relation.ispartofseries: Technical Report; CS-TR-2008-001
dc.title: Achieving accurate and context-sensitive timing for code optimization
dc.type: Technical Report

Files

Original bundle

Name: Whaley_Castaldo_CS-TR-2008-001.pdf
Size: 278.73 KB
Format: Adobe Portable Document Format
