Semi-automatic tool to ease the creation and optimization of GPU programs

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

  • Jacob Jepsen

We present a tool that reduces the development time of GPU-executable code. We implement a catalogue of common optimizations specific to the GPU architecture. Through the tool, the programmer can semi-automatically transform a computationally-intensive code section into GPU-executable form and apply optimizations thereto. Based on experiments, the code generated by the tool can be 3-256X faster than code generated by an OpenACC compiler, 4-37X faster than optimized CPU code, and attain up to 25% of peak performance of the GPU. We found that by using pattern-matching rules, many of the transformations can be performed automatically, which makes the tool usable for both novices and experts in GPU programming.

Original languageEnglish
Title of host publicationProceedings of the 43rd International Conference on Parallel Processing Workshops : ICPPW 2014
Number of pages10
PublisherIEEE
Publication date2014
Pages196-205
ISBN (Print)978-1-4799-5615-9
DOIs
Publication statusPublished - 2014
Event43rd International Conference on Parallel Processing Workshops, ICPPW 2014 - Minneapolis, United States
Duration: 9 Sep 201412 Sep 2014

Conference

Conference43rd International Conference on Parallel Processing Workshops, ICPPW 2014
LandUnited States
ByMinneapolis
Periode09/09/201412/09/2014

ID: 162745676