Semi-automatic tool to ease the creation and optimization of GPU programs
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
We present a tool that reduces the development time of GPU-executable code. We implement a catalogue of common optimizations specific to the GPU architecture. Through the tool, the programmer can semi-automatically transform a computationally-intensive code section into GPU-executable form and apply optimizations thereto. Based on experiments, the code generated by the tool can be 3-256X faster than code generated by an OpenACC compiler, 4-37X faster than optimized CPU code, and attain up to 25% of peak performance of the GPU. We found that by using pattern-matching rules, many of the transformations can be performed automatically, which makes the tool usable for both novices and experts in GPU programming.
Original language | English |
---|---|
Title of host publication | Proceedings of the 43rd International Conference on Parallel Processing Workshops : ICPPW 2014 |
Number of pages | 10 |
Publisher | IEEE |
Publication date | 2014 |
Pages | 196-205 |
ISBN (Print) | 978-1-4799-5615-9 |
DOIs | |
Publication status | Published - 2014 |
Event | 43rd International Conference on Parallel Processing Workshops, ICPPW 2014 - Minneapolis, United States Duration: 9 Sep 2014 → 12 Sep 2014 |
Conference
Conference | 43rd International Conference on Parallel Processing Workshops, ICPPW 2014 |
---|---|
Land | United States |
By | Minneapolis |
Periode | 09/09/2014 → 12/09/2014 |
ID: 162745676