Semi-automatic tool to ease the creation and optimization of GPU programs
Publication: Contribution to book/anthology/report › Conference contribution in proceedings › Research › peer-reviewed
We present a tool that reduces the development time of GPU-executable code. We implement a catalogue of common optimizations specific to the GPU architecture. Through the tool, the programmer can semi-automatically transform a computationally intensive code section into GPU-executable form and apply optimizations to it. In our experiments, the code generated by the tool is 3-256X faster than code generated by an OpenACC compiler and 4-37X faster than optimized CPU code, and it attains up to 25% of the GPU's peak performance. We found that, by using pattern-matching rules, many of the transformations can be performed automatically, which makes the tool usable for both novices and experts in GPU programming.
Original language | English |
---|---|
Title | Proceedings of the 43rd International Conference on Parallel Processing Workshops : ICPPW 2014 |
Number of pages | 10 |
Publisher | IEEE |
Publication date | 2014 |
Pages | 196-205 |
ISBN (Print) | 978-1-4799-5615-9 |
DOI | |
Status | Published - 2014 |
Event | 43rd International Conference on Parallel Processing Workshops, ICPPW 2014 - Minneapolis, USA. Duration: 9 Sep 2014 → 12 Sep 2014 |
Conference
Conference | 43rd International Conference on Parallel Processing Workshops, ICPPW 2014 |
---|---|
Country | USA |
City | Minneapolis |
Period | 09/09/2014 → 12/09/2014 |
ID: 162745676