Semi-automatic tool to ease the creation and optimization of GPU programs

Publikation: Bidrag til bog/antologi/rapportKonferencebidrag i proceedingsForskningfagfællebedømt

  • Jacob Jepsen

We present a tool that reduces the development time of GPU-executable code. We implement a catalogue of common optimizations specific to the GPU architecture. Through the tool, the programmer can semi-automatically transform a computationally-intensive code section into GPU-executable form and apply optimizations thereto. Based on experiments, the code generated by the tool can be 3-256X faster than code generated by an OpenACC compiler, 4-37X faster than optimized CPU code, and attain up to 25% of peak performance of the GPU. We found that by using pattern-matching rules, many of the transformations can be performed automatically, which makes the tool usable for both novices and experts in GPU programming.

OriginalsprogEngelsk
TitelProceedings of the 43rd International Conference on Parallel Processing Workshops : ICPPW 2014
Antal sider10
ForlagIEEE
Publikationsdato2014
Sider196-205
ISBN (Trykt)978-1-4799-5615-9
DOI
StatusUdgivet - 2014
Begivenhed43rd International Conference on Parallel Processing Workshops, ICPPW 2014 - Minneapolis, USA
Varighed: 9 sep. 201412 sep. 2014

Konference

Konference43rd International Conference on Parallel Processing Workshops, ICPPW 2014
LandUSA
ByMinneapolis
Periode09/09/201412/09/2014

ID: 162745676