Frameworks for GPU Accelerators: a Comprehensive Evaluation Using 2D/3D Image Registration
This is the author’s version of the work. The definitive work was published in Proceedings of the 9th IEEE Symposium on Application Specific Processors (SASP), San Diego, CA, USA, June 5-6, 2011. Frameworks for GPU Accelerators: A Comprehensive Evaluation using 2D/3D Image Registration Richard Membarth, Frank Hannig, and Jürgen Teich Mario Körner and Wieland Eckert Department of Computer Science, Siemens Healthcare Sector, H IM AX, University of Erlangen-Nuremberg, Germany. Forchheim, Germany. {richard.membarth,hannig,teich}@cs.fau.de {mario.koerner,wieland.eckert}@siemens.com for graphics cards without bothering about low-level hardware Abstract—In the last decade, there has been a dramatic growth details. in research and development of massively parallel many-core Compared to the predominant task parallel processing architectures like graphics hardware, both in academia and industry. This changed also the way programs are written in model on standard shared memory multi-core processors, order to leverage the processing power of a multitude of cores where each core processes independent tasks, a data parallel on the same hardware. In the beginning, programmers had to processing model is used for many-core processing. The main use special graphics programming interfaces to express general source for data parallelism arises out of inherent parallelism purpose computations on graphics hardware. Today, several within loops, distributing the computation of different loop frameworks exist to relieve the programmer from such tasks. In this paper, we present five frameworks for parallelization on iterations to the available processing cores. Parallelization GPU Accelerators, namely RapidMind, PGI Accelerator, HMPP frameworks assist the programmer in extracting this kind of Workbench, OpenCL, and CUDA.
[Show full text]