high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Image and Signal Processing » Signal processing » Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study

Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study

Ben Cope, Peter Y. K. Cheung, Wayne Luk, Lee Howes

Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2BT, UK

IEEE Transactions on Computers, Vol. 59, No. 4. (April 2010), pp. 433-448.

DOI:10.1109/TC.2009.179

@article{cope2010performance,

title={Performance comparison of graphics processors to reconfigurable logic: A case study},

author={Cope, B. and Cheung, P.Y.K. and Luk, W. and Howes, L.},

journal={Computers, IEEE Transactions on},

volume={59},

number={4},

pages={433–448},

issn={0018-9340},

year={2010},

publisher={IEEE}

}

Download (PDF)

View

Source

2420

views

A systematic approach to the comparison of the graphics processor (GPU) and reconfigurable logic is defined in terms of three throughput drivers. The approach is applied to five case study algorithms, characterized by their arithmetic complexity, memory access requirements, and data dependence, and two target devices: the nVidia GeForce 7900 GTX GPU and a Xilinx Virtex-4 field programmable gate array (FPGA). Two orders of magnitude speedup, over a general-purpose processor, is observed for each device for arithmetic intensive algorithms. An FPGA is superior, over a GPU, for algorithms requiring large numbers of regular memory accesses, while the GPU is superior for algorithms with variable data reuse. In the presence of data dependence, the implementation of a customized data path in an FPGA exceeds GPU performance by up to eight times. The trends of the analysis to newer and future technologies are analyzed.

Tags: FPGA, nVidia, nVidia GeForce 7900 GTX, OpenGL, Signal processing

November 5, 2010 by hgpu

No votes yet.

Please wait...