high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Bridging the GPGPU-FPGA efficiency gap

Bridging the GPGPU-FPGA efficiency gap

Christopher W. Fletcher, Ilia A. Lebedev, Narges B. Asadi, Daniel R. Burke, John Wawrzynek

Massachusetts Institute of Technology, Cambridge, MA, and University of California at Berkeley, Berkeley, CA, USA

In Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays (2011), pp. 119-122.

DOI:10.1145/1950413.1950439

@conference{fletcher2011bridging,

title={Bridging the GPGPU-FPGA efficiency gap},

author={Fletcher, C.W. and Lebedev, I.A. and Asadi, N.B. and Burke, D.R. and Wawrzynek, J.},

booktitle={Proceedings of the 19th ACM/SIGDA international symposium on Field programmable gate arrays},

pages={119–122},

year={2011},

organization={ACM}

}

Source

1522

views

This paper compares an implementation of a Bayesian inference algorithm across several FPGAs and GPGPUs, while embracing both the execution model and high-level architecture of a GPGPU. Our study is motivated by recent work in template-based programming and architectural models for FPGA computing. The comparison we present is meant to demonstrate the FPGA’s potential, while constraining the design to follow the microarchitectural template of more programmable devices such as GPGPUs. The FPGA implementation proves capable of matching the performance of a high-end Nvidia Fermi-based GPU – the most advanced GPGPU available to us at the time of this study. Further investigation shows that each FPGA core outperforms workstation GPGPU cores by a factor of ~ 3.14x, and mobile GPGPU cores by ~ 4.25x despite a ~ 4x reduction in core clock frequency. Using these observations, we discuss the efficiency gap between these two platforms, and the challenges associated with template-based programming models.

Tags: Algorithms, Bayesian, Computer science, FPGA, Heterogeneous systems, nVidia, OpenCL

March 22, 2011 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org

Bridging the GPGPU-FPGA efficiency gap

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)

Bridging the GPGPU-FPGA efficiency gap

Share this:

Recent source codes

Most viewed papers (last 30 days)