high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Jan Masek, Radim Burget, Lukas Povoda, Malay Kishore Dutta

BurgSys, a.s., Hnevkovskeho 30/65, 617 00 Brno, Czech Republic

International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems, Vol 5, No 2, 2016

DOI:10.11601/ijates.v5i2.142

@article{masek2016multi,

title={Multi–GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL},

author={Masek, Jan and Burget, Radim and Povoda, Lukas and Dutta, Malay Kishore},

journal={International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems},

volume={5},

number={2},

pages={101–107},

year={2016}

}

Download (PDF)

View

Source

1562

views

Using modern Graphic Processing Units (GPUs) becomes very useful for computing complex and time consuming processes. GPUs provide high-performance computation capabilities with a good price. This paper deals with a multi-GPU OpenCL and CUDA implementations of k-Nearest Neighbor (k-NN) algorithm. This work compares performances of OpenCLand CUDA implementations where each of them is suitable for different number of used attributes. The proposed CUDA algorithm achieves acceleration up to 880x in comparison witha single thread CPU version. The common k-NN was modified to be faster when the lower number of k neighbors is set. The performance of algorithm was verified with two GPUs dual-core NVIDIA GeForce GTX 690 and CPU Intel Core i7 3770 with 4.1 GHz frequency. The results of speed up were measured for one GPU, two GPUs, three and four GPUs. We performed several tests with data sets containing up to 4 million elements with various number of attributes.

Tags: Algorithms, Computer science, CUDA, Machine learning, Nearest neighbour, nVidia, nVidia GeForce GTX 690, OpenCL

June 14, 2016 by hgpu

Rating: 1.5/5. From 2 votes.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

Multi-GPU Implementation of Machine Learning Algorithm using CUDA and OpenCL

Share this:

Recent source codes

Most viewed papers (last 30 days)