high performance computing on graphics processing units: hgpu.org


Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Piotr Pawliczek, Witold Dzwinel, David A. Yuen

University of Texas, Department of Biochemistry and Molecular Biology, Houston, TX 77030, USA

University of Texas, 2012

@article{pawliczek2012visualization,
  title={Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster},
  author={Pawliczek, P. and Dzwinel, W. and Yuen, D.A.},
  year={2012}
}


Multidimensional scaling (MDS) is a popular and reliable method for feature extraction and visualization of multidimensional data. The role of MDS is to reconstruct the topology of an original N-dimensional feature space consisting of M feature vectors in a target 2-D or 3-D Euclidean space. This is achieved by minimizing an error ("stress") function F(||D-d||), where D and d are the MxM dissimilarity matrices in the original and target spaces, respectively. However, the stress function is in general a multimodal, multidimensional function, and the complexity of finding its global minimum increases exponentially with the number of data points. We employ a robust heuristic based on the discrete particle method, which enables interactive visualization of data for various types of stress functions. Nevertheless, because its memory and time complexity are at least O(M^2), the method becomes computationally demanding when applied to interactive visualization of data sets with M~10^4 feature vectors. We present efficient parallel algorithms developed for various small and pre-medium computer architectures, from single and multi-core processors to GPUs and multiprocessor MPI clusters. The timings obtained show that the CUDA implementation of MDS on a PC equipped with a strong GPU board (Tesla M2050 or GeForce GTX 480) is twice as computationally efficient as its MPI equivalent run on 10 nodes (10 x 2 x Intel Xeon X5670 = 120 threads) of a professional multiprocessor cluster (HP SL390). We also show that the hybridized two-level MPI/CUDA implementation, run on a small cluster of GPU nodes, can additionally provide a linear speed-up.
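To make the stress minimization described in the abstract concrete, here is a minimal sketch in plain Python. It assumes one common form of the stress function, the raw stress (the sum over pairs i < j of (D_ij - d_ij)^2), and minimizes it with ordinary gradient descent. This is not the discrete particle heuristic used by the authors, and the names `pairwise_distances`, `stress`, and `mds_descent` are hypothetical, chosen only for illustration. The double loop over pairs shows where the O(M^2) cost per iteration comes from.

```python
import math
import random


def pairwise_distances(points):
    """Return the full M x M Euclidean distance matrix for a list of points."""
    m = len(points)
    dist = [[0.0] * m for _ in range(m)]
    for i in range(m):
        for j in range(i + 1, m):
            d = math.dist(points[i], points[j])
            dist[i][j] = dist[j][i] = d
    return dist


def stress(D, d):
    """Raw stress: sum over pairs i < j of (D_ij - d_ij)^2 (an assumed choice of F)."""
    m = len(D)
    return sum((D[i][j] - d[i][j]) ** 2 for i in range(m) for j in range(i + 1, m))


def mds_descent(D, dim=2, steps=500, lr=0.01, seed=0):
    """Plain gradient descent on raw stress (illustrative, not the paper's method)."""
    rng = random.Random(seed)
    m = len(D)
    # Random initial layout in the target space.
    y = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(m)]
    for _ in range(steps):
        grad = [[0.0] * dim for _ in range(m)]
        # Every iteration visits all M(M-1)/2 pairs: O(M^2) time.
        for i in range(m):
            for j in range(i + 1, m):
                diff = [y[i][k] - y[j][k] for k in range(dim)]
                dij = math.sqrt(sum(c * c for c in diff)) or 1e-12
                # d/dy_i of (D_ij - d_ij)^2 = -2 (D_ij - d_ij) (y_i - y_j) / d_ij
                coeff = -2.0 * (D[i][j] - dij) / dij
                for k in range(dim):
                    grad[i][k] += coeff * diff[k]
                    grad[j][k] -= coeff * diff[k]
        for i in range(m):
            for k in range(dim):
                y[i][k] -= lr * grad[i][k]
    return y


if __name__ == "__main__":
    rng = random.Random(1)
    # 12 feature vectors in a 5-dimensional space (synthetic data).
    data = [[rng.random() for _ in range(5)] for _ in range(12)]
    D = pairwise_distances(data)
    layout = mds_descent(D, dim=2, steps=300)
    print("final stress:", stress(D, pairwise_distances(layout)))
```

Because gradient descent only finds a local minimum of a multimodal function, the result depends on the random starting layout. That limitation is the reason the abstract gives for using a more robust heuristic.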

Tags: Algorithms, Computer science, CUDA, Data mining, MPI, Multidimensional scaling, nVidia, nVidia GeForce 8500 GT, nVidia GeForce 8800 Ultra, nVidia GeForce 9500 GT, nVidia GeForce 9800 GT, nVidia GeForce GT 330 M, nVidia GeForce GTX 260, nVidia GeForce GTX 460, nVidia GeForce GTX 480, Tesla M2050, Visualization

October 6, 2012 by hgpu
