high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Piotr Pawliczek, Witold Dzwinel, David A. Yuen

University of Texas, Department of Biochemistry and Molecular Biology, Houston, TX 77030, USA

University of Texas, 2012

@article{pawliczek2012visualization,

title={Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster},

author={Pawliczek, P. and Dzwinel, W. and Yuen, D.A.},

year={2012}

}

Download (PDF)

View

Source

2895

views

Multidimensional scaling (MDS) is a very popular and reliable method used in feature extraction and visualization of multidimensional data. The role of MDS is to reconstruct the topology of an original N-dimensional feature space consisting of M feature vectors in target 2-D (3-D) Euclidean space. It can be achieved by minimization of the error – "stress" function – F(||D-d||), where D and d are the MxM dissimilarity matrices in the original and in the target spaces, respectively. However, the stress function is in general a multimodal and multidimensional function for which the complexity of finding global minimum increases exponentially with the number of data. We employ here a robust heuristics based on discrete particle method enabling interactive visualization of data for various types of stress functions. Nevertheless, due to at least O(M^2) memory and time complexity, the method becomes computationally demanding when applied for interactive visualization of data sets involving M~10^4. We present here efficient parallel algorithms developed for various small and pre-medium computer architectures from single and multi-core processors to GPU and multiprocessor MPI clusters. The timings obtained show that the computational efficiency of CUDA implementation of MDS on a PC equipped with a strong GPU board (Tesla M2050 or GeForce 480) is two times greater than its MPI equivalent run on 10 nodes (10x 2xIntel Xeon X5670 = 120 threads) of a professional multiprocessor cluster (HP SL390). We show also that the hybridized two-level MPI/CUDA implementation run on a small cluster of GPU nodes can additionally provide a linear speed-up.

Tags: Algorithms, Computer science, CUDA, Data mining, MPI, Multidimensional scaling, nVidia, nVidia GeForce 8500 GT, nVidia GeForce 8800 Ultra, nVidia GeForce 9500 GT, nVidia GeForce 9800 GT, nVidia GeForce GT 330 M, nVidia GeForce GTX 260, nVidia GeForce GTX 460, nVidia GeForce GTX 480, Tesla M2050, Visualization

October 6, 2012 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org

Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Your response

Recent source codes

RepoLaunch: Automating Build and Test Pipeline of Code Repositories on ANY Language and ANY Platform

RepoLaunch: Automating Build and Test Pipeline of Code Repositories on ANY Language and ANY Platform

CONCUR: a benchmark designed to evaluate multithreaded Java code generated by LLMs

HIPRT: Ray Tracing using HIP

MXFP4 Training Support Codebase

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

CL4SE: A Context Learning Benchmark For Software Engineering Tasks

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Most viewed papers (last 30 days)

Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)