high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » The GPU Enhanced Parallel Computing for Large Scale Data Clustering

The GPU Enhanced Parallel Computing for Large Scale Data Clustering

Xiaohui Cui, Jesse St. Charles, Justin Beaver, Thomas E. Potok

Oak Ridge National Laboratory, Oak Ridge, TN 37831

International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC), 2011

DOI:10.1109/CyberC.2011.44

BibTeX

Download (PDF)

View

Source

1937

views

Analyzing and clustering large scale data set is a complex problem. One explored method of solving this problem borrows from nature, imitating the flocking behavior of birds. One limitation of this method of data clustering is its complexity O(n^2). As the number of data and feature dimensions grows, it becomes increasingly difficult to generate results in a reasonable amount of time. In the last few years, the graphics processing unit (GPU) has received attention for its ability to solve highly-parallel and semi-parallel problems much faster than the traditional sequential processor. In this chapter, we have conducted research to exploit this architecture and apply its strengths to the flocking based data clustering problem. Using the CUDA platform from NVIDIA, we developed a Multiple Species Data Flocking implementation to be run on the NVIDIA GPU. Performance gains ranged from 30 to 60 times improvement of the GPU over the CPU implementation.

Tags: Clustering, Computer science, CUDA, nVidia, nVidia GeForce 8800 GTX

January 24, 2012 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

The GPU Enhanced Parallel Computing for Large Scale Data Clustering

Your response

Recent source codes

GEAK-agent: LLM-based AI agent, which can write correct and efficient GPU kernels automatically

OpenDwarfs 2025: re-engineered version of the OpenDwarfs benchmark suite, for compatibility with modern platforms

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

Most viewed papers (last 30 days)

The GPU Enhanced Parallel Computing for Large Scale Data Clustering

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)