high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Heterogeneous Distributed Big Data Clustering on Sparse Grids

Heterogeneous Distributed Big Data Clustering on Sparse Grids

David Pfander, Gregor Daiss, Dirk Pfluger

University of Stuttgart

Preprints, 2019020019, 2019

DOI:10.20944/preprints201902.0019.v1

@article{pfander2019heterogeneous,

title={Heterogeneous Distributed Big Data Clustering on Sparse Grids},

author={Pfander, David and Dai{ss}, Gregor and Pfl{"u}ger, Dirk},

year={2019},

publisher={Preprints}

}

Download (PDF)

View

Source

Source codes

Package:

SG++: a numerical library for adaptive Sparse Grids

2737

views

Clustering is an important task in data mining that has become more challenging due to the ever-increasing size of available datasets. To cope with these big data scenarios, a high-performance clustering approach is required. Sparse grid clustering is a density-based clustering method that uses a sparse grid density estimation as its central building block. The underlying density estimation approach enables the detection of clusters with non-convex shapes and without a predetermined number of clusters. In this work, we introduce a new distributed and performance-portable variant of the sparse grid clustering algorithm that is suited for big data settings. Our compute kernels were implemented in OpenCL to enable portability across a wide range of architectures. For distributed environments, we added a manager-worker scheme that was implemented using MPI. In experiments on two supercomputers, Piz Daint and Hazel Hen, with up to 100 million data points in a 10-dimensional dataset, we show the performance and scalability of our approach. The dataset with 100 million data points was clustered in 1198s using 128 nodes of Piz Daint. This translates to an overall performance of 352TFLOPS. On the node-level, we provide results for two GPUs, Nvidia’s Tesla P100 and the AMD FirePro W8100, and one processor-based platform that uses Intel Xeon E5-2680v3 processors. In these experiments, we achieved between 43% and 66% of the peak performance across all compute kernels and devices, demonstrating the performance portability of our approach.

Tags: Clustering, Computer science, Data mining, Distributed computing, Heterogeneous systems, Machine learning, MPI, nVidia, OpenCL, Package, performance portability, Tesla P100

February 10, 2019 by hgpu

Rating: 2.0/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Heterogeneous Distributed Big Data Clustering on Sparse Grids

Package:

Your response

Recent source codes

Awesome LLM-Driven Kernel Generation

PhysProver: Advancing Automatic Theorem Proving for Physics

ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation

SeedFold: Scaling Biomolecular Structure Prediction

Tilus: A Tile-Level GPU Kernel Programming Language

Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs

BoltzGen:Toward Universal Binder Design

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution

MATLAB Tensor Core models

Most viewed papers (last 30 days)

Heterogeneous Distributed Big Data Clustering on Sparse Grids

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)