high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters

Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters

Mudassar Majeed, Usman Dastgeer, Christoph Kessler

Department of Computer and Information Sciences (IDA), Linkoping University, Sweden

The 2013 International Conference on Parallel and Distributed, Processing Techniques and Applications (PDPTA’13), 2013

BibTeX

Download (PDF)

View

Source

2106

views

SkePU is a C++ template library with a simple and unified interface for expressing data parallel computations in terms of generic components, called skeletons, on multi-GPU systems using CUDA and OpenCL. The smart containers in SkePU, such as Matrix and Vector, perform data management with a lazy memory copying mechanism that reduces redundant data communication. SkePU provides programmability, portability and even performance portability, but up to now application written using SkePU could only run on a single multi-GPU node. We present the extension of SkePU for GPU clusters without the need to modify the SkePU application source code. With our prototype implementation, we performed two experiments. The first experiment demonstrates the scalability with regular algorithms for N-body simulation and electric field calculation over multiple GPU nodes. The results for the second experiment show the benefit of lazy memory copying in terms of speedup gained for one level of Strassen’s algorithm and another synthetic matrix sum application.

Tags: Algorithms, Computer science, CUDA, GPU cluster, N-body simulation, nVidia, Tesla M2090

December 4, 2013 by hgpu

No votes yet.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org

Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters

Share this:

Recent source codes

Most viewed papers (last 30 days)