high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Physics » Astrophysics » N-Body Simulation Using GP-GPU: Evaluating Host/Device Memory Transference Overhead

N-Body Simulation Using GP-GPU: Evaluating Host/Device Memory Transference Overhead

Sergio M. Martin, Fernando G. Tinetti, Nicanor B. Casas, Graciela E. De Luca, Daniel A. Giulianelli

Universidad Nacional de La Matanza, Florencio Varela 1903 – San Justo, Argentina

XIX Congreso Argentino de Ciencia de la Computacion (CACIC 2013), 2013

@article{martin2013n,

title={N-Body Simulation Using GP-GPU: Evaluating Host/Device Memory Transference Overhead},

author={Martin, Sergio M and Tinetti, Fernando G and Casas, Nicanor B and De Luca, Graciela E and Giulianelli, Daniel A},

year={2013}

}

Download (PDF)

View

Source

1935

views

N-Body simulation algorithms are amongst the most commonly used within the field of scientific computing. Especially in computational astrophysics, they are used to simulate gravitational scenarios for solar systems or galactic collisions. Parallel versions of such N-Body algorithms have been extensively designed and optimized for multicore and distributed computing schemes. However, N-Body algorithms are still a novelty in the field of GP-GPU computing. Although several N-body algorithms have been proved to harness the potential of a modern GPU processor, there are additional complexities that this architecture presents that could be analyzed for possible optimizations. In this article, we introduce the problem of host to device (GPU) – and vice versa – data transferring overhead and analyze a way to estimate its impact in the performance of simulations.

Tags: Astrophysics, CUDA, Gravitation, N-body simulation, nVidia, nVidia GeForce GTX 550 Ti, Physics

November 3, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

N-Body Simulation Using GP-GPU: Evaluating Host/Device Memory Transference Overhead

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

N-Body Simulation Using GP-GPU: Evaluating Host/Device Memory Transference Overhead

Share this:

Recent source codes

Most viewed papers (last 30 days)