high performance computing on graphics processing units: hgpu.org

Synergia CUDA: GPU-accelerated accelerator modeling package

Q. Lu, J. Amundson

Scientific Computing Division, Fermi National Accelerator Laboratory, P.O. Box 500, Batavia, Illinois 60510, U.S.

Journal of Physics: Conference Series, 513, 052021, 2014

DOI:10.1088/1742-6596/513/5/052021

Package: Synergia2

Synergia is a parallel, 3-dimensional space-charge particle-in-cell accelerator modeling code. We present our work porting the purely MPI-based version of the code to a hybrid of CPU and GPU computing kernels. The hybrid code uses the CUDA platform in the same framework as the pure MPI solution. We have implemented a lock-free collaborative charge-deposition algorithm for the GPU, as well as other optimizations, including local communication avoidance for GPUs, a customized FFT, and fine-tuned memory access patterns. On a small GPU cluster (up to 4 Tesla C1070 GPUs), our benchmarks exhibit both superior peak performance and better scaling than a CPU cluster with 16 nodes and 128 cores. We also compare the code performance on different GPU architectures, including C1070 Tesla and K20 Kepler.
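
The abstract does not spell out how the lock-free charge deposition works, so the following is only a minimal sketch of one general way to deposit charge without locks or atomic updates, not the authors' algorithm and not Synergia code. It assumes a 1-dimensional periodic grid with linear (cloud-in-cell) weighting, and it is written in plain Python rather than CUDA so that it can be run anywhere. The function names `deposit_scatter` and `deposit_conflict_free` are hypothetical. The idea is to bin particles by cell, let one worker reduce each cell's particles into per-cell partial sums, and then let one worker per grid node gather from its two adjacent cells, so every output location has exactly one writer.

```python
import math


def deposit_scatter(positions, charge, n_cells, dx):
    """Reference serial scatter: every particle adds to two grid nodes.

    Run in parallel, two particles in the same cell would write to the
    same node, which is why a naive GPU port needs locks or atomics.
    """
    rho = [0.0] * n_cells
    for x in positions:
        s = x / dx
        i = math.floor(s)
        w = s - i  # fractional offset of the particle inside its cell
        rho[i % n_cells] += charge * (1.0 - w)
        rho[(i + 1) % n_cells] += charge * w
    return rho


def deposit_conflict_free(positions, charge, n_cells, dx):
    """Same result, organized so each output has exactly one writer."""
    # Stage 1: bin particles by cell (on a GPU this could be a sort by
    # cell index; that mapping is an assumption of this sketch).
    cells = [[] for _ in range(n_cells)]
    for x in positions:
        s = x / dx
        i = math.floor(s)
        cells[i % n_cells].append(s - i)

    # Stage 2: one worker per cell reduces only its own particles into
    # contributions for the cell's left and right nodes.
    left = [0.0] * n_cells
    right = [0.0] * n_cells
    for c in range(n_cells):
        for w in cells[c]:
            left[c] += charge * (1.0 - w)
            right[c] += charge * w

    # Stage 3: one worker per node gathers from the two cells touching it.
    return [left[j] + right[(j - 1) % n_cells] for j in range(n_cells)]


if __name__ == "__main__":
    pos = [0.25, 0.75, 1.5, 3.9, 3.1, 2.0]
    print(deposit_scatter(pos, 1.0, 4, 1.0))
    print(deposit_conflict_free(pos, 1.0, 4, 1.0))
```

Both functions return the charge assigned to each grid node; the two results should agree up to floating-point rounding, and the total deposited charge equals the number of particles times `charge`. The gather form trades an extra binning pass for the absence of write conflicts, which is the general trade-off a lock-free scheme makes.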

Tags: CUDA, FFT, GPU cluster, MPI, nVidia, Package, Particle-in-cell methods, Physics, Tesla C1070, Tesla K20

June 17, 2014 by hgpu
