high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » Parallel Implementation of Vortex Element Method on CPUs and GPUs

Parallel Implementation of Vortex Element Method on CPUs and GPUs

Kseniia Kuzmina, Ilia Marchevsky, Victoriya Moreva

Bauman Moscow State Technical University, Moscow, Russia

Procedia Computer Science, Volume 66, Pages 73-82, 2015

DOI:10.1016/j.procs.2015.11.010

BibTeX

Download (PDF)

View

Source

2048

views

The implementations of 2D vortex element method adapted to different types of parallel computers are considered. The developed MPI-implementation provides close to linear acceleration for small number of computational cores and approximately 40-times acceleration for 80-cores cluster when solving model problem. OpenMP-based modification allows to obtain 5% additional acceleration due to shared memory usage. Approximate fast multipole method usage reduces time of computations significantly: 11 times for the testmodel problem in sequential mode and 3.5 times in parallel mode for 16-cores cluster. The most efficient implementation of vortex element method is developed for GPUs using NVidia CUDA technology. Time of the model problem solving using single GeForce GTX 970 or Tesla C2070 accelerator is comparable with time of its solving on cluster when involving 30-40 cores of Intel Xeon E5450 CPUs.

Tags: CUDA, Fast multipole method, Fluid dynamics, MPI, nVidia, nVidia GeForce GTX 970, Tesla C2070

December 6, 2015 by hgpu

Rating: 1.5/5. From 2 votes.

Please wait...

Your response

You must be logged in to post a comment.

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations

microSYCL: SYCL micro-benchmarks repository

Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Parallel Implementation of Vortex Element Method on CPUs and GPUs

Your response

Recent source codes

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

Most viewed papers (last 30 days)

Parallel Implementation of Vortex Element Method on CPUs and GPUs

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)