high performance computing on graphics processing units: hgpu.org

Multiple-GPUs Algorithm for Lattice Boltzmann Method

Jifu Zhou, Chengwen Zhong, Jianfei Xie, Shiqun Yin

Center for High Performance Computing, Northwestern Polytechnical University, Xi'an

International Symposium on Information Science and Engineering, 2008. ISISE ’08

DOI:10.1109/ISISE.2008.68

This paper studies a parallel algorithm for the lattice Boltzmann method (LBM). The data arrangement, communication, and computational procedure are redesigned to combine the Message Passing Interface (MPI) with general-purpose graphics processing units (GPUs). On a single GPU, techniques available with Shader Model 3.0, such as frame buffer objects (FBO), multiple-channel rendering, and render-to-texture, are used to improve computational efficiency. On multiple GPUs, MPI is used to extend the available mesh size and to parallelize the algorithm. As a result, meshes too large to be computed on a single GPU, such as 1024 × 1024, can be handled. Moreover, computing the instantaneous velocity vector of an incompressible fluid takes only 0.585 seconds per step, about 5.0 times faster than a single-CPU implementation.
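The abstract does not say how the mesh is divided among GPUs, so the following is only a minimal sketch of one common approach: split the rows of a D2Q9 lattice into strips, give each strip one ghost row per side, and refresh the ghost rows before the streaming step. It is written in plain Python rather than with OpenGL shaders and MPI as in the paper. The function names (`split_rows`, `stream_global`, `stream_decomposed`), the row-strip layout, and the periodic boundaries are all assumptions made for illustration, and the halo exchange is a plain list copy standing in for MPI messages between neighbouring ranks.

```python
# D2Q9 lattice velocities (cx, cy); index 0 is the rest particle.
VELOCITIES = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1),
              (1, 1), (-1, 1), (-1, -1), (1, -1)]


def stream_global(f, nx, ny):
    """Reference streaming step on the whole periodic mesh.

    f[y][x] is a list of 9 distribution values at node (x, y).
    """
    out = [[[0.0] * 9 for _ in range(nx)] for _ in range(ny)]
    for y in range(ny):
        for x in range(nx):
            for i, (cx, cy) in enumerate(VELOCITIES):
                # Push each value to the neighbour in direction i.
                out[(y + cy) % ny][(x + cx) % nx][i] = f[y][x][i]
    return out


def split_rows(ny, nranks):
    """Return (start, stop) row ranges, one strip per hypothetical rank."""
    base, extra = divmod(ny, nranks)
    ranges, start = [], 0
    for r in range(nranks):
        stop = start + base + (1 if r < extra else 0)
        ranges.append((start, stop))
        start = stop
    return ranges


def stream_decomposed(f, nx, ny, nranks):
    """Streaming with row-strip decomposition and one ghost row per side."""
    out = [None] * ny
    for start, stop in split_rows(ny, nranks):
        # Halo exchange (assumption): in an MPI code these two rows would
        # be received from the neighbouring ranks; here they are copies.
        below = f[(start - 1) % ny]
        above = f[stop % ny]
        local = [below] + f[start:stop] + [above]
        # Pull scheme: each owned node reads from its upstream neighbour,
        # which is either an owned row or one of the two ghost rows.
        for ly in range(1, len(local) - 1):
            row = []
            for x in range(nx):
                row.append([local[ly - cy][(x - cx) % nx][i]
                            for i, (cx, cy) in enumerate(VELOCITIES)])
            out[start + ly - 1] = row
    return out
```

With these assumptions, `split_rows(1024, 4)` assigns 256 rows to each of four ranks, and each rank needs only two extra rows from its neighbours per step. Only streaming is shown because the collision step is local to each node and requires no communication.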

Tags: Fluid dynamics, GPU cluster, Lattice Boltzmann model, MPI, OpenGL, Rendering

June 3, 2011 by hgpu
