Posts
Sep, 14
GPU-based Parallel Reservoir Simulators
We have developed a GPU-based parallel linear solver package. When solving linear systems arising from reservoir simulation, the GPU-based parallel solvers are much more efficient than CPU-based linear solvers. However, further work is needed to improve the setup phase of domain decomposition, the ILUT factorization, and the parallelism of the block ILUT preconditioner.
Sep, 14
A GPU-based Affine and Scale Invariant Feature Transform Algorithm
Affine invariance is one of the key properties of a good feature extraction algorithm. SIFT is a scale-invariant feature extraction algorithm, but it is not affine invariant. To improve SIFT's affine invariance, the Affine and Scale Invariant Feature Transform (ASIFT) algorithm incorporates an affine model into SIFT. However, the serial ASIFT algorithm’s computing […]
Sep, 14
A GPU Accelerated BiConjugate Gradient Stabilized Solver for Speeding-up Large Scale Model Evaluation
Solving linear systems remains a key activity in economics modelling, which makes fast and accurate methods for computing solutions highly desirable. In this paper, a proof-of-concept C++ AMP implementation of an iterative method for solving linear systems, BiConjugate Gradient Stabilized (henceforth BiCGSTAB), is presented. The method relies on matrix and vector operations, […]
Sep, 14
Efficient CUDA polynomial preconditioned Conjugate Gradient solver for Finite Element computation of elasticity problems
The Graphics Processing Unit (GPU) has achieved great success in scientific computing thanks to its tremendous computational horsepower and very high memory bandwidth. This paper discusses an efficient way to implement a polynomial preconditioned conjugate gradient solver for the finite element computation of elasticity on NVIDIA GPUs using Compute Unified Device Architecture (CUDA). Sliced Block ELLPACK (SBELL) format […]
Sep, 13
FuzzyGPU: a fuzzy arithmetic library for GPU
Data are traditionally represented using native formats such as integers or floating-point numbers of various flavors. However, some applications rely on more complex representation formats. This is the case when uncertainty needs to be accounted for. Fuzzy arithmetic is one of the major tools to address this problem, but the execution time of basic operations such […]
Sep, 13
Increasing GPU Throughput using Kernel Interleaved Thread Block Scheduling
The number of active threads required to achieve peak application throughput on graphics processing units (GPUs) depends largely on the ratio of time spent on computation to the time spent accessing data from memory. While compute-intensive applications can achieve peak throughput with a low number of threads, memory-intensive applications might not achieve good throughput even […]
Sep, 13
An Interface for Halo Exchange Pattern
Halo exchange patterns are very common in scientific computing, since the solution of PDEs often requires communication between neighboring points. Although this is a common pattern, programmers often implement it from scratch, with an accompanying feeling of "reinventing the wheel". In this paper we describe GCL, a C++ generic library that implements a […]
Sep, 13
Exploring Multiple Dimensions of Parallelism in Junction Tree Message Passing
Belief propagation over junction trees is known to be computationally challenging in the general case. One way of addressing this computational challenge is to use node-level parallel computing, and parallelize the computation associated with each separator potential table cell. However, this approach is not efficient for junction trees that mainly contain small separators. In this […]
Sep, 13
Recent progress and challenges in exploiting graphics processors in computational fluid dynamics
The progress made in accelerating simulations of fluid flow using GPUs, and the challenges that remain, are surveyed. The review first provides an introduction to GPU computing and programming, and discusses various considerations for improved performance. Case studies comparing the performance of CPU- and GPU-based solvers for the Laplace and incompressible Navier-Stokes equations are […]
Sep, 13
Accelerating moderately stiff chemical kinetics in reactive-flow simulations using GPUs
The chemical kinetics ODEs arising from operator-split reactive-flow simulations were solved on GPUs using explicit integration algorithms. Nonstiff chemical kinetics of a hydrogen oxidation mechanism (9 species and 38 irreversible reactions) were computed using the explicit fifth-order Runge-Kutta-Cash-Karp method, and the GPU-accelerated version performed faster than single- and six-core CPU versions by factors of 126 […]
Sep, 13
A massively parallel program to solve the phase field formulation for crack propagation
Phase field models for fracture employ a continuous field variable to model cracks. Therefore, in contrast to discrete descriptions of fracture, numerical tracking of discontinuities in the displacement field is not required. This greatly reduces implementation complexity. In this paper, we discuss the use of a single graphical processing unit (GPU) to accelerate the solution […]
Sep, 13
Simulation and modeling of physical broadcasts
The environment around us exhibits many phenomena, and it behaves differently depending on biological, chemical, physical, and other parameters. To represent this environment in a simple and abstract way, we use a concept called environmental modeling. Environmental modeling deals with many environmental problems such as air pollution, diffusion of disease, animal behavior and so […]