7970

Posts

Jul, 11

A fully parallel, high precision, N-body code running on hybrid computing platforms

We present a new implementation of the numerical integration of the classical, gravitational, N-body problem based on a high order Hermite’s integration scheme with block time steps, with a direct evaluation of the particle-particle forces. The main innovation of this code (called HiGPUs) is its full parallelization, exploiting both OpenMP and MPI in the use […]
Jul, 11

Hybrid Monte Carlo with Wilson Dirac operator on the Fermi GPU

In this article we present our implementation of a Hybrid Monte Carlo algorithm for Lattice Gauge Theory using two degenerate flavours of Wilson-Dirac fermions on a Fermi GPU. We find that using registers instead of global memory speeds up the code by almost an order of magnitude. To map the array variables to scalars, so […]
Jul, 10

Exposure Render: An Interactive Photo-Realistic Volume Rendering Framework

The field of volume visualization has undergone rapid development during the past years, both due to advances in suitable computing hardware and due to the increasing availability of large volume datasets. Recent work has focused on increasing the visual realism in Direct Volume Rendering (DVR) by integrating a number of visually plausible but often effect-specific […]
Jul, 10

Multi-level Parallelization of Advanced Video Coding on Hybrid CPU/GPU Platform

In this paper we propose a dynamic model for parallel H.264/AVC video encoding on hybrid GPU/CPU systems. Entire inter-loop is parallelized on both CPU and GPU and computationally light and efficient model is proposed to dynamically distribute computation load among simultaneously processing devices. This model includes both dependency aware task scheduling and load balancing algorithm […]
Jul, 10

Runtime Systems and Scheduling Support for High-End CPU-GPU Architectures

In recent years, multi-core CPUs and many-core GPUs have emerged as mainstream and cost-effective means for scaling. Consequently, a trend that is receiving wide attention is of heterogeneous computing platforms consisting of both CPU and GPU. Such heterogeneous architectures are pervasive across notebooks, desktops, clusters, supercomputers and cloud environments. While they expose huge potential for […]
Jul, 10

A Generic Library for Stencil Computations

In this era of diverse and heterogeneous computer architectures, the programmability issues, such as productivity and portable efficiency, are crucial to software development and algorithm design. One way to approach the problem is to step away from traditional sequential programming languages and move toward domain specific programming environments to balance between expressivity and efficiency. In […]
Jul, 10

A hybrid Hermitian general eigenvalue solver

The adoption of hybrid GPU-CPU nodes in traditional supercomputing platforms opens acceleration opportunities for electronic structure calculations in materials science and chemistry applications, where medium sized Hermitian generalized eigenvalue problems must be solved many times. The small size of the problems limits the scalability on a distributed memory system, hence they can benefit from the […]
Jul, 9

Parallelising the Transfer-Matrix Method using Graphics Processors

We study the disorder-induced Anderson localisation of a d-dimensional solid, computing the localisation lengths using the Transfer-Matrix Method (TMM) and aiming to develop an efficient parallel implementation to run on Graphics Processing Units (GPUs). In the TMM, a quasi one-dimensional bar of length L >> M is split into slices of size M^(d-1). The Schrodinger […]
Jul, 9

Performance models for CUDA streams on NVIDIA GeForce series

Graphics Processing Units (GPU) have impressively arisen as generalpurpose coprocessors in high performance computing applications, since the launch of the Compute Unified Device Architecture (CUDA). However, they present an inherent performance bottleneck in the fact that communication between two separate address spaces (the main memory of the CPU and the memory of the GPU) is […]
Jul, 9

Elastically Deformable Models based on the Finite Element Method Accelerated on Graphics Hardware using CUDA

Elastically deformable models have found applications in various areas ranging from mechanical sciences and engineering to computer graphics. The method of Finite Elements has been the tool of choice for solving the underlying PDE, when accuracy and stability of the computations are more important than, e.g., computation time. In this paper we show that the […]
Jul, 9

Intensity model with blur effect on GPUs applied to large-scale star simulators

Intensity model with blur effect is widely employed to accurately simulate the imaging process of star simulator used for attitude determination and guiding system. It imposes great demands of computing power for realistic domains and modern Graphics Processing Units (GPUs) have demonstrated to be a powerful accelerator for this kind of computationally intensive simulations. This […]
Jul, 9

Complete PISO and SIMPLE solvers on Graphics Processing Units

We implemented the pressure-implicit with splitting of operators (PISO) and semi-implicit method for pressure-linked equations (SIMPLE) solvers of the Navier-Stokes equations on Fermi-class graphics processing units (GPUs) using the CUDA technology. We also introduced a new format of sparse matrices optimized for performing elementary CFD operations, like gradient or divergence discretization, on GPUs. We verified […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: