Posts

Jul, 10

Multi-level Parallelization of Advanced Video Coding on Hybrid CPU/GPU Platform

In this paper we propose a dynamic model for parallel H.264/AVC video encoding on hybrid GPU/CPU systems. The entire inter-loop is parallelized on both CPU and GPU, and a computationally light and efficient model is proposed to dynamically distribute the computational load among simultaneously processing devices. This model includes both dependency-aware task scheduling and a load balancing algorithm […]
Jul, 10

Runtime Systems and Scheduling Support for High-End CPU-GPU Architectures

In recent years, multi-core CPUs and many-core GPUs have emerged as mainstream and cost-effective means for scaling. Consequently, a trend that is receiving wide attention is that of heterogeneous computing platforms consisting of both CPU and GPU. Such heterogeneous architectures are pervasive across notebooks, desktops, clusters, supercomputers and cloud environments. While they expose huge potential for […]
Jul, 10

A Generic Library for Stencil Computations

In this era of diverse and heterogeneous computer architectures, the programmability issues, such as productivity and portable efficiency, are crucial to software development and algorithm design. One way to approach the problem is to step away from traditional sequential programming languages and move toward domain specific programming environments to balance between expressivity and efficiency. In […]
Jul, 10

A hybrid Hermitian general eigenvalue solver

The adoption of hybrid GPU-CPU nodes in traditional supercomputing platforms opens acceleration opportunities for electronic structure calculations in materials science and chemistry applications, where medium-sized Hermitian generalized eigenvalue problems must be solved many times. The small size of the problems limits the scalability on a distributed memory system, hence they can benefit from the […]
Jul, 9

Parallelising the Transfer-Matrix Method using Graphics Processors

We study the disorder-induced Anderson localisation of a d-dimensional solid, computing the localisation lengths using the Transfer-Matrix Method (TMM) and aiming to develop an efficient parallel implementation to run on Graphics Processing Units (GPUs). In the TMM, a quasi one-dimensional bar of length L >> M is split into slices of size M^(d-1). The Schrödinger […]
Jul, 9

Performance models for CUDA streams on NVIDIA GeForce series

Graphics Processing Units (GPUs) have risen impressively as general-purpose coprocessors in high performance computing applications since the launch of the Compute Unified Device Architecture (CUDA). However, they present an inherent performance bottleneck in the fact that communication between two separate address spaces (the main memory of the CPU and the memory of the GPU) is […]
Jul, 9

Elastically Deformable Models based on the Finite Element Method Accelerated on Graphics Hardware using CUDA

Elastically deformable models have found applications in various areas ranging from mechanical sciences and engineering to computer graphics. The method of Finite Elements has been the tool of choice for solving the underlying PDE, when accuracy and stability of the computations are more important than, e.g., computation time. In this paper we show that the […]
Jul, 9

Intensity model with blur effect on GPUs applied to large-scale star simulators

The intensity model with blur effect is widely employed to accurately simulate the imaging process of star simulators used for attitude determination and guiding systems. It places great demands on computing power for realistic domains, and modern Graphics Processing Units (GPUs) have proven to be powerful accelerators for this kind of computationally intensive simulation. This […]
Jul, 9

Complete PISO and SIMPLE solvers on Graphics Processing Units

We implemented the pressure-implicit with splitting of operators (PISO) and semi-implicit method for pressure-linked equations (SIMPLE) solvers of the Navier-Stokes equations on Fermi-class graphics processing units (GPUs) using the CUDA technology. We also introduced a new format of sparse matrices optimized for performing elementary CFD operations, like gradient or divergence discretization, on GPUs. We verified […]
Jul, 8

Fast GPU Garment Simulation and Collision Detection

This paper describes a technique for garment simulation and collision detection implemented on modern Graphics Processors (GPUs). It exploits a mass-spring cloth model with a velocity modification approach to overcome the super-elasticity. Our novel algorithms for cloth-body and cloth-cloth collision detection and response are based on image-space interference tests. For collision detection, a 3D texture is […]
Jul, 8

Interactive BRDF Estimation for Mixed-Reality Applications

Recent methods in augmented reality allow simulating mutual light interactions between real and virtual objects. These methods are able to embed virtual objects in a more sophisticated way than previous methods. However, their main drawback is that they need a virtual representation of the real scene to be augmented in the form of geometry and […]
Jul, 8

Teaching Parallel Programming Models on a Shallow-Water Code

We present a software package that supports teaching different parallel programming models in a computational science and engineering context. It implements a Finite Volume solver for the shallow water equations, with application to tsunami simulation in mind. The numerical model is kept simple, using patches of Cartesian grids as computational domain, which can be connected […]

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
