high performance computing on graphics processing units: hgpu.org

Posts

Sep, 2

Generalized Voronoi Diagram Computation on GPU

We study the problem of using the GPU to compute the generalized Voronoi diagram (GVD) for higher-order sites, such as line segments and curves. This problem has applications in many fields, including computer animation, pattern recognition and so on. A number of methods have been proposed that use the GPU to speed up the computation […]

OpenGL

Sep, 2

A GPU Accelerated Algorithm for Compressive Sensing Based Image Super-Resolution

This paper presents a parallel algorithm designed for Super-resolution Image Reconstruction based on Compressive sensing in the ATI Stream platform. In the accelerating process, we select part of the serial program as the objects to be sped up according to the execution time of each stage, set appropriate parallel granularity to make full use of […]

Sep, 2

GPU-accelerated time-domain circuit simulation

Time-domain circuit simulation is often dominated by the transistor model evaluation time. An analysis of a test suite of 27 circuits shows 66% of the transient runtime is spent evaluating the core BSIM4 transistor model code. A modern graphics processing unit (GPU) is a highly paralled, high performance computer suitable for non-graphics tasks. Circuit simulation […]

Sep, 2

Accelerating String Matching Using Multi-Threaded Algorithm on GPU

Network Intrusion Detection System has been widely used to protect computer systems from network attacks. Due to the ever-increasing number of attacks and network complexity, traditional software approaches on uni-processors have become inadequate for the current high-speed network. In this paper, we propose a novel parallel algorithm to speedup string matching performed on GPUs. We […]

CUDA

Sep, 2

GPU-based simulation of side-looking sonar images

This paper describes an implementation of a sonar image simulator optimized for running on a computer’s Graphics Processing Unit (GPU). GPUs are hardware-optimized to obtain maximum performance on computer graphics applications. Because these applications generally simulate focal plane images (i.e. optical systems, video, etc), some specific adaptations are required to render range images such as […]

CUDA

Sep, 2

GPU Versus FPGA for High Productivity Computing

Heterogeneous or co-processor architectures are becoming an important component of high productivity computing systems (HPCS). In this work the performance of a GPU based HPCS is compared with the performance of a commercially available FPGA based HPC. Contrary to previous approaches that focussed on specific examples, a broader analysis is performed by considering processes at […]

CUDA

Sep, 2

Case study: Interactive rendering of adaptive mesh refinement data

Adaptive mesh refinement (AMR) is a popular computational simulation technique used in various scientific and engineering fields. Although AMR data is organized in a hierarchical multi-resolution data structure, the traditional volume visualization algorithms such as ray-casting and splatting cannot handle the form without converting it to a sophisticated data structure. In this paper, we present […]

OpenGL

Sep, 2

Fast evolutionary image processing using Multi-GPUs

In this paper, the authors propose a fast evolutionary image processing system. The authors employ graphics processing unit (GPU) to automatic construction of tree-structural image transformation (ACTIT) for the purpose of reducing optimization time. Besides, the system calculates in parallel by using multiple GPUs for the fast processing. The optimization speed of the proposed system […]

OpenGL

Sep, 1

Energy-optimized mapping of application to smartphone platform – A case study of mobile face recognition

Modern smartphones use heterogeneous multi-core SoC which includes CPU, GPU, DSP and various application-specific accelerators. It provides opportunities to realize compute-intensive applications on a battery-powered and resource-limited mobile device by assigning each sub-task to the most suitable computing core. To meet the performance requirement with minimized energy consumption, the algorithm also needs to be characterized […]

Sep, 1

Acceleration of TM cylinder EFIE with CUDA

In this paper, we have shown that exploitation of the GPU’s massively parallel architecture can dramatically increase the speed of MoM calculations. While the code can certainly be improved, matrix fill speed-up factors are already commonly found to be between 150X-260X. The conjugate gradient solver stands to be improved at this writing but still results […]

CUDA

Sep, 1

GOST-28147 Encryption Implementation on Graphics Processing Units

The GOST-21147 encryption implementation on graphics processing units is suggested. The approach for organizing the computation process on the GPU is described. Some realizations of basic operations for ciphering algorithm are suggested for Direct3D and OpenGL graphics API. The results of computational experiments on different GPU and CPU are given.

OpenGL

Sep, 1

Implementation of Sequential Importance Sampling in GPGPU

The estimation of many unknown parameters is carried out using a simplified Sequential Importance Sampling (SIS) algorithm which is implemented in a graphic processing unit (GPU). The aim of the present work is to show technical points to bring out the performance of GPU. Using the implemented code, two numerical experiments are demonstrated. In the […]

CUDA