A parallel algorithm to remove impulsive noise in digital images using heterogeneous CPU/GPU computing is proposed. The parallel denoising algorithm is based on the peer group concept and uses an Euclidean metric. In order to identify the amount of pixels to be allocated in multi-core and GPUs, a performance analysis using large images is presented. […]

June 27, 2014 by hgpu

A parallel algorithm for image noise removal is proposed. The algorithm is based on peer group concept and uses a fuzzy metric. An optimization study on the use of the CUDA platform to remove impulsive noise using this algorithm is presented. Moreover, an implementation of the algorithm on multi-core platforms using OpenMP is presented. Performance […]

December 15, 2013 by hgpu

In this paper, we present an efficient implementation of parallel algorithms to remove noise in digital images using different Graphics Processing Units (GPUs). The algorithm, based on the concept of peer group, uses a fuzzy metric for finding wrong pixels and the Arithmetic Mean Filter (AMF) to correct it. There are many factors to study […]

September 1, 2012 by hgpu

This paper proposes a type of pseudorandom number generator, Mersenne Twister for Graphic Processor (MTGP), for efficient generation on graphic processessing units (GPUs). MTGP supports large state sizes such as 11213 bits, and uses the high parallelism of GPUs in computing many steps of the recursion in parallel. The second proposal is a parameter-set generator […]

February 21, 2012 by hgpu

AIMS: We discuss an implementation of our 3D radiative transfer (3DRT) framework with the OpenCL paradigm for general GPU computing. METHODS: We implemented the kernel for solving the 3DRT problem in Cartesian coordinates with periodic boundary conditions in the horizontal (x,y) plane, including the construction of the nearest neighbor ^* and the operator splitting step. […]

February 17, 2012 by hgpu

The author proposes pseudorandom number generators suitable to execute on a graphic processor. They generate pseudorandom numbers in device memory on graphic processors. Each generator uses shared memory on graphic processors as its internal state space, and uses constant memory as a look-up table for a linear transformation. Output formats of the generator are 32-bit […]

August 3, 2011 by hgpu

We discuss an implementation of our 3D radiative transfer (3DRT) framework with the OpenCL paradigm for general GPU computing. We implement the kernel for solving the 3DRT problem in Cartesian coordinates with periodic boundary conditions in the horizontal $(x,y)$ plane, including the construction of the nearest neighbor $Lstar$ and the operator splitting step. We present […]

July 29, 2011 by hgpu

A CUDA implementation of the existing software FIRST (Fast Iterative Reconstruction Software for (PET) Tomography) is presented. This implementation uses consumer graphics processing units (GPUs) to accelerate the compute-intensive parts of the reconstruction: forward and backward projection. FIRST was originally developed in FORTRAN, and it has been migrated to C language to be used with […]

May 19, 2011 by hgpu

As the effective limits of frequency and instruction level parallelism have been reached, the strategy of microprocessor vendors has changed to increase the number of processing cores on a single chip each generation. The implicit expectation is that software developers will write their applications with concurrency in mind to take advantage of this sudden change […]

November 7, 2010 by hgpu