Posts
Dec, 6
Automatic Fusions of CUDA-GPU Kernels for Parallel Map
When implementing a function mapping on the contemporary GPU, several contradictory performance factors affecting distribution of computation into GPU kernels have to be balanced. A decomposition-fusion scheme suggest to decompose computational problem to be solved by several simple functions implemented as standalone kernels and some of these functions later fuse into more complex kernels to […]
Dec, 6
Multiprocessing Acceleration of H.264/AVC Motion Estimation Full Search Algorithm under CUDA Architecture
This work presents a parallel GPU-based solution for the Motion Estimation (ME) process in a videoencoding system. We propose a way to partition the steps of Full Search block matching algorithm in the CUDA architecture, and to compare the performance with a theoretical model and two implementations (sequential and parallel using OpenMP library). We obtained […]
Dec, 6
DTAM: Dense tracking and mapping in real-time
DTAM is a system for real-time camera tracking and reconstruction which relies not on feature extraction but dense, every pixel methods. As a single hand-held RGB camera flies over a static scene, we estimate detailed textured depth maps at selected keyframes to produce a surface patchwork with millions of vertices. We use the hundreds of […]
Dec, 6
Massively Parallel Identification of Intersection Points for GPGPU Ray Tracing
The latest advancements in computer graphics architectures, as the replacement of some fixed stages of the pipeline for programmable stages (shaders), have been enabling the development of parallel general purpose applications on massively parallel graphics architectures (Streaming Processors). For years the graphics processing unit (GPU) is being optimized for increasingly high throughput of massively parallel […]
Dec, 6
Simulation of pollutant transport in shallow water on a CUDA architecture
Shallow water simulation enables the study of problems such as dam break, river, canal and coastal hydrodynamics, as well as the transport of inert substances, such as pollutants, on a fluid. This article describes a GPU efficient and cost-effective CUDA implementation of a finite volume numerical scheme for solving pollutant transport problems in bidimensional domains. […]
Dec, 6
GPU-Based Liquid Crystal Display Processing Platform
In the past decade liquid crystal displays (LCD) have taken over the television (TV) and monitor market from cathode ray tube (CRT) display. Compared to CRT displays, LCD offers larger screen sizes, higher resolution, thinner, lighter, and more energy efficient. However, with respect to image quality, LCD does not catch up to CRT display in […]
Dec, 6
Performance Analysis of GPU compared to Single-core and Multi-core CPU for Natural Language Applications
In Natural Language Processing (NLP) applications, the main time-consuming process is string matching due to the large size of lexicon. In string matching processes, data dependence is minimal and hence it is ideal for parallelization. A dedicated system with memory interleaving and parallel processing techniques for string matching can reduce this burden of host CPU, […]
Dec, 6
Real-time Terrain Modeling using CPU-GPU Coupled Computation
Motivated by the importance of having real-time feedback in sketch-based modeling tools, we present a framework for terrain edition capable of generating and displaying complex and high-resolution terrains. Our system is efficient and fast enough to allow the user to see the terrain morphing at the same time the drawing editing occurs. We have two […]
Dec, 6
Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results
GPUs offer several times the floating point performance and memory bandwidth of current standard two socket CPU servers, e.g. NVIDIA C2070 vs. Intel Xeon Westmere X5650. The lattice Boltzmann method has been established as a flow solver in recent years and was one of the first flow solvers to be successfully ported and that performs […]
Dec, 6
A Fast 3D Spatial Analysis Technique Using Graphic Process Units
More and more 3D terrain information is converted and stored in a digital format, and they have been used as a primary data source for a variety of spatial analyses in various application domains. One unsolved issue is that 3D spatial analyses (e.g., 3D ray tracing) often require huge computing cost in terms of processing […]
Dec, 5
Soft Error Resilient QR Factorization for Hybrid System with GPGPU
The general purpose graphics processing units (GPGPU) are increasingly deployed for scientific computing due to their performance advantages over CPUs. What followed is the fact that fault tolerance has become a more serious concern compared to the period when GPGPUs were used exclusively for graphics applications. Using GPUs and CPUs together in a hybrid computing […]
Dec, 5
Effectiveness of GPGPU for Solving the Magnetohydrodynamics Equations Using the CIP-MOCCT Method
A simple parallelization approach using General Purpose computation on Graphics Processing Unit was applied for solving the MHD equations using the CIP-MOCCT method. We investigated the efficiency of this parallelization approach and found that the computational speed of the modified code is significantly improved despite the simple modification.