1650

Posts

Nov, 19

Design and Performance Evaluation of Image Processing Algorithms on GPUs

In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel GPU (graphics processing units) using the CUDA (compute unified device architecture) programming model. A set of metrics, customized for image processing, are proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of […]
Nov, 19

Multi-dimensional characterization of temporal data mining on graphics processors

Through the algorithmic design patterns of data parallelism and task parallelism, the graphics processing unit (GPU) offers the potential to vastly accelerate discovery and innovation across a multitude of disciplines. For example, the exponential growth in data volume now presents an obstacle for high-throughput data mining in fields such as neuroscience and bioinformatics. As such, […]
Nov, 19

A two-level real-time vision machine combining coarse- and fine-grained parallelism

In this paper, we describe a real-time vision machine having a stereo camera as input generating visual information on two different levels of abstraction. The system provides visual low-level and mid-level information in terms of dense stereo and optical flow, egomotion, indicating areas with independently moving objects as well as a condensed geometric description of […]
Nov, 18

From Rendering to Tracking Point-based 3D Models

This paper adds to the abundant visual tracking literature with two main contributions. First we illustrate the interest of using Graphic Processing Units (GPU) to support efficient implementations of computer vision algorithms and, secondly, we introduce the use of point-based 3D models as a shape prior for real-time 3D tracking with a monocular camera. The […]
Nov, 18

Acceleration of the Smith-Waterman Algorithm using Single and Multiple Graphics Processors

Finding regions of similarity between two very long data streams is a computationally intensive problem referred to as sequence alignment. Alignment algorithms must allow for imperfect sequence matching with different starting locations and some gaps and errors between the two data sequences. Perhaps the most well known application of sequence matching is the testing of […]
Nov, 18

Parallel implementation of Artificial Neural Network training for speech recognition

In this paper we describe the implementation of a complete ANN training procedure using the block mode back-propagation learning algorithm for sequential patterns – such as the observation feature vectors of a speech recognition system – exploiting the high performance SIMD architecture of GPU using CUDA and its C-like language interface. We also compare the […]
Nov, 18

Modeling the propagation of elastic waves using spectral elements on a cluster of 192 GPUs

We implement a high-order finite-element application, which performs the numerical simulation of seismic wave propagation resulting for instance from earthquakes at the scale of a continent or from active seismic acquisition experiments in the oil industry, on a large GPU-enhanced cluster. Mesh coloring enables an efficient accumulation of degrees of freedom in the assembly process […]
Nov, 18

Accelerating POCS interpolation of 3D irregular seismic data with Graphics Processing Units

Seismic trace interpolation is necessary for high-resolution imaging when the acquired data are not adequate or when some traces are missing. Projection-onto-convex-sets (POCS) interpolation can gradually recover missing traces with an iterative algorithm, but its computational cost in a 3D CPU-based implementation is too high for practical applications. We present a computing scheme to speed […]
Nov, 18

Fast evaluation of Helmholtz potential on graphics processing units (GPUs)

This paper presents a parallel algorithm implemented on graphics processing units (GPUs) for rapidly evaluating spatial convolutions between the Helmholtz potential and a large-scale source distribution. The algorithm implements a non-uniform grid interpolation method (NGIM), which uses amplitude and phase compensation and spatial interpolation from a sparse grid to compute the field outside a source […]
Nov, 18

Fault Table Computation on GPUs

In this paper, we explore the implementation of fault table generation on a Graphics Processing Unit (GPU). A fault table is essential for fault diagnosis and fault detection in VLSI testing and debug. Generating a fault table requires extensive fault simulation, with no fault dropping, and is extremely expensive from a computational standpoint. Fault simulation […]
Nov, 18

A GPGPU compiler for memory optimization and parallelism management

This paper presents a novel optimizing compiler for general purpose computation on graphics processing units (GPGPU). It addresses two major challenges of developing high performance GPGPU programs: effective utilization of GPU memory hierarchy and judicious management of parallelism.
Nov, 18

Feature-preserving triangular geometry images for level-of-detail representation of static and skinned meshes

Geometry images resample meshes to represent them as texture for efficient GPU processing by forcing a regular parameterization that often incurs a large amount of distortion. Previous approaches broke the geometry image into multiple rectangular or irregular charts to reduce distortion, but complicated the automatic level of detail one gets from MIP-maps of the geometry […]
Page 914 of 957« First...102030...912913914915916...920930940...Last »

Recent source codes

* * *

* * *

HGPU group © 2010-2018 hgpu.org

All rights belong to the respective authors

Contact us: