Posts
Oct, 29
Performance study of interference on GPU and CPU resources with multiple applications
In the last years, the performance and capabilities of Graphics Processing Units (GPUs) improved drastically, mostly due to the demands of the entertainment market, with consumers and companies alike pushing for improvements in the level of visual fidelity, which is only achieved with high performing GPU solutions. Beside the entertainment market, there is an ongoing […]
Oct, 29
Current performance gains from utilizing the GPU or the ASIC MDGRAPE-3 within an enhanced Poisson Boltzmann approach
Scientific applications do frequently suffer from limited compute performance. In this article, we investigate the suitability of specialized computer chips to overcome this limitation. An enhanced Poisson Boltzmann program is ported to the graphics processing unit and the application specific integrated circuit MDGRAPE-3 and resulting execution times are compared to the conventional performance obtained on […]
Oct, 29
GPU-based image manipulation and enhancement techniques for dynamic volumetric medical image visualization
An important part of an image-guided surgical system is the display component, and seamless interactivity is critical to its successful application in a clinical environment. In this paper, we present several novel techniques for 4D medical image manipulation and enhancement that employ a graphics processing unit (GPU) to accelerate image processing. We describe three types […]
Oct, 29
An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
GPU architectures are increasingly important in the multi-core era due to their high number of parallel processors. Programming thousands of massively parallel threads is a big challenge for software engineers, but understanding the performance bottlenecks of those parallel programs on GPU architectures to improve application performance is even more difficult. Current approaches rely on programmers […]
Oct, 29
GPU-based Video Feature Tracking and Matching
Abstract This paper describes novel implementations of the KLT feature tracking and SIFT feature extraction algorithms that run on the graphics processing unit (GPU) and is suitable for video analysis in real-time vision systems. While significant acceleration over standard CPU implementations is obtained by exploiting parallelism provided by modern programmable graphics hardware, the CPU is […]
Oct, 28
GPU acceleration of cutoff pair potentials for molecular modeling applications
The advent of systems biology requires the simulation of ever-larger biomolecular systems, demanding a commensurate growth in computational power. This paper examines the use of the NVIDIA Tesla C870 graphics card programmed through the CUDA toolkit to accelerate the calculation of cutoff pair potentials, one of the most prevalent computations required by many different molecular […]
Oct, 28
Real-time eye blink detection with GPU-based SIFT tracking
This paper reports on the implementation of a GPUbased, real-time eye blink detector on very low contrast images acquired under near-infrared illumination. This detector is part of a multi-sensor data acquisition and analysis system for driver performance assessment and training. Eye blinks are detected inside regions of interest that are aligned with the subject’s eyes […]
Oct, 28
AUTO-GC: Automatic translation of data mining applications to GPU clusters
Because of the very favorable price to performance ratio of the GPUs, a popular parallel programming configuration today is a cluster of GPUs. However, extracting performance on such a configuration would typically require programming in both MPI and CUDA, thus requiring a high degree of expertise and effort. It is clearly desirable to be able […]
Oct, 28
GPU-based streaming architectures for fast cone-beam CT image reconstruction and demons deformable registration
This paper shows how to significantly accelerate cone-beam CT reconstruction and 3D deformable image registration using the stream-processing model. We describe data-parallel designs for the Feldkamp, Davis and Kress (FDK) reconstruction algorithm, and the demons deformable registration algorithm, suitable for use on a commodity graphics processing unit. The streaming versions of these algorithms are implemented […]
Oct, 28
GPU Gems: Programming Techniques, Tips and Tricks for Real-Time Graphics
GPU Gems has won a prestigious Front Line Award from Game Developer Magazine. The Front Line Awards recognize products that enable faster and more efficient game development, advancing the state of the art.FULL COLOR THROUGHOUT! “This collection of articles is particularly impressive for its depth and breadth. The book includes product-oriented case studies, previously unpublished […]
Oct, 28
CUDA cuts: Fast graph cuts on the GPU
Graph cuts has become a powerful and popular optimization tool for energies defined over an MRF and have found applications in image segmentation, stereo vision, image restoration, etc. The maxflow/mincut algorithm to compute graph-cuts is computationally heavy. The best-reported implementation of graph cuts takes over 100 milliseconds even on images of size 640×480 and cannot […]
Oct, 28
Browsing a Large Collection of Community Photos Based on Similarity on GPU
A novel approach is proposed in this paper to facilitate browsing a large collection of community photos based on visual similarities. Using extracted feature vectors, the approach maps photos onto a 2D rectangular area such that the ones with similar features are close to each other. When a user browses the collection, a subset of […]