Posts

Nov, 22

Optimal rotation alignment of 3D objects using a GPU-based similarity function

In this paper, we address the challenging task of finding the best alignment between two 3D objects by solving a global optimization problem in the space of rotations SO(3). The objective function to be optimized is a newly developed rotation-variant similarity measure, which is obtained directly from the object’s geometry and is entirely implemented on […]
Nov, 22

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems

We describe heterogeneous multi-CPU and multi-GPU implementations of Jacobi’s iterative method for the 2-D Poisson equation on a structured grid, in both single- and double-precision. Properly tuned, our best implementation achieves 98% of the empirical streaming GPU bandwidth (66% of peak) on an NVIDIA C1060, and 78% on a C870. Motivated to find a still […]
Nov, 22

Parallel multi-objective evolutionary algorithms on graphics processing units

Most real-life optimization problems or decision-making problems are multi-objective in nature, since they normally have several (possibly conflicting) objectives that must be satisfied at the same time. Multi-Objective Evolutionary Algorithms (MOEAs) have been gaining increasing attention among researchers and practitioners. However, they may execute for a long time for some difficult problems, because several evaluations […]
Nov, 22

GPU computing with Kaczmarz’s and other iterative algorithms for linear systems

The graphics processing unit (GPU) is used to solve large linear systems derived from partial differential equations. The differential equations studied are strongly convection-dominated, of various sizes, and common to many fields, including computational fluid dynamics, heat transfer, and structural mechanics. The paper presents comparisons between GPU and CPU implementations of several well-known iterative methods, […]
Nov, 22

A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware

Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, the reads produced are significantly shorter and more error-prone than those from the traditional Sanger shotgun sequencing method. This poses challenges for de novo DNA fragment assembly algorithms in terms of both accuracy (to deal with […]
Nov, 22

Inverse scattering and refraction corrected reflection for breast cancer imaging

Reflection ultrasound (US) has been utilized as an adjunct imaging modality for over 30 years. TechniScan, Inc. has developed unique transmission and concomitant reflection algorithms that are used to reconstruct images from data gathered during a tomographic breast scanning process called Warm Bath Ultrasound (WBU). The transmission algorithm yields high resolution, 3D, attenuation and speed […]
Nov, 22

Interventional 4-D Motion Estimation and Reconstruction of Cardiac Vasculature without Motion Periodicity Assumption

Anatomical and functional information of cardiac vasculature is a key component in the field of interventional cardiology. With the technology of C-arm CT it is possible to reconstruct static intraprocedural 3-D images from angiographic projection data. Current approaches attempt to add the temporal dimension (4-D). Under the assumption of periodic heart motion, ECG-gating techniques can […]
Nov, 22

A dynamically configurable coprocessor for convolutional neural networks

Convolutional neural network (CNN) applications range from recognition and reasoning (such as handwriting recognition, facial expression recognition and video surveillance) to intelligent text applications such as semantic text analysis and natural language processing. Two key observations drive the design of a new architecture for CNN. First, CNN workloads exhibit a widely varying mix of […]
Nov, 22

A fast stereo matching algorithm suitable for embedded real-time systems

In this paper, the challenge of fast stereo matching for embedded systems is tackled. Limited resources, e.g. memory and processing power, and most importantly real-time capability on embedded systems for robotic applications, do not permit the use of most sophisticated stereo matching approaches. The strengths and weaknesses of different matching approaches have been analyzed and […]
Nov, 22

Optimising GPR modelling: A practical, multi-threaded approach to 3D FDTD numerical modelling

The demand for advanced interpretational tools has led to the development of highly sophisticated, computationally demanding, 3D GPR processing and modelling techniques. Many of these methods solve very large problems with stepwise methods that utilise numerically similar functions within iterative computational loops. Problems of this nature are readily parallelised by splitting the computational domain into […]
Nov, 22

Fast reduction of undersampling artifacts in radial MR angiography with 3D total variation on graphics hardware

OBJECTIVE: Subsampling of radially encoded MRI acquisitions in combination with sparsity promoting methods opened a door to significantly increased imaging speed, which is crucial for many important clinical applications. In particular, it has been shown recently that total variation (TV) regularization efficiently reduces undersampling artifacts. The drawback of the method is the long reconstruction time […]
Nov, 22

Real-time ambient occlusion and halos with summed area tables

Volume models often show high depth complexity. This poses difficulties to the observer in judging the spatial relationships accurately. Illustrators usually use certain techniques such as improving the shading through shadows, halos, or edge darkening in order to enhance depth perception of certain structures. Both effects are difficult to generate in real-time for volumetric models. […]

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors
