8899

Posts

Jan, 22

The PEPPHER Composition Tool: Performance-Aware Dynamic Composition of Applications for GPU-based Systems

The PEPPHER component model defines an environment for annotation of native C/C++ based components for homogeneous and heterogeneous multicore and manycore systems, including GPU and multi-GPU based systems. For the same computational functionality, captured as a component, different sequential and explicitly parallel implementation variants using various types of execution units might be provided, together with […]
Jan, 22

Analysis of Metallic Nanostructures by a Discontinuous Galerkin Time-Domain Maxwell Solver on Graphics Processing Units

In this thesis, we examine the optical properties of metallic nanostructures with typical feature sizes of the order of visible light. The interaction of light with such structures can be accurately described by classical electrodynamics. Thus, for the analysis of metallic nanostructures within this thesis, we will employ Maxwell’s equations [1] to model the physical […]
Jan, 20

Accelerating Image Reconstruction in Dual-Head PET System by GPU and Symmetry Properties

Positron emission tomography (PET) is an important imaging modality in both clinical usage and research studies. We have developed a compact high-sensitivity PET system that consisted of two large-area panel PET detector heads, which produce more than 224 million lines of response and thus request dramatic computational demands. In this work, we employed a state-of-the-art […]
Jan, 20

The Fast Multipole Method on the Cell processor

This paper presents the first deployment of the Fast Multipole Method on the Cell processor (PowerXCell 8i). We rely on the matrix formulation with BLAS routines of the FMB code (Fast Multipole with BLAS) in order to directly and efficiently offload the most time consuming operators of both far field and near field computations on […]
Jan, 20

Fast Positron Range Calculation in Heterogeneous Media for 3D PET Reconstruction

This paper presents a fast GPU-based solution to compensate positron range effects in heterogeneous media for iterative PET reconstruction. We assume a factorized approach, where projections are decomposed to phases according to the main physical effects. Positron range is the first effect in this chain, which causes a spatially varying blurring according to local material […]
Jan, 20

Parallel Distributed Face Search System for National and Border Security

The CCTV surveillance industry is undergoing a sea change due to the adoption of IP technologies. This is allowing the integration of a plethora of new cameras and other sensors into huge integrated networks. Adoption of IP technologies is presenting opportunities for scalable visual analytics that has the potential to add enormous value to entire […]
Jan, 19

Duality based optical flow algorithms with applications

We consider the popular TV-L^1 optical flow formulation, and the so-called duality based algorithm for minimizing the TV-L^1 energy. The original formulation is extended to allow for vector valued images, and minimization results are given. In addition we consider different definitions of total variation regularization, and related formulations of the optical flow problem that may […]
Jan, 18

GPU-accelererated regularisation of large diffusion-tensor volumes

We discuss the benefits, difficulties, and performance of a GPU implementation of the Chambolle-Pock algorithm for TGV (total generalised variation) denoising of medical diffusion tensor images. Whereas we have previously studied the denoising of 2D slices of $2 times 2$ and $3 times 3$ tensors, attaining satisfactory performance on a normal CPU, here we concentrate […]
Jan, 18

Scan Test Power Simulation on GPGPUs

The precise estimation of dynamic power consumption, power droop and temperature development during scan test require a very large number of time-aware gate-level logic simulations. Until now, such characterizations have been feasible only for rather small designs or with reduced precision due to the high computational demands. We propose a new, throughput-optimized timing simulator on […]
Jan, 18

Flip-Flop: Convex Hull Construction via Star-Shaped Polyhedron in 3D

Flipping is a local and efficient operation to construct the convex hull in an incremental fashion. However, it is known that the traditional flip algorithm is not able to compute the convex hull when applied to a polyhedron in R3. Our novel Flip-Flop algorithm is a variant of the flip algorithm. It overcomes the deficiency […]
Jan, 18

Fast Sparse Level Sets on Graphics Hardware

The level-set method is one of the most popular techniques for capturing and tracking deformable interfaces. Although level sets have demonstrated great potential in visualization and computer graphics applications, such as surface editing and physically based modeling, their use for interactive simulations has been limited due to the high computational demands involved. In this paper, […]
Jan, 18

Rethinking resampling in the particle filter on graphics processing units

Modern parallel computing devices such as the graphics processing unit (GPU) have gained significant traction in scientific computing, and are particularly well-suited to data-parallel algorithms such as the particle filter. Of the components of the particle filter, the resampling step is the most difficult to implement well on such devices, as it often requires a […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: