Posts
Jun, 23
GPU Implementation of the LFT Shape Matching Algorithm
Registration of partial scan data sets is still a challenge for today’s CAD systems and CAD system users. Many of the known methods rely on user interaction or feature recognition. For non-regular users this is too time consuming and error prone. The paper describes a method to register partial scan data by fitting a large […]
Jun, 23
Analysis of periodic structures with GPU accelerating
Various FDTD approaches has been applied in the analysis of periodic structures, among which Spectral FDTD is a straight-forward and robust one. This essay proposed an improved Spectral FDTD algorithm that improves the simulation speed of off-normal incident case without jeopardizing accuracy. Together with GPU accelerating, as is proven by the example, the proposed acceleration […]
Jun, 22
GPU accelerated high intensity ultrasound acoustical power computation
The simulation of the hepatocellular carcinoma therapy effects is often used for the intervention planning. As the physical-based model of the simulation is very time-consuming, the speed of this method becomes an obstacle during the clinical application simulation. In order to accelerate the simulation, a GPU-based (Graphic Processing Unit) acceleration method of the pressure field […]
Jun, 22
An Efficient Signal Processor of Synthetic Aperture Radar Based on GPU
The extending of Synthetic Aperture Radar (SAR) applications brings new challenges to SAR signal processing. Highly efficient SAR signal processors need to be explored towards these challenges. Recently, Graphics Processing Unit (GPU) is developing at top speed, which is a novel and promising computation platform for efficient SAR processors. A GPU-based SAR signal processor is […]
Jun, 22
Optimized GPU histograms for multi-modal registration
GPU-based systems are used more and more for medical image processing because of their parallel processing power and memory bandwidth. Impressive results have been achieved when registering large volume, however, one of themost-used similarity measures for multi-modal registration – mutual information – is not well suited for the streaming architecture because of its memory access […]
Jun, 22
Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
Accelerators, such as field programmable gate arrays (FPGAs) and graphics processing units (GPUs), are special purpose processors designed to speed up compute-intensive sections of applications. FPGAs are highly customizable, while GPUs provide massive parallel execution resources and high memory bandwidth. In this paper, we compare the performance of these architectures, presenting a performance study of […]
Jun, 22
Accelerated realization method of infrared targets detection based on GPU
With the rapid development of infrared detector, Realtime detection of infrared targets faces biggish challenge; In the Infrared System with detector of more cells and higher frame speed, when using more complicated processing arithmetic, the Signal Processing Platform basing on the trad framework has met hardly the demand of System’s Real-Time Detecting; the GPU for […]
Jun, 22
Optimizing simulated annealing on GPU: A case study with IC floorplanning
In this paper, we propose a novel floorplanning algorithm based on simulated annealing on GPUs. Simulated annealing is an inherently sequential algorithm, far from the typical programs suitable for Single Instruction Multiple Data (SIMD) style concurrency in a GPU. We propose a fundamentally different approach of exploring the floorplan solution space, where we evaluate concurrent […]
Jun, 22
High performance gate-level simulation with GP-GPU computing
Functional verification of modern digital designs is a mission critical and time-consuming task. Verification is essential since it ensures the correctness of the final product; however, due to the complexity of modern designs, verification has become the primary bottleneck of time to market. Logic simulation forms the core of most current verification efforts, as it […]
Jun, 22
Cloth Simulation Using AABB Hierarchies and GPU Parallelism
Providing realistic, high-resolution and high fidelity representation of motions ia essential in the cloth simulation problem. In order to make high resolution simulations tractable, several algorithms have been developed that manage cloth-object interactions efficiently through specialized data structures such as AABB trees. However, implementation restrictions on single CPU architectures impose certain limits on quality and […]
Jun, 22
Resource Sharing in GPU-Accelerated Windowing Systems
Recent windowing systems allow graphics applications to directly access the graphics processing unit (GPU) for fast rendering. However, application tasks that render frames on the GPU contend heavily with the windowing server that also accesses the GPU to blit the rendered frames to the screen. This resource-sharing nature of direct rendering introduces core challenges of […]
Jun, 22
Exploiting SPMD Horizontal Locality
In this paper, we analyze a particular spatial locality case (called horizontal locality) inherent to manycore accelerator architectures employing barrel execution of SPMD kernels, such as GPUs. We then propose an adaptive memory access granularity framework to exploit and enforce the horizontal locality in order to reduce the interferences among accelerator cores memory accesses and […]