Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET

hgpu.org » Programming » CUDA » Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET

Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET

Kyung Sang Kim, Young Don Son, Zang Hee Cho, Jong Beom Ra, and Jong Chul Ye

Bio Imaging Signal Processing Lab., Dept. of Bio/Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea

IEEE Journal of Biomedical and Health Informatics, 2013

@article{kima2013ultra,

title={Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET},

author={Kima, Kyung Sang and Sonb, Young Don and Chob, Zang Hee and Rac, Jong Beom and Yea, Jong Chul},

year={2013}

}

Download (PDF)

View

Source

2062

views

Scatter correction is very important in 3D PET reconstruction due to a large scatter contribution in measurements. Currently, one of the most popular methods is so called single scatter simulation (SSS), which considers single Compton scattering contributions from many randomly distributed scatter points. The SSS enables a fast calculation of scattering with a relatively high accuracy; however, the accuracy of SSS is dependent on the accuracy of tail fitting to find a correct scaling factor, which is often difficult in low photon count measurements. To overcome this drawback as well as to improve accuracy of scatter estimation by incorporating multiple scattering contribution, we propose a multiple scatter simulation (MSS) based on a simplified Monte Carlo (MC) simulation that considers photon migration and interactions due to photoelectric absorption and Compton scattering. Unlike the SSS, the MSS calculates a scaling factor by comparing simulated prompt data with the measured data in the whole volume, which enables a more robust estimation of a scaling factor. Even though the proposed MSS is based on MC, a significant acceleration of the computational time is possible by using a virtual detector array with a larger pitch by exploiting that the scatter distribution varies slowly in spatial domain. Furthermore, our MSS implementation is nicely fit to a parallel implementation using graphic processor unit (GPU). In particular, we exploit a hybrid CPU-GPU technique using the open multi-processing (OpenMP) and the compute unified device architecture (CUDA), which results in 128.3 times faster than using a single CPU. Overall, the computational time of MSS is 9.4 sec for a high-resolution research tomograph (HRRT) system. The performance of the proposed MSS is validated through actual experiments using an HRRT.

Tags: CUDA, Image processing, Monte Carlo simulation, nVidia, OpenMP, PET, Tesla C1060

June 14, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org