Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET
Bio Imaging Signal Processing Lab., Dept. of Bio/Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea
IEEE Journal of Biomedical and Health Informatics, 2013
@article{kima2013ultra,
title={Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET},
author={Kima, Kyung Sang and Sonb, Young Don and Chob, Zang Hee and Rac, Jong Beom and Yea, Jong Chul},
year={2013}
}
Scatter correction is very important in 3D PET reconstruction due to a large scatter contribution in measurements. Currently, one of the most popular methods is so called single scatter simulation (SSS), which considers single Compton scattering contributions from many randomly distributed scatter points. The SSS enables a fast calculation of scattering with a relatively high accuracy; however, the accuracy of SSS is dependent on the accuracy of tail fitting to find a correct scaling factor, which is often difficult in low photon count measurements. To overcome this drawback as well as to improve accuracy of scatter estimation by incorporating multiple scattering contribution, we propose a multiple scatter simulation (MSS) based on a simplified Monte Carlo (MC) simulation that considers photon migration and interactions due to photoelectric absorption and Compton scattering. Unlike the SSS, the MSS calculates a scaling factor by comparing simulated prompt data with the measured data in the whole volume, which enables a more robust estimation of a scaling factor. Even though the proposed MSS is based on MC, a significant acceleration of the computational time is possible by using a virtual detector array with a larger pitch by exploiting that the scatter distribution varies slowly in spatial domain. Furthermore, our MSS implementation is nicely fit to a parallel implementation using graphic processor unit (GPU). In particular, we exploit a hybrid CPU-GPU technique using the open multi-processing (OpenMP) and the compute unified device architecture (CUDA), which results in 128.3 times faster than using a single CPU. Overall, the computational time of MSS is 9.4 sec for a high-resolution research tomograph (HRRT) system. The performance of the proposed MSS is validated through actual experiments using an HRRT.
June 14, 2013 by hgpu