high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » GPU acceleration of MOLAR for HRRT List-Mode OSEM reconstructions

GPU acceleration of MOLAR for HRRT List-Mode OSEM reconstructions

W.C. Barker, S. Thada

Nat. Inst. of Health, Bethesda

IEEE Nuclear Science Symposium Conference Record, 2007. NSS ’07

DOI:10.1109/NSSMIC.2007.4436766

@conference{barker2007gpu,

title={GPU acceleration of MOLAR for HRRT List-Mode OSEM reconstructions},

author={Barker, W.C. and Thada, S.},

booktitle={Nuclear Science Symposium Conference Record, 2007. NSS’07. IEEE},

volume={4},

pages={3004–3008},

issn={1082-3654},

year={2007},

organization={IEEE}

}

Source

2450

views

The Siemens ECAT HRRT PET scanner has the potential to produce images of the human brain with spatial resolution better than 3 mm. MOLAR (a motion-compensation OSEM List-mode Algorithm for Resolution-recovery) was developed to provide reconstructions of HRRT data with the best possible accuracy and precision. However, a computer cluster is required to generate reconstructions in a reasonable amount of time. Strategies for computational efficiency have already been implemented in MOLAR but room for improvement remains. In this study we have begun the process of converting time- consuming components of MOLAR to parallelized code that runs on commodity graphics cards (GPUs) with much faster turnaround. We evaluated the performance of list-mode event forward projections and component-based normalization factor calculations, and we confirmed the numerical accuracy of images reconstructed with GPU-assisted code running on an HP xw8400 workstation with an NVIDIA Quadro FX 4600 graphics card. We evaluated simulated data projected through a 128times128times128 image volume that included the direct calculation of a gaussian resolution function for simulated list-mode events. This was done using the Cg and CUDA programming APIs for implementation comparison. Both GPU versions ran up to 100 times faster than the CPU-only code. The CUDA version showed some improvement over Cg and was easier to program. We also examined measured Ge-68 phantom data projected through a 256times256times207 image volume with resolution functions obtained through array lookup rather than by direct calculation. The GPU-assisted code was observed to be up to 14 times faster than the CPU-only code, particularly when one million or more events were processed. Normalization processing was found to be up to 36 times faster. However, speedup decreased to a factor of 3 when disk I/O became dominant as more than one billion events were processed. We anticipate further acceleration of MOLAR as we convert other -components to GPU-assisted code, in particular backprojection and scatter correction. (Backprojection is on hold until a next generation GPU which has atomic write capability becomes available.)

Tags: CUDA, Image processing, Image reconstruction, Medicine, nVidia, nVidia Quadro FX 4600, Tomography

April 3, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

GPU acceleration of MOLAR for HRRT List-Mode OSEM reconstructions

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

GPU acceleration of MOLAR for HRRT List-Mode OSEM reconstructions

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)