Posts
Nov, 13
Implementation and evaluation of various demons deformable image registration algorithms on GPU
Online adaptive radiation therapy (ART) promises the ability to deliver an optimal treatment in response to daily patient anatomic variation. A major technical barrier for the clinical implementation of online ART is the requirement of rapid image segmentation. Deformable image registration (DIR) has been used as an automated segmentation method to transfer tumor/organ contours from […]
Nov, 13
Development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport
Monte Carlo simulation is the most accurate method for absorbed dose calculations in radiotherapy. Its efficiency still requires improvement for routine clinical applications, especially for online adaptive radiotherapy. In this paper, we report our recent development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport. We have implemented the Dose Planning Method […]
Nov, 13
Stellar-mass black holes in star clusters: implications for gravitational wave radiation
We study the dynamics of stellar-mass black holes (BH) in star clusters with particular attention to the formation of BH-BH binaries, which are interesting as sources of gravitational waves (GW). We examine the properties of these BH-BH binaries through direct N-body simulations of star clusters using the GPU-enabled NBODY6 code. We perform simulations of N […]
Nov, 13
TWQCD’s dynamical DWF project
We present an overview of our project to simulate unquenched lattice QCD with optimal domain-wall quarks, using a GPU cluster that currently consists of 16 units of Nvidia Tesla S1070 plus 64 graphics cards with Nvidia GTX285 (128 GPUs in total, with a peak of 128 Teraflops), attaining sustained computing power of 15.36 Teraflops. The first production run […]
Nov, 13
A Markovian event-based framework for stochastic spiking neural networks
In this article we introduce and study a mathematical framework for characterizing and simulating networks of noisy integrate-and-fire neurons based on the spike times. We show that the firing times of the neurons in the networks constitute a Markov chain, whose transition probability is related to the probability distribution of the interspike interval of the […]
Nov, 13
Enhanced molecular dynamics performance with a programmable graphics processor
Design considerations for molecular dynamics algorithms capable of taking advantage of the computational power of a graphics processing unit (GPU) are described. Accommodating the constraints of scalable streaming-multiprocessor hardware necessitates a reformulation of the underlying algorithm. Performance measurements demonstrate the considerable benefit and cost-effectiveness of such an approach, which produces a factor of 2.5 speed […]
Nov, 13
CUDAEASY – a GPU Accelerated Cosmological Lattice Program
This paper presents, to the author’s knowledge, the first graphics processing unit (GPU) accelerated program that solves the evolution of interacting scalar fields in an expanding universe. We present the implementation in NVIDIA’s Compute Unified Device Architecture (CUDA) and compare the performance to other similar programs in chaotic inflation models. We report speedups between one […]
Nov, 13
The role of multigrid algorithms for LQCD
We report on the first successful QCD multigrid algorithm which demonstrates constant convergence rates independent of quark mass and lattice volume for the Wilson Dirac operator. The new ingredient is the adaptive method for constructing the near null space on which the coarse grid multigrid Dirac operator acts. In addition we speculate on future prospects […]
Nov, 13
QCD on GPUs: cost effective supercomputing
The exponential growth of floating-point power in graphics processing units (GPUs), together with their low cost, has given rise to an attractive platform upon which to deploy lattice QCD calculations. GPUs are essentially many-core (O(100)) chips that are programmed using a massively threaded environment, and so are representative of the future of high […]
Nov, 13
Air pollution modelling using a graphics processing unit with CUDA
The Graphics Processing Unit (GPU) is a powerful tool for parallel computing. In recent years the performance and capabilities of GPUs have increased, and the Compute Unified Device Architecture (CUDA) – a parallel computing architecture – has been developed by NVIDIA to utilize this performance in general purpose computations. Here we show for the […]
Nov, 13
GPU-based Fast Low-dose Cone Beam CT Reconstruction via Total Variation
Cone-beam CT (CBCT) has been widely used in image-guided radiation therapy (IGRT) to acquire updated volumetric anatomical information before treatment fractions for accurate patient alignment. However, the excessive x-ray imaging dose from serial CBCT scans raises a clinical concern in most IGRT procedures. The excessive imaging dose can be effectively reduced by reducing […]
Nov, 13
Tiling for Performance Tuning on Different Models of GPUs
The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been increasingly widely adopted over the last two years since the CUDA platform was released. Its benefit extends from the graphics domain to many other computationally intensive domains. Tiling, as the most general and important […]

