Posts
Nov, 13
The role of multigrid algorithms for LQCD
We report on the first successful QCD multigrid algorithm that demonstrates constant convergence rates independent of quark mass and lattice volume for the Wilson Dirac operator. The new ingredient is the adaptive method for constructing the near null space on which the coarse grid multigrid Dirac operator acts. In addition, we speculate on future prospects […]
Nov, 13
QCD on GPUs: cost effective supercomputing
The exponential growth of floating-point power in graphics processing units (GPUs), together with their low cost, has given rise to an attractive platform upon which to deploy lattice QCD calculations. GPUs are essentially many-core (O(100)) chips that are programmed using a massively threaded environment, and so are representative of the future of high […]
Nov, 13
Air pollution modelling using a graphics processing unit with CUDA
The Graphics Processing Unit (GPU) is a powerful tool for parallel computing. In recent years the performance and capabilities of GPUs have increased, and the Compute Unified Device Architecture (CUDA), a parallel computing architecture, has been developed by NVIDIA to utilize this performance in general-purpose computations. Here we show for the […]
Nov, 13
GPU-based Fast Low-dose Cone Beam CT Reconstruction via Total Variation
Cone-beam CT (CBCT) has been widely used in image guided radiation therapy (IGRT) to acquire updated volumetric anatomical information before treatment fractions for accurate patient alignment. However, the excessive x-ray imaging dose from serial CBCT scans raises a clinical concern in most IGRT procedures. The excessive imaging dose can be effectively reduced by reducing […]
Nov, 13
Tiling for Performance Tuning on Different Models of GPUs
The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve program performance has become increasingly widely accepted in the two years since the CUDA platform was released. Its benefit extends from the graphics domain to many other computationally intensive domains. Tiling, as the most general and important […]
Nov, 12
Advanced Architectures for Astrophysical Supercomputing
Astronomers have come to rely on the increasing performance of computers to reduce, analyze, simulate and visualize their data. In this environment, faster computation can mean more science outcomes or the opening up of new parameter spaces for investigation. If we are to avoid major issues when implementing codes on advanced architectures, it is important […]
Nov, 12
RapidMind: Portability across Architectures and its Limitations
Recently, hybrid architectures using accelerators like GPGPUs or the Cell processor have gained much interest in the HPC community. The RapidMind Multi-Core Development Platform is a programming environment for generating code that runs seamlessly on hardware accelerators such as GPUs or the Cell processor, as well as on multicore CPUs from both AMD and Intel. […]
Nov, 12
GPU-Based Volume Rendering of Noisy Multi-Spectral Astronomical Data
Traditional analysis techniques may not be sufficient for astronomers to make the best use of the data sets that current and future instruments, such as the Square Kilometre Array and its Pathfinders, will produce. By utilizing the incredible pattern-recognition ability of the human mind, scientific visualization provides an excellent opportunity for astronomers to gain valuable […]
Nov, 12
Real-time digital holographic microscopy observable in multi-view and multi-resolution
We propose a real-time digital holographic microscopy system that reconstructs multiple images simultaneously with arbitrary resolution, depth and position, using Shifted-Fresnel diffraction instead of Fresnel diffraction. In this system, we used four graphics processing units (GPUs) for multiple reconstructions in real time. We demonstrate four reconstructed images from a hologram with arbitrary depths, […]
Nov, 12
Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL
In this paper, we report fast calculation of a computer-generated hologram (CGH) using the new architecture of the HD5000 series GPU (RV870) made by AMD and its new software development environment, OpenCL. Using an RV870 GPU and OpenCL, we can calculate a CGH with a resolution of 1,920 × 1,024 from a 3D object consisting of 1,024 points in […]
Nov, 12
GPU-based Fast Cone Beam CT Reconstruction from Undersampled and Noisy Projection Data via Total Variation
Purpose: Cone-beam CT (CBCT) plays an important role in image guided radiation therapy (IGRT). However, the large radiation dose from serial CBCT scans in most IGRT procedures raises a clinical concern, especially for pediatric patients who are essentially excluded from receiving IGRT for this reason. The goal of this work is to develop a fast […]
Nov, 12
Exploring the Limits of GPUs With Parallel Graph Algorithms
In this paper, we explore the limits of graphics processors (GPUs) for general-purpose parallel computing by studying problems that require highly irregular data access patterns: parallel graph algorithms for list ranking and connected components. Such graph problems represent a worst-case scenario for coalescing parallel memory accesses on GPUs, which is critical for good […]