1576

Posts

Nov, 13

Stellar-mass black holes in star clusters: implications for gravitational wave radiation

We study the dynamics of stellar-mass black holes (BH) in star clusters with particular attention to the formation of BH-BH binaries, which are interesting as sources of gravitational waves (GW). We examine the properties of these BH-BH binaries through direct N-body simulations of star clusters using the GPU-enabled NBODY6 code. We perform simulations of N
Nov, 13

TWQCD’s dynamical DWF project

We present an overview of our project of simulation of unquenched lattice QCD with optimal domain-wall quarks, using a GPU cluster currently constituting of 16 units of Nvidia Tesla S1070 plus 64 graphic cards with Nvidia GTX285 (total 128 GPUs with 128 Teraflops peak), attaining sustained computing power of 15.36 Teraflops. The first production run […]
Nov, 13

A Markovian event-based framework for stochastic spiking neural networks

In this article we introduce and study a mathematical framework for characterizing and simulating networks of noisy integrate-and-fire neurons based on the spike times. We show that the firing times of the neurons in the networks constitute a Markov chain, whose transition probability is related to the probability distribution of the interspike interval of the […]
Nov, 13

Enhanced molecular dynamics performance with a programmable graphics processor

Design considerations for molecular dynamics algorithms capable of taking advantage of the computational power of a graphics processing unit (GPU) are described. Accommodating the constraints of scalable streaming-multiprocessor hardware necessitates a reformulation of the underlying algorithm. Performance measurements demonstrate the considerable benefit and cost-effectiveness of such an approach, which produces a factor of 2.5 speed […]
Nov, 13

CUDAEASY – a GPU Accelerated Cosmological Lattice Program

This paper presents, to the author’s knowledge, the first graphics processing unit (GPU) accelerated program that solves the evolution of interacting scalar fields in an expanding universe. We present the implementation in NVIDIA’s Compute Unified Device Architecture (CUDA) and compare the performance to other similar programs in chaotic inflation models. We report speedups between one […]
Nov, 13

The role of multigrid algorithms for LQCD

We report on the first successful QCD multigrid algorithm which demonstrates constant convergence rates independent of quark mass and lattice volume for the Wilson Dirac operator. The new ingredient is the adaptive method for constructing the near null space on which the coarse grid multigrid Dirac operator acts. In addition we speculate on future prospects […]
Nov, 13

QCD on GPUs: cost effective supercomputing

The exponential growth of floating point power in graphics processing units (GPUs), together with their low cost, has given rise to an attractive platform upon which to deploy lattice QCD calculations. GPUs are essentially many (O(100)) core chips, that are programmed using a massively threaded environment, and so are representative of the future of high […]
Nov, 13

Air pollution modelling using a graphics processing unit with CUDA

The Graphics Processing Unit (GPU) is a powerful tool for parallel computing. In the past years the performance and capabilities of GPUs have increased, and the Compute Unified Device Architecture (CUDA) – a parallel computing architecture – has been developed by NVIDIA to utilize this performance in general purpose computations. Here we show for the […]
Nov, 13

GPU-based Fast Low-dose Cone Beam CT Reconstruction via Total Variation

Cone-beam CT (CBCT) has been widely used in image guided radiation therapy (IGRT) to acquire updated volumetric anatomical information before treatment fractions for accurate patient alignment purpose. However, the excessive x-ray imaging dose from serial CBCT scans raises a clinical concern in most IGRT procedures. The excessive imaging dose can be effectively reduced by reducing […]
Nov, 13

Tiling for Performance Tuning on Different Models of GPUs

The strategy of using CUDA-compatible GPUs as a parallel computation solution to improve the performance of programs has been more and more widely approved during the last two years since the CUDA platform was released. Its benefit extends from the graphic domain to many other computationally intensive domains. Tiling, as the most general and important […]
Nov, 12

Advanced Architectures for Astrophysical Supercomputing

Astronomers have come to rely on the increasing performance of computers to reduce, analyze, simulate and visualize their data. In this environment, faster computation can mean more science outcomes or the opening up of new parameter spaces for investigation. If we are to avoid major issues when implementing codes on advanced architectures, it is important […]
Nov, 12

RapidMind: Portability across Architectures and its Limitations

Recently, hybrid architectures using accelerators like GPGPUs or the Cell processor have gained much interest in the HPC community. The RapidMind Multi-Core Development Platform is a programming environment that allows generating code which is able to seamlessly run on hardware accelerators like GPUs or the Cell processor and multicore CPUs both from AMD and Intel. […]

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org