1127

Posts

Oct, 28

GPU virtualization on VMware’s hosted I/O architecture

Modern graphics co-processors (GPUs) can produce high fidelity images several orders of magnitude faster than general purpose CPUs, and this performance expectation is rapidly becoming ubiquitous in personal computers. Despite this, GPU virtualization is a nascent field of research. This paper introduces a taxonomy of strategies for GPU virtualization and describes in detail the specific […]
Oct, 28

GPU accelerated image aligned splatting

Splatting is a popular technique for volume rendering, where voxels are represented by Gaussian kernels, whose pre-integrated footprints are accumulated to form the image. Splatting has been mainly used to render pre-shaded volumes, which can result in significant blurring in zoomed views. This can be avoided in the image-aligned splatting scheme, where one accumulates kernel […]
Oct, 28

GPU-accelerated large-scale quantum molecular dynamics simulation of 3-dimensional C60 polymers

Polymerization of C60 molecular crystal under high pressure and high temperature is simulated by using linear scaling tight binding molecular dynamics (TBMD) with Graphic Processing Unit (GPU) as a computational accelerator for matrix-matrix multiplication. Two sets of tight binding parameters were tested.
Oct, 28

An Implementation of the Smooth Particle Mesh Ewald Method on GPU Hardware

The smooth particle mesh Ewald summation method is widely used to efficiently compute long-range electrostatic force terms in molecular dynamics simulations, and there has been considerable work in developing optimized implementations for a variety of parallel computer architectures. We describe an implementation for Nvidia graphical processing units (GPUs) which are general purpose computing devices with […]
Oct, 28

Fuzzy ART Neural Network Parallel Computing on the GPU

Graphics Processing Units (GPUs) have evolved into powerful programmable processors, faster than Central Processing Units (CPUs) regarding the execution of parallel algorithms. In this paper, an implementation of a Fuzzy ART Neural Network on the GPU is presented. Experimental results show training process is slower on the GPU than on a dual-core Pentium 4 at […]
Oct, 28

GAMER: a GPU-Accelerated Adaptive Mesh Refinement Code for Astrophysics

We present the newly developed code, GAMER (GPU-accelerated Adaptive MEsh Refinement code), which has adopted a novel approach to improve the performance of adaptive mesh refinement (AMR) astrophysical simulations by a large factor with the use of the graphic processing unit (GPU). The AMR implementation is based on a hierarchy of grid patches with an […]
Oct, 28

High performance cellular level agent-based simulation with FLAME for the GPU

Driven by the availability of experimental data and ability to simulate a biological scale which is of immediate interest, the cellular scale is fast emerging as an ideal candidate for middle-out modelling. As with bottom-up’ simulation approaches, cellular level simulations demand a high degree of computational power, which in large-scale simulations can only be achieved […]
Oct, 28

CrystalGPU: Transparent and Efficient Utilization of GPU Power

General-purpose computing on graphics processing units (GPGPU) has recently gained considerable attention in various domains such as bioinformatics, databases and distributed computing. GPGPU is based on using the GPU as a co-processor accelerator to offload computationally-intensive tasks from the CPU. This study starts from the observation that a number of GPU features (such as overlapping […]
Oct, 28

Fast CGH computation using S-LUT on GPU

In computation of full-parallax computer-generated hologram (CGH), balance between speed and memory usage is always the core of algorithm development. To solve the speed problem of coherent ray trace (CRT) algorithm and memory problem of look-up table (LUT) algorithm without sacrificing reconstructed object quality, we develop a novel algorithm with split look-up tables (S-LUT) and […]
Oct, 28

All-pairs shortest-paths for large graphs on the GPU

The all-pairs shortest-path problem is an intricate part in numerous practical applications. We describe a shared memory cache efficient GPU implementation to solve transitive closure and the all-pairs shortest-path problem on directed graphs for large datasets. The proposed algorithmic design utilizes the resources available on the NVIDIA G80 GPU architecture using the CUDA API. Our […]
Oct, 28

An integrated GPU power and performance model

GPU architectures are increasingly important in the multi-core era due to their high number of parallel processors. Performance optimization for multi-core processors has been a challenge for programmers. Furthermore, optimizing for power consumption is even more difficult. Unfortunately, as a result of the high number of processors, the power consumption of many-core processors such as […]
Oct, 27

GPU as a General Purpose Computing Resource

In the last few years, GPUs(Graphics Processing Units) have made rapid development. Their ever-increasing computing power and decreasing cost have attracted attention from both industry and academia. In addition to graphics applications, researchers are interested in using them for general purpose computing. Recently, NVIDIA released a new computing architecture, CUDA (Compute Uni¿ed Device Architecture), for […]
Page 916 of 922« First...102030...914915916917918...Last »

* * *

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: