2305

Posts

Dec, 21

GPU Acceleration of Iterative Clustering

Iterative clustering algorithms based on Lloyds algorithm (often referred to as the k-means algorithm) have been used in a wide variety of areas, including graphics, computer vision, signal processing, compression, and computational geometry. We describe a method for accelerating many variants of iterative clustering by using programmable graphics hardware to perform the most computationally expensive […]
Dec, 21

Interactive visibility culling in complex environments using occlusion-switches

We present occlusion-switches for interactive visibility culling in complex 3D environments. An occlusion-switch consists of two GPUs (graphics processing units) and each GPU is used to either compute an occlusion representation or cull away primitives not visible from the current viewpoint. Moreover, we switch the roles of each GPU between successive frames. The visible primitives […]
Dec, 21

Fast computation of general Fourier Transforms on GPUS

We present an implementation of general FFTs for graphics processing units (GPUs). Unlike most existing GPU FFT implementations, we handle both complex and real data of any size that can fit in a texture. The basic building block for our algorithms is a radix-2 Stockham formulation of the FFT for power-of-two data sizes that avoids […]
Dec, 21

GPU-based parallel particle swarm optimization

A novel parallel approach to run standard particle swarm optimization (SPSO) on Graphic Processing Unit (GPU) is presented in this paper. By using the general-purpose computing ability of GPU and based on the software platform of Compute Unified Device Architecture (CUDA) from NVIDIA, SPSO can be executed in parallel on GPU. Experiments are conducted by […]
Dec, 21

Frequent itemset mining on graphics processors

We present two efficient Apriori implementations of Frequent Itemset Mining (FIM) that utilize new-generation graphics processing units (GPUs). Our implementations take advantage of the GPU’s massively multi-threaded SIMD (Single Instruction, Multiple Data) architecture. Both implementations employ a bitmap data structure to exploit the GPU’s SIMD parallelism and to accelerate the frequency counting operation. One implementation […]
Dec, 21

Performance analysis of accelerated image registration using GPGPU

This paper presents a performance analysis of an accelerated 2-D rigid image registration implementation that employs the Compute Unified Device Architecture (CUDA) programming environment to take advantage of the parallel processing capabilities of NVIDIA’s Tesla C870 GPU. We explain the underlying structure of the GPU implementation and compare its performance and accuracy against a fast […]
Dec, 21

Demystifying GPU microarchitecture through microbenchmarking

Graphics processors (GPU) offer the promise of more than an order of magnitude speedup over conventional processors for certain non-graphics computations. Because the GPU is often presented as a C-like abstraction (e.g., Nvidia’s CUDA), little is known about the characteristics of the GPU’s architecture beyond what the manufacturer has documented. This work develops a microbechmark […]
Dec, 21

Practical Symmetric Key Cryptography on Modern Graphics Hardware

Graphics processors are continuing their trend of vastly outperforming CPUs while becoming more general purpose. The latest generation of graphics processors have introduced the ability handle integers natively. This has increased the GPU’s applicability to many fields, especially cryptography. This paper presents an application oriented approach to block cipher processing on GPUs. A new block […]
Dec, 21

Dust-Dust Collisional Charging and Lightning in Protoplanetary Discs

We study the role of dust-dust collisional charging in protoplanetary discs. We show that dust-dust collisional charging becomes an important process in determining the charge state of dust and gas, if there is dust enhancement and/or dust is fluffy, so that dust surface area per disc volume is locally increased. We solve the charge equilibrium […]
Dec, 21

GPU-based ultra fast IMRT plan optimization

The widespread adoption of on-board volumetric imaging in cancer radiotherapy has stimulated research efforts to develop online adaptive radiotherapy techniques to handle the inter-fraction variation of the patient’s geometry. Such efforts face major technical challenges to perform treatment planning in real-time. To overcome this challenge, we are developing a supercomputing online re-planning environment (SCORE) at […]
Dec, 21

DSPSR: Digital Signal Processing Software for Pulsar Astronomy

DSPSR is a high-performance, open-source, object-oriented, digital signal processing software library and application suite for use in radio pulsar astronomy. Written primarily in C++, the library implements an extensive range of modular algorithms that can optionally exploit both multiple-core processors and general-purpose graphics processing units. After over a decade of research and development, DSPSR is […]
Dec, 21

Efficiency of the energy transfer in the Fenna-Matthews-Olson complex using hierarchical equations on graphics processing units

We study the energy transfer in light-harvesting complexes (LHC) and the importance of quantum coherence and the backaction of the molecular environment on the energy flow. We calculate the energy-transfer efficiency and the trapping time for the Fenna-Matthews-Olson (FMO) complex within the exact hierarchical approach proposed by Ishizaki and Fleming (J. Chem. Phys. vol 130, […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: