Posts
Jun, 5
Exploiting Computing Power on Graphics Processing Unit
With recent technological advances, graphics processing units (GPUs) are providing increasingly higher performance with improved programmability. This paper investigates NVIDIA's CUDA technology, which enables a data mining algorithm to be parallelized effectively on the GPU. The proposed algorithm exploits the computational power and the memory hierarchy of GPUs, using the shared memory to store frequently accessed data. Experimental […]
Jun, 5
Large neighborhood local search optimization on graphics processing units
Local search (LS) algorithms are among the most powerful techniques for solving computationally hard problems in combinatorial optimization. These algorithms can be viewed as "walks through neighborhoods" where the walks are performed by iterative procedures that move from one solution to another in the solution space. In these heuristics, designing operators to […]
Jun, 5
Employ Bump Mapping to Enrich the 3D NPR Image
This paper presents a simple approach for adding more detail to non-photorealistic images without increasing the complexity of the model. The bump mapping technique is introduced to create details that vary automatically with changes in light and view direction, but this technique originally aims to make a rendered surface look more realistic, so […]
Jun, 5
How to Render FDTD Computations More Effective Using a Graphics Accelerator
For years, graphics processing units (GPUs) have been dedicated mostly to real-time rendering. Recently, leading GPU manufacturers have extended their research area and decided to also support graphics computing. In this paper, we describe the impact of new GPU features on the development process of an efficient finite difference time domain (FDTD) implementation.
Jun, 5
Interactive Point Clouds Fairing on Many-Core System
This paper proposes an interactive point cloud fairing algorithm running on a many-core system. The algorithm is composed of four steps. First, a k-nearest-neighbor search method is designed that fully utilizes the computing ability of the GPU. Second, a parallel Gaussian-weighted normal estimation is put forward. Third, a weighted fairing method is proposed […]
Jun, 5
I3DC: Interactive Three-Dimensional Cubes
We present the I3DC system prototype, which constructs the cube for tens of millions of data items within milliseconds, and provides high-quality cube visualization as well as highly-interactive OLAP operations. Our approach is based on a novel blending-as-aggregation (BAA) algorithm that maps distributive OLAP aggregations to the intrinsic rendering mechanisms of the GPU. Our system […]
Jun, 5
Simulating anomalous diffusion on graphics processing units
The computational power of modern graphics processing units (GPUs) has become an interesting alternative in high performance computing. The specialized hardware of GPUs delivers a high degree of parallelism and performance. Various applications in scientific computing have been implemented such that computationally intensive parts are executed on GPUs. In this article, we present a GPU […]
Jun, 5
Implementation of association rule mining using CUDA
The purpose of this paper is to implement an association rule mining algorithm using the Nvidia CUDA framework for general-purpose computing on GPUs. The major objective is to compare the performance of the association rule mining algorithm in a C-based implementation on Intel Quad Core/Core2 Duo CPUs with a CUDA-based implementation on Nvidia G80 and GTX 200 […]
Jun, 5
A Study on Parallel Imaging Algorithm of 3D Geological Data
This paper discusses the three-dimensional visualization of geological data using the ray-casting algorithm, which can display the internal structure of a geological body in detail. An improved scheme that transforms many matrix multiplications into a few vector additions is developed to accelerate the ray-casting algorithm for imaging huge amounts of three-dimensional geological […]
Jun, 5
Efficient Embarrassingly Parallel on Graphics Processor Unit
Embarrassingly Parallel (EP) is one kernel benchmark of the NAS Parallel Benchmarks (NPB), a set of programs designed to help evaluate the performance of parallel supercomputers. In the EP benchmark, two-dimensional statistics are accumulated from a large number of Gaussian pseudo-random numbers, which are produced by a Linear Congruential Generator (LCG). In this paper, we […]
Jun, 5
Accelerating Unstructured Mesh Computational Fluid Dynamics on the NVidia Tesla GPU Architecture
This report presents steps towards accelerating Fluidity, a general-purpose computational fluid dynamics package. One portion of the code, an iterative solver, is targeted for optimisation by using Graphics Processing Units (GPUs) to perform computations. A literature survey which examines the performance issues of iterative solvers and optimisations which may overcome these issues on classical and […]
Jun, 5
Performance Analysis of the OP2 Framework on Many-core Architectures
We present a performance analysis and benchmarking study of the OP2 "active" library, which provides an abstraction framework for the solution of parallel unstructured mesh applications. OP2 aims to decouple the scientific specification of the application from its parallel implementation, achieving code longevity and near-optimal performance through re-targeting the back-end to different hardware. Runtime performance […]