11397

Posts

Feb, 15

Accelerator Aware MPI Micro-benchmarking using CUDA, OpenACC and OpenCL

Recently MPI implementations have been extended to support accelerator devices, Intel Many Integrated Core (MIC) and nVidia GPU. This has been accomplished by changes to different levels of the software stacks and MPI implementations. In order to evaluate performance and scalability of accelerator aware MPI libraries, we developed portable micro-benchmarks to identify factors that influence […]
Jan, 23

On the Portability of the OpenCL Dwarfs on Fixed and Reconfigurable Parallel Platforms

The proliferation of heterogeneous computing systems presents the parallel computing community with the challenge of porting legacy and emerging applications to multiple processors with diverse programming abstractions. OpenCL is a vendor-agnostic and industry-supported programming model that offers code portability on heterogeneous platforms, allowing applications to be developed once and deployed "anywhere". In this paper, we […]
Jan, 23

clpeak – peak performance of your opencl device

clpeak is a benchmarking tool intended toward developers to fine-tune opencl kernels for a particular device/class of device. It calculates bandwidth & compute performance for different vector-widths of a datatype, say float, float4. Traditionally it is recommended to use scalar code and we expect opencl compiler to auto-vectorize it. But, most of the times compiler […]
Jan, 16

Improving Student Learning in Computer Science Courses by Using Virtual OpenCL Laboratory

Laboratory experience is an essential factor for engineering and science education. Virtual laboratories are widely used by universities and research institutions in various kinds of academic sectors. However, general virtual laboratories always have some weakness for computer graphics which its experiment needs to be done in high performance computers. In the assessment of a graduate […]
Jan, 12

Importance-Driven Isosurface Decimation for Visualization of Large Simulation Data Based on OpenCL

For large simulation data, Parallel Marching Cubes algorithm is efficient and commonly used to extract isosurfaces in 3D scalar field. However, the isosurface meshes are sometimes too dense and it is difficult for scientists to specify the areas they are interested in. In this paper, we provide them a new way to define mesh importance […]
Dec, 27

OpenCL Programming by Example

This book follows an example-driven, simplified, and practical approach to using OpenCL for general purpose GPU programming. If you are a beginner in parallel programming and would like to quickly accelerate your algorithms using OpenCL, this book is perfect for you! You will find the diverse topics and case studies in this book interesting and […]
Dec, 24

Scene Boundary Detection Technique Based on Bottom-Up Attention System and OpenCL Parallel Implementation

This paper spotlights the maintaining of scene boundary detection system in video and process of porting it to the OpenCL. The scene boundary detection algorithm proposed by authors is based on bottom-up focus attention principle. The system builds Gaussian pyramids from input image, calculates map of saliency from the image and then detects the most […]
Dec, 22

Numerical Simulation for the MHD System in 2D Using OpenCL

In this work we compute the MHD equations with divergence cleaning on GPU. The method is based on the finite volume approach and Strang dimensional splitting. The simplicity of the approach makes it a good candidate for a GPU implementation with OpenCL. With adequate memory optimization access, we achieve very high speedups, compared to a […]
Dec, 17

OpenCL Accelerated Multi-GPU Cone-Beam Reconstruction

Volume reconstruction in cone-beam CT is a computationally demanding task. Since recent years, the reconstruction is accelerated by utilizing Graphics Processing Units (GPUs). Frameworks for General Purpose Computations on GPUs are proven tool to access the resources of graphics cards. WIth the Open Computing Language (OpenCL) the first open standard for cross-vendor and cross-platform programming […]
Dec, 15

OpenCL-Accelerated Computation of a 3D SPECT Projection Operator for the Content Adaptive Mesh Model

In this manuscript, we present a preliminary evaluation of a fully 3D projection operator calculation aimed at emission tomography on a non-circular orbit. The proposed methodology uses the content-adaptive mesh model (CAMM) for volumetric data representation. The CAMM is an efficient data representation based on adaptive non-uniform sampling and linear interpolation. The presented projection operator […]
Dec, 9

The Comparisons of OpenCL and OpenMP Computing Paradigm

Graphics processing units (GPUs) in a computer system are increasingly adopted to create a realistic environment in 3D applications. Despite the extremely high parallelism of these devices with a tremendous amount of processing elements, GPUs are seldom used in scientific applications owing to their difficulty in programming. Additionally, vendor-provided APIs are too specific to cross […]
Dec, 4

Computing OpenSURF on OpenCL and General Purpose GPU

Speeded-Up Robust Feature (SURF) algorithm is widely used for image feature detecting and matching in computer vision area. Open Computing Language (OpenCL) is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors. This paper introduces how to implement an open-sourced SURF program, namely OpenSURF, on general purpose […]
Page 20 of 102« First...10...1819202122...304050...Last »

* * *

* * *

HGPU group © 2010-2018 hgpu.org

All rights belong to the respective authors

Contact us: