5160

Posts

Aug, 8

AES finalists implementation for GPU and multi-core CPU based on OpenCL

Benefit from the OpenCL (Open Computing Language), applications can be easily transplanted among different GPUs, multi-core CPUs, and other processors. In this paper, we present implementation of AES finalists (Rijndael, Serpent, Twofish) in XTS mode, based on OpenCL. Benchmark testing is performed on 4 mainstream GPUs and multi-core CPUs. The results are also compared with […]
Jul, 29

A 3D radiative transfer framework: XIII. OpenCL implementation

We discuss an implementation of our 3D radiative transfer (3DRT) framework with the OpenCL paradigm for general GPU computing. We implement the kernel for solving the 3DRT problem in Cartesian coordinates with periodic boundary conditions in the horizontal $(x,y)$ plane, including the construction of the nearest neighbor $Lstar$ and the operator splitting step. We present […]
Jul, 22

Hybrid OpenCL: Enhancing OpenCL for Distributed Processing

We have been developing Hybrid OpenCL, which enables the utilization of OpenCL devices by connecting them over the network. Hybrid OpenCL opens a gate to scale up OpenCL environments. By using Hybrid OpenCL, applications written in OpenCL can be easily ported to high performance cluster computers, thus, Hybrid OpenCL can provide more various distributed and […]
Jul, 22

Parallel computing of 3D smoking simulation based on OpenCL heterogeneous platform

Open Computing Language (OpenCL) is an open royalty-free standard for general purpose parallel programming across Central Processing Units (CPUs), Graphic Processing Units (GPUs) and other processors. This paper introduces OpenCL to implement real-time smoking simulation in a virtual surgery training simulation system. Firstly, the Computational Fluid Dynamics (CFD) is adopted to construct the real-time smoking […]
Jul, 22

GLOpenCL: OpenCL support on hardware- and software-managed cache multicores

OpenCL is an industry supported standard for writing programs that execute on multicore platforms as well as on accelerators, such as GPUs or the SPEs of the Cell B.E. In this paper we introduce GLOpenCL, a unified development framework which supports OpenCL on both homogeneous, shared memory, as well as on heterogeneous, distributed memory multicores. […]
Jul, 14

ForOpenCL: Transformations Exploiting Array Syntax in Fortran for Accelerator Programming

Emerging GPU architectures for high performance computing are well suited to a data-parallel programming model. This paper presents preliminary work examining a programming methodology that provides Fortran programmers with access to these emerging systems. We use array constructs in Fortran to show how this infrequently exploited, standardized language feature is easily transformed to lower-level accelerator […]
Jul, 6

Flexible OpenCL accelerated disparity estimation for video communication applications

Due to widespread broadband connections in normal households, the use of video chats via Internet is no longer limited to business meetings. However, the camera configuration usually makes it impossible to achieve direct eye contact between the conversational partners. This effect can be compensated using virtual view synthesis methods based on disparity maps. The virtual […]
Jun, 21

GPU-based motion correction of contrast-enhanced liver MRI scans: An OpenCL implementation

Clinical diagnosis and quantification of liver disease have been improved through the development of techniques using contrast-enhanced liver MRI sequences. To qualitatively or quantitatively analyze such image sequences, one first needs to correct for rigid and non-rigid motion of the liver. For motion correction of the liver, we have employed bi-directional local correlation coefficient Demons, […]
Jun, 21

Performance evaluation of the multi-device OpenCL FDTD solver

We present results of an evaluation of a multi-device OpenCL FDTD solver. Portability between hardware manufactured by different vendors and also between highly specialized and parallel computing architectures available on the market, i.e. GPUs, multi-core CPUs and devices integrating both technologies in a single-die IC, is the main advantage of this solver. For code execution […]
May, 21

Synthesis of Platform Architectures from OpenCL Programs

The problem of automatically generating hardware modules from a high level representation of an application has been at the research forefront in the last few years. In this paper, we use OpenCL, an industry supported standard for writing programs that execute on multicore platforms and accelerators such as GPUs. Our architectural synthesis tool, SOpenCL (Silicon-OpenCL), […]
May, 12

OpenCL based machine learning labeling of biomedical datasets

In this paper, we propose a two-stage labeling method of large biomedical datasets through a parallel approach in a single GPU. Diagnostic methods, structures volume measurements, and visualization systems are of major importance for surgery planning, intra-operative imaging and image-guided surgery. In all cases, to provide an automatic and interactive method to label or to […]
May, 11

Gemma in April: A matrix-like parallel programming architecture on OpenCL

Nowadays, Graphics Processing Unit (GPU), as a kind of massive parallel processor, has been widely used in general purposed computing tasks. Although there have been mature development tools, it is not a trivial task for programmers to write GPU programs. Based on this consideration, we propose a novel parallel computing architecture. The architecture includes a […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: