Tags Results


Jul, 25

ParTeCL: parallel testing using OpenCL

With the growing complexity of software, the number of test cases needed for effective validation is extremely large. Executing these large test suites is expensive and time consuming, putting enormous pressure on the software development cycle. In previous work, we proposed using Graphics Processing Units (GPUs) to accelerate test execution by running test cases […]
Jul, 25

OpenCL Library for Parallel Graph Search Algorithms

Graphs are a popular data structure for representing large amounts of data and the relationships within them. As serial hardware hits the wall in terms of computation speed, a lot of recent research has gone into parallelizing Graph Search Algorithms such as Breadth First Search or the Single Source Shortest Path Problem hence make […]
Jul, 14

Multikernel Data Partitioning With Channel on OpenCL-Based FPGAs

Recently, field-programmable gate array (FPGA) vendors (such as Altera) have started to address the programmability issues of FPGAs via OpenCL SDKs. In this paper, we analyze the performance of relational database applications on FPGAs using OpenCL. In particular, we study how to improve the performance of data partitioning, which is a very important building block […]
Jul, 5

OpenCL-Based Implementation of an FPGA Accelerator for Molecular Dynamics Simulation

Molecular dynamics (MD) simulations are very important to study physical properties of the atoms and molecules. However, a huge amount of processing time is required to simulate a few nano-seconds of an actual experiment. Although the hardware acceleration using FPGAs provides promising results, huge design time and hardware design skills are required to implement an accelerator successfully. […]
Jun, 25

Scalar collapse in AdS with an OpenCL open source code

We study the spherically symmetric collapse of a scalar field in anti-de Sitter spacetime using a newly constructed, open-source code which parallelizes over heterogeneous architectures using the open standard OpenCL. An open question for this scenario concerns how to tell, a priori, whether some form of initial data will be stable or will instead develop […]
Jun, 17

Efficient OpenCL-based concurrent tasks offloading on accelerators

Current heterogeneous platforms with CPUs and accelerators have the ability to launch several independent tasks simultaneously, in order to exploit concurrency among them. These tasks typically consist of data transfer commands and kernel computation commands. In this paper we develop a runtime approach to optimize the concurrency between data transfers and kernel computation commands in […]
Jun, 10

Crane – Fast and Migratable GPU Passthrough for OpenCL applications

General purpose GPU (GPGPU) computing in virtualized environments leverages PCI passthrough to achieve GPU performance comparable to bare-metal execution. However, GPU passthrough prevents service administrators from performing virtual machine migration between physical hosts. Crane is a new technique for virtualizing OpenCL-based GPGPU computing that achieves within 5.25% of passthrough GPU performance while supporting VM migration. […]
Jun, 5

Program Acceleration in a Heterogeneous Computing Environment Using OpenCL, FPGA, and CPU

Reaching the so-called "performance wall" in 2004 inspired innovative approaches to performance improvement. Parallel programming, distributive computing, and System on a Chip (SOC) design drove change. Hardware acceleration in mainstream computing systems brought significant improvement in the performance of applications targeted directly to a specific hardware platform. Targeting a single hardware platform, however, typically requires […]
Jun, 5

UT-OCL: An OpenCL Framework for Embedded Systems Using Xilinx FPGAs

The number of heterogeneous components on a System-on-Chip (SoC) has continued to increase. Software developers leverage these heterogeneous systems by using high-level languages to enable the execution of applications. For the application to execute correctly, hardware support for features and constructs of the programming model needs to be incorporated into the system. OpenCL is a […]
May, 18

CLBlast: A Tuned OpenCL BLAS Library

This work demonstrates how to accelerate dense linear algebra computations using CLBlast, an open-source OpenCL BLAS library providing optimized routines for a wide variety of devices. It is targeted at machine learning and HPC applications and thus provides a fast matrix-multiplication routine (GEMM) to accelerate the core of many applications (e.g. deep learning, iterative solvers, […]
May, 11

Resource-Aware Just-in-Time OpenCL Compiler for Coarse-Grained FPGA Overlays

FPGA vendors have recently started focusing on OpenCL for FPGAs because of its ability to leverage the parallelism inherent to heterogeneous computing platforms. OpenCL allows programs running on a host computer to launch accelerator kernels which can be compiled at run-time for a specific architecture, thus enabling portability. However, the prohibitive compilation times (specifically the […]
Apr, 30

Adaptive Optimization for OpenCL Programs on Embedded Heterogeneous Systems

Heterogeneous multi-core architectures consisting of CPUs and GPUs are commonplace in today’s embedded systems. These architectures offer potential for energy efficient computing if the application task is mapped to the right core. Realizing such potential is challenging due to the complex and evolving nature of hardware and applications. This paper presents an automatic approach to […]

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors
