9560

Posts

May, 25

Stencil and Lattice Structures for Field Equation Model Simulations on GPUs

Field equations can be numerically simulated by approximating a continuous space field by a discrete lattice. There are a number of different lattice geometries that can be used to approximate continuous space which may cause structural artefacts in the simulation. These different lattice structures require the use of different stencil operators to approximate the spatial […]
May, 25

A Tuned, Concurrent-Kernel Approach to Speed Up the APSP Problem

The All-Pair Shortest-Path (APSP) problem is a well-known problem in graph theory whose objective is to find the shortest paths between any pair of nodes. Computing the distances from one source node to the rest and repeating this process for every node of the graph is an adequate solution for sparse graphs. During the last […]
May, 25

OpenCV – Accelerated Computer Vision using GPUs (webinar)

OpenCV (Open Source Computer Vision Library: http://opencv.willowgarage.com/wiki/) is an open-source BSD-licensed library that includes several hundreds of computer vision algorithms. In this webinar, learn how this powerful library has been accelerated using CUDA on NVIDIA GPUs. Find out more about OpenCV GPU computational capabilities at: http://docs.opencv.org/modules/gpu/doc/introduction.html Presented by Shalini Gupta, Senior Mobile Computer Vision Engineer, […]
May, 25

On-Demand Generating and Scheduling Optimised Parallel Applications on Heterogeneous Platforms

Scheduling applications tasks across heterogeneous clusters is a growing problem, particularly when new upgraded components are added to a parallel computing system that may have originally been homogeneous. We describe how automatic and just-in-time source code generation techniques can be used to make the best parallel decomposition for whatever resource is available in a heterogeneous […]
May, 25

OpenCL Performance Evaluation on Modern Multi Core CPUs

Utilizing heterogeneous platforms for computation has become a general trend making the portability issue important. OpenCL (Open Computing Language) serves the purpose by enabling portable execution on heterogeneous architectures. However, unpredictable performance variation on different platforms has become a burden for programmers who write OpenCL programs. This is especially true for conventional multicore CPUs, since […]
May, 25

Quantifying the Energy Efficiency of FFT on Heterogeneous Platforms

Heterogeneous computing using Graphic Processing Units (GPUs) has become an attractive computing model given the available scale of data-parallel performance and programming standards such as OpenCL. However, given the energy issues present with GPUs, some devices can exhaust power budgets quickly. Better solutions are needed to effectively exploit the power efficiency available on heterogeneous systems. […]
May, 25

GEMTC: GPU Enabled Many-Task Computing

Current software and hardware limitations prevent Many-Task Computing (MTC) workloads from leveraging hardware accelerators (NVIDIA GPUs, Intel Xeon Phi) boasting Many-Core Computing architectures. Some broad application classes that fit the MTC paradigm are workflows, MapReduce, high-throughput computing, and a subset of high-performance computing. MTC emphasizes using many computing resources over short periods of time to […]
May, 25

JPEG-GPU:: a GPGPU Implementation of JPEG Core Coding Systems

JPEG is a commonly used method of lossy compression for digital photography (image). This work targets on accelerating JPEG’s compressor and decompressor with GPU. Though the final results are not promising, I would like to introduce the lessons I have learned in accelerating a general system with GPGPU.
May, 23

Graphics Processing Unit (GPU) Implementation Methodology of AERMOD Model

Air pollution is one of the major problems the world is facing today. Air pollution is caused due to release of dangerous chemical substances such as carbon monoxide, CFC (Chlorofluorocarbon), carbon dioxide, hydro carbon, sulfur dioxide, etc. in to the atmosphere. These substances are produced by various anthropological activities such as usage of vehicles, factory […]
May, 23

Composing multiple StarPU applications over heterogeneous machines: a supervised approach

Enabling HPC applications to perform efficiently when invoking multiple parallel libraries simultaneously is a great challenge. Even if a single runtime system is used underneath, scheduling tasks or threads coming from different libraries over the same set of hardware resources introduces many issues, such as resource oversubscription, undesirable cache flushes or memory bus contention. This […]
May, 23

Sequential Consistency for Heterogeneous-Race-Free: Programmer-centric Memory Models for Heterogeneous Platforms

Hardware vendors now provide heterogeneous platforms in commodity markets (e.g., integrated CPUs and GPUs), and are promising an integrated, shared memory address space for such platforms in future iterations. Because not all threads in a heterogeneous platform can communicate with the same latency, vendors are proposing synchronization mechanisms that allow threads to communicate with a […]
May, 23

Surface Reconstruction from Scattered Point via RBF Interpolation on GPU

In this paper we describe a parallel implicit method based on radial basis functions (RBF) for surface reconstruction. The applicability of RBF methods is hindered by its computational demand, that requires the solution of linear systems of size equal to the number of data points. Our reconstruction implementation relies on parallel scientific libraries and is […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: