high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Development of High-Performance Software Components for Emerging Architectures

Development of High-Performance Software Components for Emerging Architectures

Stefan Lemvig Glimberg, Allan Peter Engsig-Karup, Allan S. Nielsen, Bernd Dammann

Technical University of Denmark

November 18, 2014

BibTeX

Download (PDF)

View

Source

Source codes

Package:

GLAS library

2797

views

Massively parallel processors, such as graphical processing units (GPUs), have in recent years proven to be effective for a vast amount of scientific appli- cations. Today, most desktop computers are equipped with one or more pow- erful GPUs, offering heterogeneous high-performance computing to a broad range of scientific researchers and software developers. Though GPUs are now programmable and can be highly effective compute units, they still pose challenges for software developers to fully utilize their efficiency. Sequential legacy codes are not always easily parallelized, and the time spent on conversion might not pay off in the end. This is particular true for heterogenous computers, where the architectural differences between the main- and co-processor can be so significant, that they call for completely different optimization strate- gies. The cache hierarchy management of CPUs and GPUs being an evident example hereof. In the past, industrial companies were able to boost application performance solely by upgrading their hardware systems, with an overt balance between investment and performance speedup. Today, the picture is different, not only do they have to invest in new hardware, but also account for the adaption and training of their software developers. What traditionally used to be a hardware problem, addressed by the chip manufacturers, has now become a software problem for application developers.
Software libraries can be a tremendous help for developers as they make it easier to implement an application, without having to know about the complexity of the underlying computer hardware, known as opacity [1]. The ul- timate goal for a successful library is to simplify the process of writing new software and thus to increase developer productivity. Since programmable heterogeneous CPU/GPU systems are a rather new phenomenon, there is yet a limited number of established software libraries that take full advantage of such heterogeneous high performance systems, and there are no de-facto design standards for such systems either. Some existing libraries for conventional ho- mogeneous systems have already added support for offloading computational intense operations onto co-processing GPUs. However, this approach comes with the cost of frequent memory transfers across the low bandwidth PCIe bus.

Tags: Computer science, CUDA, GPGPU, iterative methods, Mathematics, multigrid, Package, parallel computing, Partial differential equations, Software development

March 12, 2014 by apek

No votes yet.

Please wait...

* * *

high performance computing on graphics processing units: hgpu.org

Development of High-Performance Software Components for Emerging Architectures

Package:

Recent source codes

XaaS containers

microSYCL: SYCL micro-benchmarks repository

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

PELSI: Power-Efficient Layer-Switched Inference

Ouroboros: Virtualized Queues for dynamic memory management

Most viewed papers (last 30 days)

Development of High-Performance Software Components for Emerging Architectures

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)