high performance computing on graphics processing units: hgpu.org

Posts

Dec, 6

GPU-Based Liquid Crystal Display Processing Platform

In the past decade, liquid crystal displays (LCDs) have taken over the television (TV) and monitor market from cathode ray tube (CRT) displays. Compared to CRT displays, LCDs offer larger screen sizes and higher resolution, and they are thinner, lighter, and more energy efficient. However, with respect to image quality, LCDs have not caught up to CRT displays in […]

CUDA

Dec, 6

Performance Analysis of GPU compared to Single-core and Multi-core CPU for Natural Language Applications

In Natural Language Processing (NLP) applications, the most time-consuming process is string matching, owing to the large size of the lexicon. String matching involves minimal data dependence and is therefore well suited to parallelization. A dedicated system with memory interleaving and parallel processing techniques for string matching can reduce this burden on the host CPU, […]

CUDA

Dec, 6

Real-time Terrain Modeling using CPU-GPU Coupled Computation

Motivated by the importance of real-time feedback in sketch-based modeling tools, we present a framework for terrain editing capable of generating and displaying complex, high-resolution terrains. Our system is fast enough to let the user see the terrain morph while the sketch is being edited. We have two […]

OpenGL

Dec, 6

Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results

GPUs offer several times the floating point performance and memory bandwidth of current standard two socket CPU servers, e.g. NVIDIA C2070 vs. Intel Xeon Westmere X5650. The lattice Boltzmann method has been established as a flow solver in recent years and was one of the first flow solvers to be successfully ported and that performs […]

CUDA • OpenCL

Dec, 6

A Fast 3D Spatial Analysis Technique Using Graphic Process Units

More and more 3D terrain information is converted and stored in digital format, and it has been used as a primary data source for a variety of spatial analyses in various application domains. One unsolved issue is that 3D spatial analyses (e.g., 3D ray tracing) often incur a huge computing cost in terms of processing […]

OpenCL

Dec, 5

Soft Error Resilient QR Factorization for Hybrid System with GPGPU

General-purpose graphics processing units (GPGPUs) are increasingly deployed for scientific computing due to their performance advantages over CPUs. As a result, fault tolerance has become a more serious concern than it was when GPGPUs were used exclusively for graphics applications. Using GPUs and CPUs together in a hybrid computing […]

CUDA

Dec, 5

Effectiveness of GPGPU for Solving the Magnetohydrodynamics Equations Using the CIP-MOCCT Method

A simple parallelization approach using general-purpose computation on graphics processing units (GPGPU) was applied to solving the MHD equations with the CIP-MOCCT method. We investigated the efficiency of this approach and found that the computational speed of the modified code improved significantly despite the simplicity of the modification.

CUDA

Dec, 5

GMM based Fisher vector calculation on GPGPU

We describe an accurate yet very fast implementation of a visual word generation method by using general purpose graphical processors (GPUs). Visual words have recently proved to be a key tool in image classification. Best performing Pascal VOC and ImageCLEF systems use Gaussian mixtures or k-means clustering to define visual words based on the content-based […]

CUDA

Dec, 5

Parallel Quadtree Coding of Large-Scale Raster Geospatial Data on GPGPUs

Global remote sensing and large-scale environmental modeling have generated huge amounts of raster geospatial data. While the inherent data parallelism of large-scale raster geospatial data allows straightforward coarse-grained parallelization at the chunk level on CPUs, it is largely unclear how to effectively exploit such data parallelism on massively parallel General Purpose Graphics Processing Units (GPGPUs) […]

CUDA

Dec, 5

Real-Time Handling of GPU Interrupts in LITMUS RT

Graphics processing units (GPUs) are becoming increasingly important in today’s platforms, as their increased generality allows them to be used as powerful co-processors. However, unlike standard CPUs, GPUs are treated as I/O devices and require the use of interrupts to facilitate communication with the CPU. Interrupts cause delays in the execution of real-time tasks, […]

CUDA

Dec, 5

A dynamic scheduling runtime and tuning system for heterogeneous multi and many-core desktop platforms

A modern personal computer can now be considered a one-node heterogeneous cluster that simultaneously processes tasks from several applications. It can be composed of asymmetric Processing Units (PUs), such as the multi-core Central Processing Unit (CPU), the many-core Graphics Processing Units (GPUs) – which have become one of the main co-processors that contributed towards high performance […]

CUDA • OpenCL

Dec, 5

GPU-Euler: Sequence Assembly Using GPGPU

Advances in sequencing technologies have revolutionized the field of genomics by providing cost-effective, high-throughput solutions. In this paper, we develop a parallel sequence assembler implemented on general purpose graphic processor units (GPUs). Our work was largely motivated by a growing need in the genomic community for sequence assemblers and increasing use of […]

CUDA

