30342

Characterizing the Performance of Parallel Data-Compression Algorithms across Compilers and GPUs

Brandon Alexander Burtchell, Martin Burtscher
Department of Computer Science, Texas State University, San Marcos, TX, USA
Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC Workshops ’25), 2025

@article{burtchell2025characterizing,

   title={Characterizing the Performance of Parallel Data-Compression Algorithms across Compilers and GPUs},

   author={Burtchell, Brandon Alexander and Burtscher, Martin},

   year={2025}

}

Download Download (PDF)   View View   Source Source   Source codes Source codes

Package:

91

views

Different compilers can generate code with notably different performance characteristics – even on the same system. Today, GPU developers have three popular options for compiling CUDA or HIP code for GPUs. First, CUDA code can be compiled by either NVCC or Clang for NVIDIA GPUs. Alternatively, AMD’s recently introduced HIP platform makes porting from CUDA to HIP relatively simple, enabling compilation for AMD and NVIDIA GPUs. This study compares the performance of 107,632 data-compression algorithms when compiling them with different compilers and running them on different GPUs from NVIDIA and AMD. We find that the relative performance of some of these codes changes significantly depending on the compiler and hardware used. For example, Clang tends to produce relatively slow compressors but relatively fast decompressors compared to NVCC and HIPCC.
No votes yet.
Please wait...

You must be logged in to post a comment.

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us: