Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search
III-LIDI, Facultad de Inform ́atica, 50 y 120, La Plata, 1900, Buenos Aires, Argentina
arXiv:2412.08308 [cs.DC], (11 Dec 2024)
@misc{costanzo2024analyzing,
title={Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search},
author={Manuel Costanzo and Enzo Rucci and Carlos García-Sánchez and Marcelo Naiouf and Manuel Prieto-Matías},
year={2024},
eprint={2412.08308},
archivePrefix={arXiv},
primaryClass={cs.DC}
}
The high-performance computing (HPC) landscape is undergoing rapid transformation, with an increasing emphasis on energy-efficient and heterogeneous computing environments. This comprehensive study extends our previous research on SYCL’s performance portability by evaluating its effectiveness across a broader spectrum of computing architectures, including CPUs, GPUs, and hybrid CPU-GPU configurations from NVIDIA, Intel, and AMD. Our analysis covers single-GPU, multi-GPU, single-CPU, and CPU-GPU hybrid setups, using the SW# protein database search application as a case study. The results demonstrate SYCL’s versatility across different architectures, maintaining comparable performance to CUDA on NVIDIA GPUs while achieving similar architectural efficiency rates on most CPU configurations. Although SYCL showed excellent functional portability in hybrid CPU-GPU configurations, performance varied significantly based on specific hardware combinations. Some performance limitations were identified in multi-GPU and CPU-GPU configurations, primarily attributed to workload distribution strategies rather than SYCL-specific constraints. These findings position SYCL as a promising unified programming model for heterogeneous computing environments, particularly for bioinformatic applications.
December 15, 2024 by hgpu