Application Performance Profiling on Intel GPUs with Oneprof and Onetrace
Intel Corp., USA
Proceedings of the SC ’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis (SC-W ’23), 2023
@inproceedings{umar2023pti,
title={PTI-GPU: Kernel Profiling and Assessment on Intel GPUs},
author={Umar, Mariam and Jong, Maxwell},
booktitle={Proceedings of the SC’23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis},
pages={681–684},
year={2023}
}
Modern supercomputing applications are complex programs built on optimized frameworks and accelerated on GPUs. As such, dedicated tools for profiling GPU kernel utilization and performance are needed to support development of these applications, which in turn accelerates progress for the scientific computing and machine learning communities. This paper presents the Oneprof and Onetrace tools from the Intel PTI-GPU framework. These tools are capable of profiling applications and different levels of the runtime stack executing on Intel GPUs. To demonstrate the features and utility of these tools, we examine one HPC and one AI application.
December 18, 2023 by hgpu