high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL

Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL

Alex Brooks, Philip Marshall, David Ozog, Md. Wasi-ur- Rahman, Lawrence Stewart, Rithwik Tom

Intel Corporation

arXiv:2409.20476 [cs.DC], (30 Sep 2024)

DOI:10.48550/arXiv.2409.20476

BibTeX

Download (PDF)

View

Source

Source codes

Package:

Intel® SHMEM: Device initiated shared memory based communication library

988

views

Modern high-end systems are increasingly becoming heterogeneous, providing users options to use general purpose Graphics Processing Units (GPU) and other accelerators for additional performance. High Performance Computing (HPC) and Artificial Intelligence (AI) applications are often carefully arranged to overlap communications and computation for increased efficiency on such platforms. This has led to efforts to extend popular communication libraries to support GPU awareness and more recently, GPU-initiated operations. In this paper, we present Intel SHMEM, a library that enables users to write programs that are GPU aware, in that API calls support GPU memory, and also support GPU-initiated communication operations by embedding OpenSHMEM style calls within GPU kernels. We also propose thread-collaborative extensions to the OpenSHMEM standard that can enable users to better exploit the strengths of GPUs. Our implementation adapts to choose between direct load/store from GPU and the GPU copy engine based transfer to optimize performance on different configurations.

Tags: Artificial intelligence, Computer science, Heterogeneous systems, Intel, Intel Data Center GPU Max 1550, oneAPI, Package, SYCL

October 6, 2024 by hgpu

No votes yet.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org

Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL

Package:

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Intel(R) SHMEM: GPU-initiated OpenSHMEM using SYCL

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)