7534

One Stone Two Birds: Synchronization Relaxation and Redundancy Removal in GPU-CPU Translation

Ziyu Guo, Bo Wu, Xipeng Shen
Qualcomm CDMA Technologies, San Diego, CA, USA
ACM International Conference on Supercomputing, 2012

@article{guo2012one,

   title={One Stone Two Birds: Synchronization Relaxation and Redundancy Removal in GPU-CPU Translation},

   author={Guo, Z. and Wu, B. and Shen, X.},

   year={2012}

}

Download Download (PDF)   View View   Source Source   

1549

views

As an approach to promoting whole-system synergy on a heterogeneous computing system, compilation of fine-grained SPMD-threaded code (e.g., GPU CUDA code) for multicore CPU has drawn some recent attentions. This paper concentrates on two important sources of inefficiency that limit existing translators. The first is overly strong synchronizations; the second is thread-level partially redundant computations. In this paper, we point out that both kinds of inefficiency essentially come from a single reason: the nonuniformity among threads. Based on that observation, we present a thread-level dependence analysis, which leads to a code generator with three novel features: an instance-level instruction scheduler for synchronization relaxation, a graph pattern recognition scheme for code shape optimization, and a fine-grained analysis for thread-level partial redundancy removal. Experiments show that the unified solution is effective in resolving both inefficiencies, yielding speedup as much as a factor of 14.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: