Many-core parallel computing – Can compilers and tools do the heavy lifting?
FCRP GSRC, Illinois UPCRC, Illinois CUDA CoE, IACAT, IMPACT, University of Illinois at Urbana-Champaign, Urbana, IL, USA
IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2009
@inproceedings{hwu2009many,
  title={Many-core parallel computing -- Can compilers and tools do the heavy lifting?},
  author={Hwu, Wen-mei W.},
  booktitle={2009 IEEE International Symposium on Parallel \& Distributed Processing (IPDPS)},
  year={2009},
  publisher={IEEE}
}
Modern GPUs such as the NVIDIA GeForce GTX 280, the ATI Radeon 4860, and the upcoming Intel Larrabee are massively parallel, many-core processors. Today, application developers for these many-core chips are reporting 10X-100X speedups over sequential code on traditional microprocessors. According to the semiconductor industry roadmap, these processors could scale up to over 1,000X speedup over single cores by the end of 2016. Such a dramatic performance gap between parallel and sequential execution will motivate an increasing number of developers to parallelize their applications. Today, application programmers have to understand the desirable parallel programming idioms, manually work around potential hardware performance pitfalls, and restructure their application designs in order to achieve their performance objectives on many-core processors. Although many researchers have given up on parallelizing compilers, I will show evidence that, by systematically incorporating high-level application design knowledge into the source code, a new generation of compilers and tools can take over the heavy lifting in developing and tuning parallel applications. I will also discuss roadblocks whose removal will require innovations from the entire research community.
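The manual work the abstract describes, understanding parallel idioms and working around hardware pitfalls by hand, is concrete in practice. As a minimal sketch (not taken from the talk; the kernel names, the TILE size, and the matrix-multiplication example are illustrative assumptions), the CUDA fragment below contrasts a naive kernel with the hand-tiled, shared-memory version a programmer currently writes to sidestep the global-memory-bandwidth pitfall. It assumes n is a multiple of TILE and a launch configuration of dim3 block(TILE, TILE) and grid(n/TILE, n/TILE).

#define TILE 16  // illustrative tile size; assumes n is a multiple of TILE

// Naive version: every thread streams its operands from global memory,
// so the same elements of A and B are fetched from DRAM repeatedly
// across the thread block.
__global__ void matmul_naive(const float *A, const float *B, float *C, int n)
{
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < n && col < n) {
        float acc = 0.0f;
        for (int k = 0; k < n; ++k)
            acc += A[row * n + k] * B[k * n + col];
        C[row * n + col] = acc;
    }
}

// Hand-tuned version: the thread block cooperatively stages TILE x TILE
// sub-blocks in on-chip shared memory, cutting global-memory traffic by
// roughly a factor of TILE; this is the restructuring done by hand today.
__global__ void matmul_tiled(const float *A, const float *B, float *C, int n)
{
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];
    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;
    for (int t = 0; t < n / TILE; ++t) {
        As[threadIdx.y][threadIdx.x] = A[row * n + t * TILE + threadIdx.x];
        Bs[threadIdx.y][threadIdx.x] = B[(t * TILE + threadIdx.y) * n + col];
        __syncthreads();                 // wait until the tile is fully loaded
        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();                 // all threads done reading this tile
    }
    C[row * n + col] = acc;
}

Both kernels compute the same result; the thesis of the talk is that, given enough high-level design knowledge in the source, compilers and tools should be able to derive the second form from the first automatically.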
July 10, 2011 by hgpu