high performance computing on graphics processing units: hgpu.org

Auto-tuning on the macro scale: high level algorithmic auto-tuning for scientific applications

Cy P Chan

Dept. of Electrical Engineering and Computer Science, Massachusetts Institute of Technology

Massachusetts Institute of Technology, 2012

@phdthesis{chan2012auto,
  title={Auto-tuning on the macro scale: high level algorithmic auto-tuning for scientific applications},
  author={Chan, C.P.},
  year={2012},
  school={Massachusetts Institute of Technology}
}

In this thesis, we describe a new classification of auto-tuning methodologies that spans a spectrum from low-level optimizations (such as block sizes, iteration ordering, and vectorization) to high-level algorithmic choices (such as whether to use an iterative or a direct solver). We present and analyze four novel auto-tuning systems whose techniques fall at different points along this spectrum: i) a multiplatform, auto-tuning parallel code generation framework for generalized stencil loops, ii) an auto-tunable algorithm for solving dense triangular systems, iii) an auto-tunable multigrid solver for sparse linear systems, and iv) tuned statistical regression techniques for fine-tuning wind forecasts and resource estimates to assist in the integration of wind resources into the electrical grid. We also include a project assessment report for a wind turbine installation for the City of Cambridge to highlight an application area (wind prediction and resource assessment) where these computational auto-tuning techniques could prove useful in the future.
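To make the high-level end of that spectrum concrete, the following is a minimal sketch of algorithmic auto-tuning: it times a direct solver and an iterative solver on a sample problem and selects the faster one. It is not taken from the thesis. The function names (`solve_direct`, `solve_jacobi`, `autotune`, `make_system`), the tridiagonal test matrix, and the best-of-N wall-clock selection rule are all assumptions chosen for illustration.

```python
import time


def solve_direct(A, b):
    """Direct solver: Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the caller's data is not modified.
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def solve_jacobi(A, b, tol=1e-10, max_iter=10_000):
    """Iterative solver: Jacobi iteration (assumes A is strictly diagonally dominant)."""
    n = len(b)
    x = [0.0] * n
    for _ in range(max_iter):
        x_new = [
            (b[i] - sum(A[i][j] * x[j] for j in range(n) if j != i)) / A[i][i]
            for i in range(n)
        ]
        if max(abs(x_new[i] - x[i]) for i in range(n)) < tol:
            return x_new
        x = x_new
    return x


def autotune(candidates, A, b, repeats=3):
    """Time every candidate (best of `repeats` runs) and return the fastest name."""
    timings = {}
    for name, solver in candidates.items():
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            solver(A, b)
            best = min(best, time.perf_counter() - start)
        timings[name] = best
    winner = min(timings, key=timings.get)
    return winner, timings


def make_system(n):
    """Hypothetical workload: tridiagonal, strictly diagonally dominant matrix."""
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        A[i][i] = 4.0
        if i > 0:
            A[i][i - 1] = -1.0
        if i < n - 1:
            A[i][i + 1] = -1.0
    return A, [1.0] * n


if __name__ == "__main__":
    A, b = make_system(50)
    candidates = {"direct": solve_direct, "jacobi": solve_jacobi}
    winner, timings = autotune(candidates, A, b)
    print("selected:", winner, timings)
```

Which solver wins depends on the problem size, the matrix structure, and the machine, which is the reason to measure rather than hard-code the choice. A low-level tuner would search parameters such as block sizes in the same way, only within a single algorithm.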

Tags: Algorithms, Code generation, Computer science, nVidia, nVidia GeForce GTX 280, Optimization, Thesis

November 23, 2012 by hgpu
