high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Automatic code generation and tuning for stencil kernels on modern shared memory architectures

Automatic code generation and tuning for stencil kernels on modern shared memory architectures

Matthias Christen, Olaf Schenk, Helmar Burkhart

Department of Mathematics and Computer Science, University of Basel, Klingelbergstrasse 50, 4056 Basel, Switzerland

Computer Science – Research and Development, Volume 26, Numbers 3-4, pp. 205-210, 2011

DOI:10.1007/s00450-011-0160-6

BibTeX

Download (PDF)

View

Source

Source codes

Package:

PATUS

2081

views

In this paper, we present Patus, a code generation and auto-tuning framework for stencil computations targeted at multi- and manycore processors, such as multicore CPUs and graphics processing units. Patus, which stands for "Parallel Autotuned Stencils," generates a compute kernel from a specification of the stencil operation and a strategy which describes the parallelization and optimization to be applied, and leverages the autotuning methodology to optimize strategy-specific parameters for the given hardware architecture.

Tags: Code generation, Computer science, CUDA, nVidia, Package, Tesla C2050

December 18, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Automatic code generation and tuning for stencil kernels on modern shared memory architectures

Package:

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

Automatic code generation and tuning for stencil kernels on modern shared memory architectures

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)