high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » BAT: A Benchmark suite for AutoTuners

BAT: A Benchmark suite for AutoTuners

Ingunn Sund, Knut A. Kirkhorn, Jacob O. Tørring, Anne C. Elster

Norwegian University of Science and Technology (NTNU), Trondheim, Norway

NIK Norsk informatikkonferanse , 2021

BibTeX

Download (PDF)

View

Source

Source codes

Package:

BAT: A standardized benchmark suite for auto-tuners

1346

views

An autotuner takes a parameterized code as input and tries to optimize the code by finding the best possible values for a given architecture. To our knowledge, there are currently no standardized benchmark suites for comparing and testing autotuners. Developers of autotuners thus make their own when presenting and comparing autotuners. We thus present BAT, a Benchmark suite for AutoTuners with HPCbased parameterized GPU programs. CUDA programs and kernels from "The Scalable Heterogeneous Computing (SHOC) Benchmark" are parameterized. BAT contains a varied selection of benchmarks of different complexity that can utilize multiple GPUs on one system, either by running the same program and computations on multiple nodes, or by splitting the work between nodes. BAT contains 9 different HPC benchmarks that provide a large search space of autotuning parameters, and are modified to suite many different autotuners. BAT also includes a CLI that facilitates autotuning with the benchmarks. Our benchmark suite is tested with four different autotuners, OpenTuner, Kernel Tuner, CLTune and KTT. They differ in setup and how they tune. The impact of the different benchmark parameters on the running time across architectures is analyzed. Test systems used include a DGX-2, IBM Power System AC922 with Tesla V100-SXM2 32 GB GPUs, an RTX Titan, a GeForce GTX 980 and a server with 20 Tesla T4 GPUs.

Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 980, Package, Performance, Tesla T4, Tesla V100

November 21, 2021 by hgpu

No votes yet.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org

BAT: A Benchmark suite for AutoTuners

Package:

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

BAT: A Benchmark suite for AutoTuners

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)