Framework for Parallel Kernels Auto-tuning

Filip Petrovič
Masaryk University, Faculty of Informatics
Masaryk University, 2018

The result of this thesis is a framework for auto-tuning parallel kernels written in either OpenCL or CUDA. The framework includes advanced functionality such as support for composite kernels and online auto-tuning. The thesis describes the API and internal structure of the framework and presents several examples of its use for kernel optimization.