Optimization of Pattern Matching Algorithms for Multi- and Many-Core Platforms

hgpu.org » Programming » Algorithms » Optimization of Pattern Matching Algorithms for Multi- and Many-Core Platforms

Optimization of Pattern Matching Algorithms for Multi- and Many-Core Platforms

Pedro Miguel Marques Pereira

Polytechnic Institute of Leiria

Polytechnic Institute of Leiria, 2016

@phdthesis{pereira2016optimization,

title={Optimization of Pattern Matching Algorithms for Multi-and Many-Core Platforms},

author={Pereira, Pedro Miguel Marques},

year={2016}

}

Download (PDF)

View

Source

2450

views

Image and video compression play a major role in the world today, allowing the storage and transmission of large multimedia content volumes. However, the processing of this information requires high computational resources, hence the improvement of the computational performance of these compression algorithms is very important. The Multidimensional Multiscale Parser (MMP) is a pattern-matching-based compression algorithm for multimedia contents, namely images, achieving high compression ratios, maintaining good image quality, Rodrigues et al. [2008]. However, in comparison with other existing algorithms, this algorithm takes some time to execute. Therefore, two parallel implementations for GPUs were proposed by Ribeiro [2016] and Silva [2015] in CUDA and OpenCL-GPU, respectively. In this dissertation, to complement the referred work, we propose two parallel versions that run the MMP algorithm in CPU: one resorting to OpenMP and another that converts the existing OpenCL-GPU into OpenCL-CPU. The proposed solutions are able to improve the computational performance of MMP by 3x and 2.7x, respectively. The High Efficiency Video Coding (HEVC/H.265) is the most recent standard for compression of image and video. Its impressive compression performance, makes it a target for many adaptations, particularly for holoscopic image/video processing (or light field). Some of the proposed modifications to encode this new multimedia content are based on geometry-based disparity compensations (SS), developed by Conti et al. [2014], and a Geometric Transformations (GT) module, proposed by Monteiro et al. [2015]. These compression algorithms for holoscopic images based on HEVC present an implementation of specific search for similar micro-images that is more efficient than the one performed by HEVC, but its implementation is considerably slower than HEVC. In order to enable better execution times, we choose to use the OpenCL API as the GPU enabling language in order to increase the module performance. With its most costly setting, we are able to reduce the GT module execution time from 6.9 days to less then 4 hours, effectively attaining a speedup of 45x.

Tags: Algorithms, AMD Radeon R9 290X, ARM, ATI, ATI Radeon HD 7970, Compression, Computer science, nVidia, nVidia GeForce GTX 480, nVidia GeForce GTX 570 M, nVidia GeForce GTX 680, nVidia GeForce GTX 750 Ti, nVidia GeForce GTX Titan Black, nVidia Jetson TK1, OpenCL, OpenMP, Thesis

November 30, 2016 by hgpu

Rating: 1.5/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org