high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Accelerating Kernel Density Estimation on the GPU Using the CUDA Framework

Accelerating Kernel Density Estimation on the GPU Using the CUDA Framework

Panagiotis D. Michailidis, Konstantinos G. Margaritis

Department of Balkan Studies, University of Western Macedonia, 3rd Km Florinas-Nikis National Road, Florina, 53100, Greece

Applied Mathematical Sciences, Vol. 7, no. 30, 1447 – 1476, 2013

@article{michailidis2013accelerating,

title={Accelerating Kernel Density Estimation on the GPU Using the CUDA Framework},

author={Michailidis, Panagiotis D and Margaritis, Konstantinos G},

journal={Applied Mathematical Sciences},

volume={7},

number={30},

pages={1447–1476},

year={2013}

}

Download (PDF)

View

Source

2386

views

The main problem of the kernel density estimation methods is the huge computational requirements, especially for large data sets. One way for accelerating these methods is to use the parallel processing. Recent advances in parallel processing have focused on the use Graphics Processing Units (GPUs) using Compute Unified Device Architecture (CUDA) programming model. In this work we discuss a naive and two optimised CUDA algorithms for the two kernel estimation methods: univariate and multivariate. These optimised algorithms are based on the use of shared memory tiles and loop unrolling techniques. We also present exploratory experimental results of the proposed CUDA algorithms according to the several values of parameters such as number of threads per block, tile size, loop unroll level, number of variables and data (sample) size. The experimental results show significant performance gains of all proposed CUDA algorithms over serial CPU version and small performance speed-ups of the two optimised CUDA algorithms over naive GPU algorithms. Finally, based on extended performance results are obtained general conclusions of all proposed CUDA algorithms for some parameters.

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 280, Programming techniques

March 2, 2013 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Accelerating Kernel Density Estimation on the GPU Using the CUDA Framework

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Accelerating Kernel Density Estimation on the GPU Using the CUDA Framework

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)