Auto-optimization of a Feature Selection Algorithm
Department of Computer Science and Engineering, University of California San Diego, La Jolla, California 92093, USA
4th Workshop on Emerging Applications and Many-core Architecture, 2011
@inproceedings{unat2011auto,
  title={Auto-optimization of a Feature Selection Algorithm},
  author={Unat, D. and Kim, H.S. and Schulze, J.P. and Baden, S.B.},
  booktitle={4th Workshop on Emerging Applications and Many-core Architecture},
  year={2011}
}
Advanced visualization algorithms are typically computationally expensive but highly data parallel, which makes them attractive candidates for GPU architectures. However, porting an algorithm to a GPU remains a challenging process. The Mint programming model addresses this issue with a simple, high-level interface. It targets users who seek real-time performance without investing significant programming effort. In this work, we present automatic CUDA parallelization and optimization of the Harris interest point detection algorithm with Mint. Mint generates highly optimized CUDA C from annotated C source and performs several optimizations along the way. For four well-known volume-rendering datasets, the Mint-generated kernels run in under a second on a Tesla C1060 and deliver on average 10 times the performance of OpenMP with 4 threads on a Nehalem processor.
December 2, 2011 by hgpu