high performance computing on graphics processing units: hgpu.org

Accelerating Random Forests on CPUs and GPUs for Object-Class Image Segmentation

Benedikt Waldvogel

Rheinische Friedrich-Wilhelms-Universität Bonn

Bonn University, 2013

Package: CUDA Random Forest implementation for Image Labeling tasks

Random forests are a machine learning method that has recently become popular in the computer vision community for image segmentation and object detection tasks. Existing random forest implementations are either general-purpose and therefore not efficient for image segmentation, or they focus only on prediction speed. The implementation for the Microsoft Kinect gaming platform, for instance, recognizes the pose of the user in real time on a single Microsoft Xbox GPU. Training of that random forest, however, was conducted on a large cluster with 1000 CPU cores. In general, training on large datasets is computationally demanding and impedes scientific research, because the process takes a long time when a computing cluster is unavailable or too expensive for the task at hand. The goal of this master’s thesis is to accelerate training and prediction of random forests for object-class image segmentation on RGB-D datasets by making efficient use of CPUs and of the massively parallel computing power offered by GPUs. We present an implementation that runs up to 28 times faster on a GPU and can train a random forest in less than four minutes, drastically shortening a process that previously took about one whole day on a CPU. Dense classification of RGB-D images at VGA resolution runs in real time on a single mobile GPU.

Tags: Computer science, Computer vision, CUDA, Machine learning, nVidia, nVidia GeForce GTX 480, nVidia GeForce GTX 690, nVidia GeForce GTX Titan, Package, Tesla K20, Thesis

August 16, 2013 by hgpu
