A New Parallel Implementation of DSI Based Disparity Computation Using CUDA

A. Mehmood, Y. Soh, I. Kim
The Myongji University, Yongin, 449-728, Korea
International Journal of Computer and Communication Engineering 2014 3 51

   title={A New Parallel Implementation of DSI Based Disparity Computation Using CUDA},

   author={Mehmood, Aamer and Soh, Youngsung and Kim, Intaek}


Download Download (PDF)   View View   Source Source   



Stereo matching techniques are used to extract 3D information from 2D stereo pair of images. It can be classified into feature based approach, window (area) based approach, and optimization based approach. Feature based approach generally generates sparse disparity map with high accuracy and low execution time. Window based approach produces dense disparity map with low accuracy and low execution time. Optimization based approach generates dense disparity map with high accuracy and high execution time. Since the ultimate goal of stereo matching is to obtain dense disparity map with high accuracy and low execution time, we choose to select optimization based approach and implement it in parallel framework to overcome execution speed deficiency. There are several optimization methods including dynamic programming, energy minimization, and graph algorithms. We choose to use dynamic programming based on disparity space image (DSI) since it is most appropriate for parallel framework. In this paper, we propose a new parallel algorithm and framework for DSI construction, dynamic programming (DP), and disparity computation using Compute Unified Device Architecture (CUDA). We tested the method on several stereo pairs and found that the method shows remarkable speedup while preserving the quality at a reasonable level.
VN:F [1.9.22_1171]
Rating: 0.0/5 (0 votes cast)

* * *

* * *

Follow us on Twitter

HGPU group

1666 peoples are following HGPU @twitter

Like us on Facebook

HGPU group

339 people like HGPU on Facebook

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: nVidia CUDA Toolkit 6.5.14, AMD APP SDK 3.0
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.3
  • SDK: AMD APP SDK 3.0

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2015 hgpu.org

All rights belong to the respective authors

Contact us: