8983

A Fast and Efficient SIFT Detector Using the Mobile GPU

Blaine Rister, Guohui Wang, Michael Wu, Joseph R. Cavallaro
Department of Electrical and Computer Engineering, Rice University, Houston, Texas
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013
@article{rister2013fast,

   title={A FAST AND EFFICIENT SIFT DETECTOR USING THE MOBILE GPU},

   author={Rister, Blaine and Wang, Guohui and Wu, Michael and Cavallaro, Joseph R},

   year={2013}

}

Download Download (PDF)   View View   Source Source   

698

views

Emerging mobile applications, such as augmented reality, demand robust feature detection at high frame rates. We present an implementation of the popular Scale-Invariant Feature Transform (SIFT) feature detection algorithm that incorporates the powerful graphics processing unit (GPU) in mobile devices. Where the usual GPU methods are inefficient on mobile hardware, we propose a heterogeneous dataflow scheme. By methodically partitioning the computation, compressing the data for memory transfers, and taking into account the unique challenges that arise out of the mobile GPU, we are able to achieve a speedup of 4-8x over an optimized CPU version, and a 6.4x speedup over a published GPU implementation. Additionally, we reduce energy consumption by 87 percent per image. We achieve near-realtime detection without compromising the original algorithm.
VN:F [1.9.22_1171]
Rating: 5.0/5 (1 vote cast)
A Fast and Efficient SIFT Detector Using the Mobile GPU, 5.0 out of 5 based on 1 rating

* * *

* * *

Like us on Facebook

HGPU group

136 people like HGPU on Facebook

Follow us on Twitter

HGPU group

1208 peoples are following HGPU @twitter

Featured events

* * *

Free GPU computing nodes at hgpu.org

Registered users can now run their OpenCL application at hgpu.org. We provide 1 minute of computer time per each run on two nodes with two AMD and one nVidia graphics processing units, correspondingly. There are no restrictions on the number of starts.

The platforms are

Node 1
  • GPU device 0: AMD/ATI Radeon HD 5870 2GB, 850MHz
  • GPU device 1: AMD/ATI Radeon HD 6970 2GB, 880MHz
  • CPU: AMD Phenom II X6 @ 2.8GHz 1055T
  • RAM: 12GB
  • OS: OpenSUSE 13.1
  • SDK: AMD APP SDK 2.9
Node 2
  • GPU device 0: AMD/ATI Radeon HD 7970 3GB, 1000MHz
  • GPU device 1: nVidia GeForce GTX 560 Ti 2GB, 822MHz
  • CPU: Intel Core i7-2600 @ 3.4GHz
  • RAM: 16GB
  • OS: OpenSUSE 12.2
  • SDK: nVidia CUDA Toolkit 6.0.1, AMD APP SDK 2.9

Completed OpenCL project should be uploaded via User dashboard (see instructions and example there), compilation and execution terminal output logs will be provided to the user.

The information send to hgpu.org will be treated according to our Privacy Policy

HGPU group © 2010-2014 hgpu.org

All rights belong to the respective authors

Contact us: