Parallelize L-BFGS-B on the GPU

Yun Fei, Wenping Wang, Bin Wang
The University of Hong Kong
Tech. Report, The University of Hong Kong, 2012


   title={Parallelize L-BFGS-B on the GPU},

   author={Fei, Y. and Wang, W. and Wang, B.},



Nonlinear optimization is at the heart of many algorithms in engineering. Recently, due to the rise of general purpose graphics processing unit (GPGPU), it is promising to investigate the performance improvement of optimization methods after parallelized. While much has been done for simple optimization methods such as conjugate gradient, due to the strong dependencies contained, little has been done for other more sophisticated ones, such as the limited memory Broyden-FletcherGoldfarb-Shanno with boundaries (L-BFGS-B). In this paper, for the first time, a parallelized implementation of L-BFGS-B on the GPU is introduced. We show how to remove those dependencies, and also demonstrate its significant speed-up for practical applications, in particular, for solving some general scientific problems and the centroidal Voronoi tessellation (CVT) problem.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: