high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Accelerating Topic Model Training on a Single Machine

Accelerating Topic Model Training on a Single Machine

Mian Lu, Ge Bai, Qiong Luo, Jie Tang, Jiuxin Zhao

A*STAR Institute of High Performance Computing, Singapore

Fifteenth International Asia-Pacific Web Conference (APWeb’13), 2013

@article{lu2013accelerating,

title={Accelerating Topic Model Training on a Single Machine},

author={Lu, M. and Bai, G. and Luo, Q. and Tang, J. and Zhao, J.},

year={2013}

}

Download (PDF)

View

Source

1694

views

We present the design and implementation of GLDA, a library that utilizes the GPU (Graphics Processing Unit) to perform Gibbs sampling of Latent Dirichlet Allocation (LDA) on a single machine. LDA is an effective topic model used in many applications, e.g., classification, feature selection, and information retrieval. However, training an LDA model on large data sets takes hours, even days, due to the heavy computation and intensive memory access. Therefore, we explore the use of the GPU to accelerate LDA training on a single machine. Specifically, we propose three memory-efficient techniques to handle large data sets on the GPU: (1) generating document-topic counts as needed instead of storing all of them, (2) adopting a compact storage scheme for sparse matrices, and (3) partitioning word tokens. Through these techniques, the LDA training which would take 10 GB memory originally, can be performed on a commodity GPU card with only 1 GB GPU memory. Furthermore, our GLDA achieves a speedup of 15X over the original CPU-based LDA for large data sets.

Tags: Computer science, CUDA, Information Retrieval, Latent Dirichlet allocation, nVidia, nVidia GeForce GTX 280

January 12, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org

Accelerating Topic Model Training on a Single Machine

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)

Accelerating Topic Model Training on a Single Machine

Share this:

Recent source codes

Most viewed papers (last 30 days)