high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » GA3C: GPU-based A3C for Deep Reinforcement Learning

GA3C: GPU-based A3C for Deep Reinforcement Learning

Mohammad Babaeizadeh, Iuri Frosio, Stephen Tyree, Jason Clemons, Jan Kautz

NVIDIA

arXiv:1611.06256 [cs.LG], (18 Nov 2016)

@article{babaeizadeh2016gpubased,

title={GA3C: GPU-based A3C for Deep Reinforcement Learning},

author={Babaeizadeh, Mohammad and Frosio, Iuri and Tyree, Stephen and Clemons, Jason and Kautz, Jan},

year={2016},

month={nov},

archivePrefix={"arXiv"},

primaryClass={cs.LG}

}

Download (PDF)

View

Source

2018

views

We introduce and analyze the computational aspects of a hybrid CPU/GPU implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. Our analysis concentrates on the critical aspects to leverage the GPU’s computational power, including the introduction of a system of queues and a dynamic scheduling strategy, potentially helpful for other asynchronous algorithms as well. We also show the potential for the use of larger DNN models on a GPU. Our TensorFlow implementation achieves a significant speed up compared to our CPU-only implementation, and it will be made publicly available to other researchers.

Tags: Computer science, CUDA, Deep learning, Neural networks, nVidia, nVidia GeForce GTX Titan X, TensorFlow

November 23, 2016 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

GA3C: GPU-based A3C for Deep Reinforcement Learning

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

GA3C: GPU-based A3C for Deep Reinforcement Learning

Share this:

Recent source codes

Most viewed papers (last 30 days)