high performance computing on graphics processing units: hgpu.org

From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Networks

John-Alexander M. Assael

Imperial College London, Department of Computing

Imperial College London, 2015

Package:

From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Neural Networks (DDCNN)

Data-efficient learning in continuous state-action spaces using high-dimensional observations remains an elusive challenge in developing fully autonomous systems. An instance of this challenge is the pixels-to-torques problem, which identifies key elements of an autonomous agent: autonomous thinking and decision making using sensor measurements only, learning from mistakes, and applying past experiences to novel situations. In this research, we introduce a deep dynamical convolutional model that can learn complex non-linear dynamics and make long-term predictions. Compared to state-of-the-art reinforcement learning methods for continuous state and action space problems, our approach is solid and efficient: it is model-based, scales to high-dimensional state spaces, and learns quickly. It is a major step towards fully autonomous learning from pixels to torques.

Tags: CNN, Computer science, CUDA, Deep learning, Lua, Machine learning, nVidia, Package, Tesla K40, Thesis

September 26, 2015 by hgpu
