14613

From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Networks

John-Alexander M. Assael
Imperial College London, Department of Computing
Imperial College London, 2015

@article{assael2015pixels,

   title={From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Networks},

   author={Assael, John-Alexander M},

   year={2015}

}

Data-efficient learning in continuous state-action spaces using high-dimensional observations remains an elusive challenge in developing fully autonomous systems. An instance of this challenge is the pixels to torques problem, which identifies key elements of an autonomous agent: autonomous thinking and decision making using sensor measurements only, learning from mistakes, and applying past experiences to novel situations. In this research, we introduce a deep dynamical convolutional model, able to learn complex non-linear dynamics and do long-term predictions. Compared to state-of-the-art reinforcement learning methods for continuous state and action space problems, our approach is solid and efficient as it is model-based, is scalable to high-dimensional state spaces, learns quickly, and is a major step towards fully autonomous learning from pixels to torques.
VN:F [1.9.22_1171]
Rating: 5.0/5 (1 vote cast)
From Pixels to Torques: Policy Learning using Deep Dynamical Convolutional Networks, 5.0 out of 5 based on 1 rating

* * *

* * *

TwitterAPIExchange Object
(
    [oauth_access_token:TwitterAPIExchange:private] => 301967669-yDz6MrfyJFFsH1DVvrw5Xb9phx2d0DSOFuLehBGh
    [oauth_access_token_secret:TwitterAPIExchange:private] => o29ji3VLVmB6jASMqY8G7QZDCrdFmoTvCDNNUlb7s
    [consumer_key:TwitterAPIExchange:private] => TdQb63pho0ak9VevwMWpEgXAE
    [consumer_secret:TwitterAPIExchange:private] => Uq4rWz7nUnH1y6ab6uQ9xMk0KLcDrmckneEMdlq6G5E0jlQCFx
    [postfields:TwitterAPIExchange:private] => 
    [getfield:TwitterAPIExchange:private] => ?cursor=-1&screen_name=hgpu&skip_status=true&include_user_entities=false
    [oauth:protected] => Array
        (
            [oauth_consumer_key] => TdQb63pho0ak9VevwMWpEgXAE
            [oauth_nonce] => 1485065092
            [oauth_signature_method] => HMAC-SHA1
            [oauth_token] => 301967669-yDz6MrfyJFFsH1DVvrw5Xb9phx2d0DSOFuLehBGh
            [oauth_timestamp] => 1485065092
            [oauth_version] => 1.0
            [cursor] => -1
            [screen_name] => hgpu
            [skip_status] => true
            [include_user_entities] => false
            [oauth_signature] => tZAIzFGV6pfW4p7JvWAk8iqNPek=
        )

    [url] => https://api.twitter.com/1.1/users/show.json
)
Follow us on Facebook
Follow us on Twitter

HGPU group

2137 peoples are following HGPU @twitter

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: