high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Improving the Neural GPU Architecture for Algorithm Learning

Improving the Neural GPU Architecture for Algorithm Learning

Karlis Freivalds, Renars Liepins

Institute of Mathematics and Computer Science University of Latvia, Raina bulvaris 29, Riga, LV-1459, Latvia

arXiv:1702.08727 [cs.NE], (28 Feb 2017)

@article{freivalds2017improving,

title={Improving the Neural GPU Architecture for Algorithm Learning},

author={Freivalds, Karlis and Liepins, Renars},

year={2017},

month={feb},

archivePrefix={"arXiv"},

primaryClass={cs.NE}

}

Download (PDF)

View

Source

Source codes

Package:

Improving the Neural GPU Architecture for Algorithm Learning

2584

views

Algorithm learning is a core problem in artificial intelligence with significant implications on automation level that can be achieved by machines. Recently deep learning methods are emerging for synthesizing an algorithm from its input-output examples, the most successful being the Neural GPU, capable of learning multiplication. We present several improvements to the Neural GPU that substantially reduces training time and improves generalization. We introduce a technique of general applicability to use hard nonlinearities with saturation cost. We also introduce a technique of diagonal gates that can be applied to active-memory models. The proposed architecture is the first capable of learning decimal multiplication end-to-end.

Tags: Algorithms, Artificial intelligence, Computer science, Deep learning, Memory model, nVidia, Package, Python, TensorFlow, Tesla K40

March 5, 2017 by hgpu

Rating: 2.5/5. From 1 vote.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org

Improving the Neural GPU Architecture for Algorithm Learning

Package:

Your response

Recent source codes

tritonBLAS: A Lightweight Triton-based General Matrix Multiplication (GEMM) Library

hls4ml: Machine learning on FPGAs using HLS

ThunderKittens: Tile primitives for speedy kernels

NVIDIA Nemotron Parse 1.1

Iris: AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming

HipKittens: Fast and Furious AMD Kernels

Fortran xDSL dialects

mt4g: Memory Topology 4 GPUs

Falcon: GPU-Based Floating-point Adaptive Lossless Compression

CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization

Most viewed papers (last 30 days)

Improving the Neural GPU Architecture for Algorithm Learning

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)