high performance computing on graphics processing units: hgpu.org

Deep Neural Machine Translation with Weakly-Recurrent Units

Mattia Antonino Di Gangi, Marcello Federico

University of Trento, Italy

arXiv:1805.04185 [cs.CL], (10 May 2018)

@article{gangi2018deep,
  title={Deep Neural Machine Translation with Weakly-Recurrent Units},
  author={Gangi, Mattia Antonino Di and Federico, Marcello},
  year={2018},
  month={may},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}

Source code package: SR-NMT, a PyTorch implementation of the paper's Simple Recurrent NMT model

Recurrent neural networks (RNNs) have for years represented the state of the art in neural machine translation (NMT). Recently, new architectures have been proposed that leverage parallel computation on GPUs better than classical RNNs. Faster training and inference, combined with different sequence-to-sequence modeling, also lead to performance improvements. While the new models depart completely from the original recurrent architecture, we decided to investigate how to make RNNs more efficient. In this work, we propose a new recurrent NMT architecture, called Simple Recurrent NMT, built on a class of fast and weakly-recurrent units that use layer normalization and multiple attentions. Our experiments on the WMT14 English-to-German and WMT16 English-Romanian benchmarks show that our model represents a valid alternative to LSTMs, as it can achieve better results at a significantly lower computational cost.
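To make the idea of a "weakly-recurrent" unit in the abstract more concrete, here is a minimal sketch in plain Python. It assumes an SRU-style formulation (gates and projections computed from the current input only, with an element-wise recurrence on a cell state) and applies layer normalization to the input projection. The function names, the omission of biases, and the exact placement of layer normalization are illustrative assumptions and may differ from the equations used in the paper or in the SR-NMT package.

```python
import math
import random


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def matvec(W, x):
    # Dense matrix-vector product; on a GPU this would be one batched matmul.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def layer_norm(v, eps=1e-5):
    # Normalize a vector to zero mean and unit variance (no learned gain/bias).
    mean = sum(v) / len(v)
    var = sum((a - mean) ** 2 for a in v) / len(v)
    return [(a - mean) / math.sqrt(var + eps) for a in v]


def weakly_recurrent_layer(xs, W, Wf, Wr):
    """Hypothetical SRU-style layer: xs is a list of T input vectors of size d."""
    # Step 1: everything here depends only on the inputs, never on previous
    # states, so all T time steps could be computed in parallel.
    proj = [layer_norm(matvec(W, x)) for x in xs]
    forget = [[sigmoid(a) for a in matvec(Wf, x)] for x in xs]
    reset = [[sigmoid(a) for a in matvec(Wr, x)] for x in xs]

    # Step 2: the only sequential part is a cheap element-wise update.
    c = [0.0] * len(W)
    outputs = []
    for x, z, f, r in zip(xs, proj, forget, reset):
        c = [fi * ci + (1.0 - fi) * zi for fi, ci, zi in zip(f, c, z)]
        # Highway-style output: mix the transformed state with the raw input.
        h = [ri * math.tanh(ci) + (1.0 - ri) * xi for ri, ci, xi in zip(r, c, x)]
        outputs.append(h)
    return outputs


# Usage with random (untrained) weights, purely to show the data flow.
random.seed(0)
d, T = 3, 4
W, Wf, Wr = ([[random.uniform(-1, 1) for _ in range(d)] for _ in range(d)]
             for _ in range(3))
xs = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(T)]
outs = weakly_recurrent_layer(xs, W, Wf, Wr)
print(len(outs), len(outs[0]))
```

The design point this sketch tries to capture is that the expensive matrix products sit outside the time loop, whereas in an LSTM they depend on the previous hidden state and must be computed step by step.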

Tags: Computer science, CUDA, NLP, nVidia, nVidia GeForce GTX 1080, Package, RNN, Tesla K80, Torch

May 20, 2018 by hgpu
