high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Marian: Cost-effective High-Quality Neural Machine Translation in C++

Marian: Cost-effective High-Quality Neural Machine Translation in C++

Marcin Junczys-Dowmunt, Kenneth Heafield, Hieu Hoang, Roman Grundkiewicz, Anthony Aue

Microsoft Translator, 1 Microsoft Way, Redmond, WA 98121, USA

arXiv:1805.12096 [cs.CL], (30 May 2018)

BibTeX

Download (PDF)

View

Source

Source codes

Package:

Marian: Cost-effective High-Quality Neural Machine Translation in C++

1708

views

This paper describes the submissions of the "Marian" team to the WNMT 2018 shared task. We investigate combinations of teacher-student training, low-precision matrix products, auto-tuning and other methods to optimize the Transformer model on GPU and CPU. By further integrating these methods with the new averaging attention networks, a recently introduced faster Transformer variant, we create a number of high-quality, high-performance models on the GPU and CPU, dominating the Pareto frontier for this shared task.

Tags: Computer science, CUDA, Machine learning, NLP, nVidia, Package

June 2, 2018 by hgpu

Rating: 3.7/5. From 3 votes.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org

Marian: Cost-effective High-Quality Neural Machine Translation in C++

Package:

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Marian: Cost-effective High-Quality Neural Machine Translation in C++

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)