high performance computing on graphics processing units: hgpu.org

TopicBERT for Energy Efficient Document Classification

Yatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas Runkler, Hinrich Schütze

DRIMCo GmbH, Munich, Germany

arXiv:2010.16407 [cs.CL], (15 Oct 2020)

Prior research notes that BERT’s computational cost grows quadratically with sequence length, leading to longer training times, higher GPU memory requirements, and higher carbon emissions. While recent work seeks to address these scalability issues at pre-training, they are also prominent in fine-tuning, especially for long-sequence tasks such as document classification. Our work therefore focuses on reducing the computational cost of fine-tuning for document classification. We achieve this through complementary learning of topic and language models in a unified framework, named TopicBERT. This significantly reduces the number of self-attention operations, a main performance bottleneck. Consequently, our model achieves a 1.4x (~40%) speedup with a ~40% reduction in CO2 emissions while retaining 99.9% of performance across 5 datasets.
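The quadratic growth mentioned in the abstract can be made concrete with a little arithmetic. The sketch below is only an illustration, not the authors' implementation: it counts query-key score computations for a single attention head, and it assumes, hypothetically, that a long document is processed as several equal-length segments instead of one full sequence. The abstract does not say how TopicBERT reduces self-attention operations, so the function names and the segmenting scheme are assumptions made for this example.

```python
def attention_pairs(seq_len: int) -> int:
    """Query-key score computations for one self-attention head.

    Every token attends to every token, so the count is seq_len squared.
    """
    return seq_len * seq_len


def partitioned_attention_pairs(seq_len: int, num_parts: int) -> int:
    """Score computations when the sequence is split into equal segments.

    Hypothetical scheme for illustration only: each segment attends
    within itself, so the total is num_parts * (seq_len / num_parts) ** 2.
    """
    if num_parts <= 0 or seq_len % num_parts != 0:
        raise ValueError("seq_len must be divisible by a positive num_parts")
    part_len = seq_len // num_parts
    return num_parts * attention_pairs(part_len)


if __name__ == "__main__":
    # Doubling the sequence length quadruples the number of score computations.
    for n in (128, 256, 512):
        print(f"seq_len={n:4d}  pairs={attention_pairs(n)}")

    full = attention_pairs(512)
    split = partitioned_attention_pairs(512, 4)
    print(f"full={full}  split={split}  ratio={full / split}")
```

This counts attention scores only. It ignores the feed-forward layers, the embedding lookups, and any cost of a topic model, so it should not be read as a prediction of the 1.4x speedup reported above.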

Tags: classification, Computer science, CUDA, Deep learning, NLP, nVidia, Tesla T4

November 8, 2020 by hgpu
