
Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Kamal Raj Kanakarajan, Bhuvana Kundumani, Malaikannan Sankarasubbu
SAAMA AI Research Lab, Chennai, India
arXiv:2109.10847 [cs.LG], (23 Sep 2021)

@misc{kanakarajan2021smallbench,
      title={Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing},
      author={Kamal Raj Kanakarajan and Bhuvana Kundumani and Malaikannan Sankarasubbu},
      year={2021},
      eprint={2109.10847},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}


Recent progress in the Natural Language Processing domain has given us several State-of-the-Art (SOTA) pretrained models which can be fine-tuned for specific tasks. These large models, with billions of parameters trained on numerous GPUs/TPUs over weeks, lead the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost- and time-effective smaller models trained on a single GPU. This will enable researchers with resource constraints to experiment with novel and innovative ideas on tokenization, pretraining tasks, architecture, fine-tuning methods, etc. We set up Small-Bench NLP, a benchmark for small, efficient neural language models trained on a single GPU. The Small-Bench NLP benchmark comprises eight NLP tasks on the publicly available GLUE datasets and a leaderboard to track the progress of the community. Our ELECTRA-DeBERTa (15M parameters) small model architecture achieves an average score of 81.53, which is comparable to BERT-Base's 82.20 (110M parameters). Our models, code and leaderboard are available.
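Since the benchmark tasks are drawn from GLUE, a small single-GPU model can be evaluated with standard open-source tooling. The following is a minimal sketch, not the authors' released code, of fine-tuning a small publicly available encoder on one GLUE task using the Hugging Face datasets and transformers libraries; the checkpoint name (google/electra-small-discriminator), the task choice (MRPC) and the hyperparameters are illustrative assumptions only.

# Illustrative sketch only (not the authors' released code): fine-tune a small
# publicly available encoder on one of the eight GLUE tasks with Hugging Face
# libraries. Checkpoint, task (MRPC) and hyperparameters are assumptions.
import numpy as np
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)

model_name = "google/electra-small-discriminator"   # placeholder small model

dataset = load_dataset("glue", "mrpc")               # one GLUE sentence-pair task
tokenizer = AutoTokenizer.from_pretrained(model_name)

def tokenize(batch):
    # MRPC is a sentence-pair task; other GLUE tasks use different column names.
    return tokenizer(batch["sentence1"], batch["sentence2"],
                     truncation=True, padding="max_length", max_length=128)

encoded = dataset.map(tokenize, batched=True)

model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

def compute_metrics(eval_pred):
    # Simple accuracy on the validation split.
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {"accuracy": float((preds == labels).mean())}

args = TrainingArguments(output_dir="small-bench-mrpc",
                         per_device_train_batch_size=32,
                         num_train_epochs=3)

trainer = Trainer(model=model, args=args,
                  train_dataset=encoded["train"],
                  eval_dataset=encoded["validation"],
                  compute_metrics=compute_metrics)
trainer.train()
print(trainer.evaluate())    # validation accuracy on the chosen task

A full Small-Bench NLP run would repeat this loop over all eight GLUE tasks and average the task scores, as the paper's reported 81.53 average suggests.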
