high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Biology » FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Shenggan Cheng, Ruidong Wu, Zhongming Yu, Binrui Li, Xiwen Zhang, Jian Peng, Yang You

National University of Singapore

arXiv:2203.00854 [cs.LG], (2 Mar 2022)

DOI:10.48550/arXiv.2203.00854

BibTeX

Download (PDF)

View

Source

Source codes

Package:

FastFold: Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters

1381

views

Protein structure prediction is an important method for understanding gene translation and protein function in the domain of structural biology. AlphaFold introduced the Transformer model to the field of protein structure prediction with atomic accuracy. However, training and inference of the AlphaFold model are time-consuming and expensive because of the special performance characteristics and huge memory consumption. In this paper, we propose FastFold, a highly efficient implementation of protein structure prediction model for training and inference. FastFold includes a series of GPU optimizations based on a thorough analysis of AlphaFold’s performance. Meanwhile, with Dynamic Axial Parallelism and Duality Async Operation, FastFold achieves high model parallelism scaling efficiency, surpassing existing popular model parallelism techniques. Experimental results show that FastFold reduces overall training time from 11 days to 67 hours and achieves 7.5-9.5x speedup for long-sequence inference. Furthermore, We scaled FastFold to 512 GPUs and achieved an aggregate of 6.02 PetaFLOPs with 90.1% parallel efficiency. The implementation is available.

Tags: Biology, Computer science, CUDA, Machine learning, nVidia, nVidia A100, Package

March 6, 2022 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Package:

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

FastFold: Reducing AlphaFold Training Time from 11 Days to 67 Hours

Package:

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)