high performance computing on graphics processing units: hgpu.org

Automated Architecture Design for Deep Neural Networks

Steven Abreu

Jacobs University Bremen

arXiv:1908.10714 [cs.LG] (22 Aug 2019)

Machine learning has made tremendous progress in recent years and has received a great deal of public attention. Though we are still far from designing a fully artificially intelligent agent, machine learning has produced many applications in which computers solve human learning tasks remarkably well. Much of this progress comes from a recent trend within machine learning called deep learning. Deep learning models are responsible for many state-of-the-art applications of machine learning. Despite their success, deep learning models are hard to train, very difficult to understand, and oftentimes so complex that training is only possible on very large GPU clusters. Much work has been done on enabling neural networks to learn efficiently. However, the architecture of such networks is still often designed manually, through trial and error and expert knowledge. This thesis inspects different approaches, existing and novel, to automating the design of deep feedforward neural networks. The aim is to create less complex models with good performance, to remove the burden of deciding on an architecture, and to make designing and training such deep networks more efficient.

Tags: Computer science, Deep learning, GPU cluster, Machine learning, Neural networks, nVidia, Tesla K80, Thesis

September 1, 2019 by hgpu
