high performance computing on graphics processing units: hgpu.org


BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1

Matthieu Courbariaux, Yoshua Bengio

Université de Montréal

arXiv:1602.02830 [cs.LG], (9 Feb 2016)

@article{courbariaux2016binarynet,
  title={BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1},
  author={Courbariaux, Matthieu and Bengio, Yoshua},
  year={2016},
  month={feb},
  archivePrefix={"arXiv"},
  primaryClass={cs.LG}
}


We introduce BinaryNet, a method that trains deep neural networks (DNNs) with binary weights and activations when computing the parameters’ gradients. We show that BinaryNet can train a multilayer perceptron (MLP) on MNIST and ConvNets on CIFAR-10 and SVHN and achieve nearly state-of-the-art results. At run time, BinaryNet drastically reduces memory usage and replaces most multiplications with 1-bit exclusive-NOR (XNOR) operations, which might have a big impact on both general-purpose and dedicated deep learning hardware. We wrote a binary matrix multiplication GPU kernel that runs our MNIST MLP 7 times faster than an unoptimized GPU kernel, without any loss in classification accuracy. The code for BinaryNet is available.
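To illustrate why XNOR can stand in for multiplication when every value is +1 or -1, here is a minimal Python sketch. It is not the authors' GPU kernel. The helper names `pack_bits` and `xnor_dot` are hypothetical, and the encoding (+1 stored as bit 1, -1 as bit 0) is an assumption made for this example. Under that encoding, the product of two values is +1 exactly when their bits agree, so a dot product reduces to an XNOR followed by a bit count.

```python
def pack_bits(values):
    """Pack a sequence of +1/-1 values into one integer.

    Assumed encoding for this sketch: bit i is 1 when values[i] == +1,
    and 0 when values[i] == -1.
    """
    word = 0
    for i, v in enumerate(values):
        if v not in (1, -1):
            raise ValueError("values must be +1 or -1")
        if v == 1:
            word |= 1 << i
    return word


def xnor_dot(a_bits, b_bits, n):
    """Dot product of two length-n +1/-1 vectors from their packed encodings."""
    mask = (1 << n) - 1                 # keep only the n valid bit positions
    agree = ~(a_bits ^ b_bits) & mask   # XNOR: 1 wherever the two signs match
    matches = bin(agree).count("1")     # population count of matching positions
    return 2 * matches - n              # (+1 per match) + (-1 per mismatch)


# Compare against the ordinary multiply-accumulate dot product.
a = [1, -1, -1, 1, 1, -1, 1, 1]
b = [1, 1, -1, -1, 1, -1, -1, 1]
reference = sum(x * y for x, y in zip(a, b))
result = xnor_dot(pack_bits(a), pack_bits(b), len(a))
assert result == reference
```

Packing also hints at the memory saving mentioned in the abstract: each weight occupies a single bit instead of a 32-bit float. A real kernel would apply the same idea to fixed-width machine words with a hardware population-count instruction rather than Python integers.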

Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 750, Package, Python

February 10, 2016 by hgpu


