Learning a Metric Embedding for Face Recognition using the Multibatch Method

hgpu.org » Programming » Algorithms » Learning a Metric Embedding for Face Recognition using the Multibatch Method

Learning a Metric Embedding for Face Recognition using the Multibatch Method

Oren Tadmor, Yonatan Wexler, Tal Rosenwein, Shai Shalev-Shwartz, Amnon Shashua

Orcam Ltd., Jerusalem, Israel

arXiv:1605.07270 [cs.CV], (24 May 2016)

BibTeX

Download (PDF)

View

Source

1697

views

This work is motivated by the engineering task of achieving a near state-of-the-art face recognition on a minimal computing budget running on an embedded system. Our main technical contribution centers around a novel training method, called Multibatch, for similarity learning, i.e., for the task of generating an invariant "face signature" through training pairs of "same" and "not-same" face images. The Multibatch method first generates signatures for a mini-batch of $k$ face images and then constructs an unbiased estimate of the full gradient by relying on all $k^2-k$ pairs from the mini-batch. We prove that the variance of the Multibatch estimator is bounded by $O(1/k^2)$, under some mild conditions. In contrast, the standard gradient estimator that relies on random $k/2$ pairs has a variance of order $1/k$. The smaller variance of the Multibatch estimator significantly speeds up the convergence rate of stochastic gradient descent. Using the Multibatch method we train a deep convolutional neural network that achieves an accuracy of $98.2%$ on the LFW benchmark, while its prediction runtime takes only $30$msec on a single ARM Cortex A9 core. Furthermore, the entire training process took only 12 hours on a single Titan X GPU.

Tags: Algorithms, ARM, Computer science, Deep learning, Neural networks, nVidia, nVidia GeForce GTX Titan X, Performance

May 26, 2016 by hgpu

No votes yet.

Please wait...

high performance computing on graphics processing units: hgpu.org

Learning a Metric Embedding for Face Recognition using the Multibatch Method

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)

Learning a Metric Embedding for Face Recognition using the Multibatch Method

Share this:

Recent source codes

Most viewed papers (last 30 days)