high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7

Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7

Keven (Kedao) Wang

Stanford University

Stanford University, 2015

BibTeX

Download (PDF)

View

Source

3097

views

This project classifies images in Tiny ImageNet Challenge, a dataset with 200 classes and 500 training examples for each class. Three network architectures are experimented: a traditional architecture with 4 convolutional layers + 2 fully-connected layers; a Tiny GoogleNet with 3 inception layers; and a pyramid representation-based network. Tiny GoogleNet achieved the highest top-1 validation accuracy of 47%. Work is done to reduce overfitting. Dropout improves validation accuracy by 10%. Data-augmentation of random crop and horizontal flip increased validation accuracy by 10%. Rotation does not appear to improve validation accuracy. Pyramid representation shows significant computational efficiency, achieving similar top result 240% faster computation time per batch. Training accuracy converges at 65 – 70% for all three networks. Future work is to increase expressive power of network. Training was done on Torch 7 with Facebook’s Deep Learning Extension.

Tags: Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia GRID K520

April 14, 2015 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7

Your response

Recent source codes

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

KISim: Kubernetes Intelligent Scheduling Simulator

Efficient GPU Implementation of Multi-Precision Integer Division

exa-AMD: Exascale Accelerated Materials Discovery

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Most viewed papers (last 30 days)

Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)