Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Dan Claudiu Ciresan, Ueli Meier, Luca Maria Gambardella, Juergen Schmidhuber
IDSIA, Galleria 2, 6928 Manno-Lugano, Switzerland
arXiv:1003.0358v1 [cs.NE] (1 Mar 2010)


@ARTICLE{2010arXiv1003.0358C,
   author = {{Claudiu Ciresan}, D. and {Meier}, U. and {Gambardella}, L.~M. and {Schmidhuber}, J.},
    title = "{Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition}",
  journal = {ArXiv e-prints},
   eprint = {1003.0358},
 keywords = {Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence},
     year = 2010,
    month = mar,
  adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}



Good old on-line back-propagation for plain multi-layer perceptrons yields a very low 0.35% error rate on the famous MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images, and graphics cards to greatly speed up learning.
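To make the recipe concrete, here is a minimal sketch (not the authors' code) of on-line back-propagation for a plain multi-layer perceptron: weights are updated after every single training example. The layer sizes, learning rate, and toy data below are illustrative stand-ins for the paper's large MNIST nets (784 inputs, thousands of hidden units, 10 outputs); only the small uniform weight initialization range mirrors the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_layer(n_in, n_out):
    # Small uniform initialization; the paper draws weights from [-0.05, 0.05].
    return rng.uniform(-0.05, 0.05, (n_in, n_out)), np.zeros(n_out)

W1, b1 = init_layer(4, 16)   # tiny stand-in for e.g. 784 -> 2500
W2, b2 = init_layer(16, 3)   # stand-in for hidden -> 10 output classes

def forward(x):
    h = np.tanh(x @ W1 + b1)               # hidden activation
    z = h @ W2 + b2
    p = np.exp(z - z.max()); p /= p.sum()  # softmax class probabilities
    return h, p

def train_step(x, y, lr=0.01):
    """One on-line update: back-propagate a single example."""
    global W1, b1, W2, b2
    h, p = forward(x)
    dz = p.copy(); dz[y] -= 1.0            # cross-entropy gradient at softmax
    dW2, db2 = np.outer(h, dz), dz
    dh = (W2 @ dz) * (1.0 - h**2)          # back-prop through tanh
    dW1, db1 = np.outer(x, dh), dh
    W2 -= lr * dW2; b2 -= lr * db2         # update immediately, per example
    W1 -= lr * dW1; b1 -= lr * db1

# Toy 3-class problem standing in for MNIST digits.
X = rng.normal(size=(300, 4))
y = (X[:, 0] + X[:, 1] > 0).astype(int) + (X[:, 2] > 1).astype(int)
for epoch in range(20):
    for xi, yi in zip(X, y):
        train_step(xi, yi)

acc = np.mean([forward(xi)[1].argmax() == yi for xi, yi in zip(X, y)])
print(f"training accuracy: {acc:.2f}")
```

The paper's other two ingredients, elastic/affine deformations of the training images and GPU kernels for the matrix products, plug into this same loop: deformations generate fresh examples each epoch, and the GPU accelerates the `@` products that dominate the cost.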
