high performance computing on graphics processing units: hgpu.org

Learning Representation for Scene Understanding: Epitomes, CRFs, and CNNs

Liang-Chieh Chen

University of California, Los Angeles

University of California, 2015

@article{chen2015learning,
  title={Learning Representation for Scene Understanding: Epitomes, CRFs, and CNNs},
  author={Chen, Liang-Chieh},
  year={2015}
}

Source codes

Package:

DeepLab: deep learning system for semantic image segmentation

Scene understanding, such as image classification and semantic image segmentation, has been a challenging problem in computer vision. The difficulties mainly come from the feature representation, i.e., how to find a good representation for images. Instead of improving over hand-crafted features such as SIFT or HOG, we focus on learning image representations by generative and discriminative methods. In this thesis, we explore three areas for learning image representations: (1) generative models, (2) graphical models, and (3) deep neural networks. In particular, we propose a dictionary of epitomes, a compact generative representation that explicitly models object co-relation within edge patches as well as the photometric and position variability of image patches. Subsequently, we exploit Conditional Random Fields (CRFs) to take into account the dependencies between outputs. Finally, we employ Deep Convolutional Neural Networks trained with large-scale datasets to learn feature representations. We further combine CRFs with deep networks to estimate complex representations. Specifically, we show that our proposed model can achieve state-of-the-art performance on challenging semantic image segmentation benchmarks.

Tags: Caffe, Computer science, Computer vision, CUDA, Deep learning, Neural networks, nVidia, nVidia GeForce GTX 650 M, OpenCV, Tesla K40, Thesis

November 24, 2015 by hgpu
