
Stable and Efficient Representation Learning with Nonnegativity Constraints

Tsung-Han Lin and H.T. Kung

Unsupervised Representation Learning

[Diagram: the unsupervised pipeline stacks sparse encoders; at each layer, a large dictionary and a sparse encoder (e.g., l1-regularized sparse coding) produce the Layer 1, Layer 2, and Layer 3 representations in turn]

Why Sparse Representations?

• Prior knowledge is better encoded into sparse representations:
  - Data is explained by only a few underlying factors
  - Representations are more linearly separable

[Scatter plot over axes Feature A and Feature B, showing sparse representations forming linearly separable clusters]

• Simplifies supervised classifier training: sparse representations work well even when labeled samples are few

Computing Sparse Representations

Sparse approximation: express an input as a weighted sum of a few dictionary atoms, e.g.,

x = 0.5 × d1 + 0.3 × d2
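A minimal numeric sketch of this approximation (the atoms d1 and d2 below are made-up placeholders; only the weights 0.5 and 0.3 come from the slide):

```python
import numpy as np

# Two hypothetical unit-norm dictionary atoms.
d1 = np.array([1.0, 0.0, 0.0])
d2 = np.array([0.0, 1.0, 0.0])

# The input is a weighted sum of just two atoms, so its code over the
# dictionary is sparse: z = (0.5, 0.3, 0, ..., 0).
x = 0.5 * d1 + 0.3 * d2
```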

Two approaches to computing the sparse code:

• L1 relaxation approach: good classification accuracy, but computation is expensive

• Greedy approach (e.g., orthogonal matching pursuit): fast, but yields suboptimal classification accuracy

CIFAR-10 classification with a single-layer architecture [Coates 2011]:

Encoder                       L1-regularized   OMP
Classification accuracy (%)   78.7             76.0

Major Findings

• Weak stability explains OMP's suboptimal classification performance

• By allowing only additive features (via nonnegativity constraints), OMP delivers higher classification accuracy by large margins

• The resulting accuracy is competitive with deep neural networks

Stability of Representations

[Diagram: a data input x and a noisy copy x + n are fed to the same encoder; stability asks how much the resulting representation changes]

Orthogonal Matching Pursuit (OMP)

Select k atoms from a dictionary D that minimize |x - Dz|:

1. Select the atom that has the largest correlation with the residual and add it to the support set
2. Estimate the coefficients of the selected atoms by least squares, giving the current estimate Dz
3. Update the residual using the current estimate, and repeat until k atoms are selected

[Diagram, three animation frames: input x among atoms d1, d2, d3; OMP first adds d1 to the support set and computes residual r(1), then adds d3, giving support set {d1, d3}]
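A runnable sketch of the three steps above (NumPy; the function name omp and the assumption that D stores unit-norm atoms as columns are mine, not from the slides):

```python
import numpy as np

def omp(x, D, k):
    """Greedy sparse coding: select k atoms of D (unit-norm columns)
    that approximately minimize ||x - D z||."""
    x = np.asarray(x, dtype=float)
    residual = x.copy()
    support = []
    z = np.zeros(D.shape[1])
    for _ in range(k):
        # 1. Select the atom with the largest correlation with the residual.
        correlations = D.T @ residual
        support.append(int(np.argmax(np.abs(correlations))))
        # 2. Estimate coefficients of the selected atoms by least squares.
        coeffs, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        # 3. Update the residual using the current estimate D z.
        residual = x - D[:, support] @ coeffs
    z[support] = coeffs
    return z
```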

Nonnegative OMP (NOMP)

Use only additive features by constraining the atoms and coefficients to be nonnegative:

1. Larger region for noise tolerance
2. Terminates without overfitting

[Diagram: residual vectors near atoms d1 and d2; OMP matches atoms with either sign ("+d1", "-d2"), so a small noise n can flip which atom is selected, while NOMP's additive-only matching leaves a larger noise-tolerant region (angle δ) around each atom]
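A matching sketch of NOMP under the same assumptions (SciPy's nonnegative least squares replaces the unconstrained fit; the early-termination test is my reading of "terminate without overfitting"):

```python
import numpy as np
from scipy.optimize import nnls

def nomp(x, D, k):
    """Nonnegative OMP: atoms enter only additively, so both the
    selection step and the coefficients are constrained to be >= 0."""
    x = np.asarray(x, dtype=float)
    residual = x.copy()
    support, coeffs = [], np.array([])
    z = np.zeros(D.shape[1])
    for _ in range(k):
        correlations = D.T @ residual   # no abs(): additive matches only
        best = int(np.argmax(correlations))
        if correlations[best] <= 0:     # no additive atom helps anymore:
            break                       # stop instead of overfitting
        support.append(best)
        # Nonnegative least squares over the current support.
        coeffs, _ = nnls(D[:, support], x)
        residual = x - D[:, support] @ coeffs
    z[support] = coeffs
    return z
```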

Allowing Only Additive Features

[Diagram: an input decomposed into two features whose positive and negative parts cancel each other out]

With unconstrained signs, selected features can cancel one another. Enforce nonnegativity to eliminate cancellation:

• On input: sign splitting, e.g., (3, -2, -1) becomes a "+" channel (3, 0, 0) and a "-" channel (0, 2, 1)

• On dictionary: any nonnegative sparse coding algorithm works; we use spherical K-means

• On representation: encode with nonnegative OMP (NOMP)
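Sketches of the first two ingredients, following the slide's (3, -2, -1) example; the function names and the centroid-update details of spherical K-means are my assumptions:

```python
import numpy as np

def sign_split(v):
    """Sign splitting on the input: (3, -2, -1) ->
    '+' channel (3, 0, 0) and '-' channel (0, 2, 1), concatenated."""
    v = np.asarray(v, dtype=float)
    return np.concatenate([np.maximum(v, 0), np.maximum(-v, 0)])

def spherical_kmeans(X, n_atoms, n_iters=10, seed=0):
    """Minimal spherical K-means dictionary learning: unit-norm atoms,
    assignment by cosine similarity, centroids renormalized each pass.
    Returns atoms as rows; transpose for the omp/nomp sketches above."""
    rng = np.random.default_rng(seed)
    D = X[rng.choice(len(X), n_atoms, replace=False)].astype(float)
    D /= np.linalg.norm(D, axis=1, keepdims=True) + 1e-12
    for _ in range(n_iters):
        assign = np.argmax(X @ D.T, axis=1)   # nearest atom by dot product
        for j in range(n_atoms):
            members = X[assign == j]
            if len(members):
                centroid = members.sum(axis=0)
                D[j] = centroid / (np.linalg.norm(centroid) + 1e-12)
    return D

print(sign_split([3, -2, -1]))   # -> [3. 0. 0. 0. 2. 1.]
```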

Evaluate the Stability of Representations

[Experiment: a grating A is encoded by OMP/NOMP with a feature dictionary learned from image datasets, giving representation A; grating B, a copy rotated by a small angle δ, is encoded the same way, and the change is measured by the correlation between representations A and B]

Correlation between representations A and B:

Encoder   δ = 0   0.01π   0.02π   0.03π   0.04π
OMP       1       0.71    0.54    0.43    0.34
NOMP      1       0.92    0.80    0.68    0.57
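A sketch of this stability measurement, reusing the omp/nomp sketches above (the rotation helper, sparsity level k, and the radians-to-degrees conversion for SciPy are illustrative assumptions):

```python
import numpy as np
from scipy.ndimage import rotate

def representation_stability(encode, D, grating, delta_rad, k=8):
    """Correlation between the codes of a grating image and a copy
    rotated by a small angle delta_rad (e.g., 0.01 * np.pi).
    `encode` is omp or nomp; D holds atoms as columns of size grating.size."""
    rep_a = encode(grating.ravel(), D, k)
    grating_b = rotate(grating, np.degrees(delta_rad),
                       reshape=False, mode='nearest')
    rep_b = encode(grating_b.ravel(), D, k)
    return np.corrcoef(rep_a, rep_b)[0, 1]
```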

Classification: NOMP vs OMP

[Plot: classification accuracy on CIFAR-10; NOMP shows a ~3% improvement over OMP]

NOMP Outperforms OMP When Fewer Labeled Samples Are Available

[Plot: classification accuracy on CIFAR-10 as the number of labeled training samples is reduced]

STL-10: 10 classes, 100 labeled samples/class, 96x96 images
(airplane, bird, car, cat, deer, dog, horse, monkey, ship, truck)

Hierarchical matching pursuit (2012)   64.5%
This work                              67.9%

CIFAR-100: 100 classes, 500 labeled samples/class, 32x32 images
(aquatic mammals, fish, flowers, food containers, fruit and vegetables, household electrical devices, household furniture, insects, large carnivores, large man-made outdoor things, large natural outdoor scenes, large omnivores and herbivores, medium-sized mammals, non-insect invertebrates, people, reptiles, small mammals, trees, vehicles)

Maxout network (2013)   61.4%
This work               60.1%

Conclusion

• The greedy sparse encoder is useful: it yields a scalable unsupervised representation learning pipeline that attains state-of-the-art classification performance

• The choice of encoder is critical: the stability of the encoder is key to the quality of the representations