SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Automatic detection of object based Region-of-Interest for image

compression

Sunhyoung Han

Transmission in erroneous channel

Basic Motivation

Spatially differentSuper resolution

ConstraintsLimited Resources &Channel Errors

Basic motivation

By having information about importance of regionsOne can wisely use the limited resources

User-adaptive Coder

• visual concepts of interest can

be anything

• main idea:

• let users define a universe

of objects of interest

• train saliency detector for

each object

• e.g. regions of “people”,

“the Capitol”, “trees”, etc.

User Adaptive Coder

query providedby user

traindetector

current trainingsets

User-adaptive coder

• user-adaptive coder:– detector should be generic enough to handle large

numbers of object categories

– training needs to be reasonably fast (including example preparation time)

“face” “lamp” “car”

User-adaptive coder• proposed detector

– top-down object detector (object category specified by user)

– focus on weak supervision instead of highly accurate localization

– composed of saliency detection and saliency validation

– discriminant saliency:

saliencyfilters

training

FIND best features

Discriminant Saliency

• start from a universe of classes (e.g. “faces”, “trees”,

“cars”, etc.)

• design a dictionary of features: e.g. linear

combinations of DCT coefficients at multiple scales

• salient features: those that best distinguish the object

class of interest from random background scenes.

• salient regions are the regions of the image where

these detectors have strong response

• see [Gao & Vasconcelos, NIPS, 2004].

Top-down Discriminant Saliency Model

Scale Selection

Faces Discriminant Feature

Selection

Salient Features

Background

Saliency Map

Original Feature Set

Malik-Perona pre-attentive perception model

ZXIk kk

;maxarg*

• saliency detector

• salient point sali:– magnitude i

– location li– scale si

• saliency map approximated by a Gaussian mixture

Saliency representation

image saliency map salient points

Probability map

Saliency validation• saliency detection:

– due to limited feature dictionary and/or limited training set

– coarse detection of object class of interest

• need to eliminate false positives

• saliency validation:– geometric consistency– reject salient points whose

spatial configuration is inconsistent with training examples

original Image

saliency map for ‘street sign’

example of saliency map

Saliency validation• learning a geometric model of salient point con-

figuration• two components:

- image alignment

• model:- classify pointsinto

• true positives

- configuration model

• false positives- model eachas Gaussian

Saliency validation

• model: two classes of points Y={0,1}– Y=1 true positive– Y=0 false positive

• saliency map: mixture of true and false positive saliency distributions

• each distribution approximated by aGaussian

• this is a two class clustering problem– can be solved by expectation-maximization

• graphical model

• non-standard issues– we start from distributions, not points– alignment does not depend on false

negatives

Saliency validation

E-stepM-step

L~uniform

Y~Bernoulli (1)

C|Y=i~multinomial (i)

X|Y=i,L=l,S=s,~G(x, l-, )

Saliency ValidationFor K training examples (# of saliency point is Nk for kth example) Missing data Y= j,

j {1,0}∈ Parameters j (probability for class j)

∑j (Covariance for class )

k (displacement for kth example)

For robust update

DERIVATION DETAILS

Saliency Validation• visualization of EM algorithm

Saliency detection result

Init saliency points overlapped over 40 samples

Visualized variance ∑1Overlapped points classified as ‘’object’’

Overlapped points classified as ‘’noise’’

Visualized variance ∑0

Saliency Validation

• examples of classified Points

• in summary, during training we learn– discriminant features– The “right” configuration of salient points

Examples of classified saliency points White if hij1>hij

0 Black otherwise

Region of interest detection

• find image window that best matches the learned configuration

• mathematically: - find location p where the posterior probability of the object class is the largest

Region of interest detection• by Bayes rule

– Posterior Likelihood x Prior

– likelihood is given by matching saliencies within the window

& the model

- prior measuresthe saliency massinside window ?

likelihoodPrior

Region of Interest Detection

• given the model– the likelihood, under it, of

a set of points drawn from the observed saliency distribution is

– and the optimal location is given by

Prior for location PWith saliency detector

DERIVATION DETAILS

Measure configuration matching

2. Determine scale(shape) of ROI mask Observation(∑*) from data and

prior(∑1) from training data are used

3. Thresholds PY|X,P(1|x,p*) to get binary ROI mask

** Once the center point is known the assignment of each point is given by

The observed configuration for Y=1 isx

∑1∑*

Saliency detection (for statue of liberty)Probability map (saliency only)

Probability map (with configuration info.) ROI mask

• Example of ROI Detection

Evaluation

• Using CalTech “Face” database &UIUC “Car side” database

• Evaluate robustness of learning– Dedicated Training set vs. Web Training set

• Evaluation Metric– ROC area curve

– PSNR gain for ROI coding vs. normal coding

Number of positive example: 550

Number of positive example: 100

Evaluation

• ROC area curve

False Positive

False PositiveT

“Car” “Face”

Evaluation

• PSNR performance comparison

“Car” “Face”Bit Per Pixel

Bit Per PixelP

14.3% bits can be saved even with web train uniform casefor the same image quality

Result Examples

ResultComparison of needed bits to get the same PSNR (30 dB) for ROI

Maximally, ¼ bits are enough to get the same quality for ROI area

Result Examples

Normal coding ROI coding

EM derivation• Want to fit lower level observation

• For a virtual sample X = {Xik|i=1, …, Nk and k=1, …, K} with the size of Mik=ik*N, likelihood becomes

• For complete set the log likelihood becomes

EM derivation

Maximization in the m-step is carried out by maximizing the Lagrangian

ROI Detection

For one sample point x1

For samples having distribution of

ROI Detection

Therefore,

SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Documents

Transcript of SVCL Automatic detection of object based Region-of-Interest for image compression Sunhyoung Han.

Emerging Innovations In Analytical Databases data innovations in analytical... · Breakthrough technology Updateable Column Store Automatic Compression Automatic Storage Indexes Minimize

AutoCompress: An Automatic DNN Structured …AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates Ning Liuy 1, Xiaolong Ma , Zhiyuan Xu3, Yanzhi

DOCTOR LIFE AIR COMPRESSION THERAPY SOLUTIONS - Electromedical.pdf · I-TECH UT2 - PROFESSIONAL ULTRASOUND THERAPY 28358. ELECTROMEDICAL DEVICES 122 • 28310 GIMA UT AUTOMATIC ...

Review of linear algebra - SVCL

image compression in data compression

Automatic High-speed Tablet Press - Capsule Filling … and lower Compression rollers adopt ... If over-filling or less-filling or the punch force over-limit, the machine. ... Automatic

EN Camera Series - JAI€¦ · EN Camera performs JPEG compression and automatic FTP transmission of captured i mages to a file server. Typical applications include Automatic Number

Integrated Systems Advanced Military for Air Traffic ... · Best signal selection Automatic delay compensation Echo cancellation Climax Radio-Telephone coupling Audio compression

kernels.ppt - SVCL

Image Compression and Video Compression 2004 Notes - 6 Audio Compression

Multimedia Compression ( Lossy Compression)

User Guide - Impact Test Automatic Compression... · User Guide User Guide. Automatic Compression Machines. CT340 CT360 CT 380 CT440 CT460 CT480 CT6400 CT770. Impact Test Equipment

Hyundai iMax · Bore x stroke 91.0 mm x 96.0 mm Compression ratio 16.4 Transmission Automatic 5 speed automatic with sequential manual mode ... Tyre dimensions 215/70 R16 C 215/65

Medie Tensiune.pdf · Cleme automate de legătură electrică şi mecanică Fargo automatic splices CALEM. . . . . 33 Cleme cu crestături Compression splices ... La cerere, EXIMPROD

Automatic and Smart Testing Systems Range · This all new automatic Quality Control compression machine is the outcome of 50 years of innovation and technical leadership in concrete

BRIQUETTING PRESSES 2 OR 3 COLUMNS · WORKING PROCESS Continuous feed by conveyor or grab claw Automatic positioning of waste in the compression chamber using feed rams Automatic,

Automatic Detection and Compression for Passive Acoustic ...di/papers/Automatic...Figure 1: The African forest elephant (Loxodonta cyclo-tis) is the smallest of the three extant elephant

Compression of Large Engineering 3D Models using Automatic Discovery of Repeating Geometric Features. Compression of Large Engineering 3D Models using.

Nuno Vasconcelos UCSD - SVCL - Statistical Visual Computing Lab

Automatic and Dynamic Conﬁguration of Data Compression … · Automatic and Dynamic Conﬁguration of Data Compression for Web Servers Eyal Zohar Yahoo! Labs Haifa, Israel eyalz@yahoo-inc.com