Mining Fuzzy Spatial Association Rules from Image Data

Post on 03-Jan-2016

53 views 0 download

Tags:

description

Mining Fuzzy Spatial Association Rules from Image Data. G. Brannon Smith Mississippi State University 6 June 2001. Introduction Motivation Brief Background Fuzzy Relative Position Object Co-occurence. Theory Aggregate Traditional Experiments Conclusion Future Work - PowerPoint PPT Presentation

Transcript of Mining Fuzzy Spatial Association Rules from Image Data

Mining Fuzzy Spatial Mining Fuzzy Spatial Association Rules from Association Rules from

Image DataImage Data

G. Brannon Smith

Mississippi State University

6 June 2001

ContentsContents

• Introduction

• Motivation

• Brief Background– Fuzzy Relative

Position

– Object Co-occurence

• Theory– Aggregate

– Traditional

• Experiments• Conclusion• Future Work• Selected References• Acknowledgements

IntroductionIntroduction

• Operating on raster image data = image space

• Images can be partitioned into regions or objects

(groups of like pixels)

• Like objects compose classes

• Would like to know general spatial, i.e.,

directional, arrangement of these

Depends

on DSK

Depends

on DSK

Introduction (cont.)Introduction (cont.)

• Association Rules seem appropriate – but

not made for raster data, so…

• Need an approach for finding generalized

fuzzy association rules on object spatial

relations pulled from image space

MotivationMotivation

• GENERAL: Periodic collection of vast

amounts of data = tedious for human to

analyze

• SPECIFIC: OKEANOS project sponsored

by NAVO collects many seafloor images

• Data Mining/Knowledge Discovery helps

BackgroundBackground

Fuzzy Set Theory (Zadeh)

Fuzzy Relative Position (Bloch)

Association Rules (Agrawal et al.)

Fuzzy Association Rules (Kuok, Fu & Wong)

Spatial Data Mining (Koperski & Han)

Object co-occurrence rules (Ordoñez &

Omiecinski)

Fuzzy Spatial RelationsFuzzy Spatial Relations

• I. Bloch applies fuzzy sets to spatial

relations

• Fuzzy concepts of position: right of is fuzzy

• Morphology (shape & size) has effect…

AR A R

Fuzzy Spatial Relations (cont.)Fuzzy Spatial Relations (cont.)

• Objects described as fuzzy sets (crisp OK)

Ex. A(x) and R(x) , xS

• Landscape: (R)(x) is whole image S in

relation to R in direction

• Relation: want A(x) and (R)(x) overlap

Fuzzy Landscape (single)Fuzzy Landscape (single)

Test Image Landscape RO#2, =0

ReferenceObject #2

Background;Empty Space

OO#4H G

F

Membership IntervalMembership Interval

• Bloch algorithm on all points in objects

• Result: 3 stats per relation, M[N,]:

SxA

R

ASx

R

ASx

xRxA

A

xxRsA

xxRt

M

N))(()(

1)(Mean

)](1),)(([inf)(Necessity

)](),)(([sup(A)yPossibilitR

membership average)(Mean

membership minimum)(Necessity

membership maximum(A)yPossibilitR

A

A

MN

R

R

Captures imprecision

Captures imprecision

Fuzzy Relation StatsFuzzy Relation Stats

N=0.9959, M=0.9999, = 1.0000

N=0.7557, M=0.9079, = 1.0000

R A R A

Image Data Mining Image Data Mining

• Ordoñez and Omiecinski have done

preliminary work in image space

• Used Blobworld to convert images to

transactions, objects to item meta-data

• ARM to find simple co-occurrence rules

HypothesisHypothesis

• Unified system of above can be made

• Raster Image data input (K&H)

• Fuzzy Spatial Relation metadata (Bloch)

• Fuzzy Assoc Rule mining (Agrawal et al., KFW)

• Result: useful fuzzy rules describing generalities of object spatial relations

Main ProblemMain Problem

• How to get from Fuzzy Relation metadata tuples (Bloch) to useful rules?

• What are rule forms?

• What are Support and Confidence or analogs thereof?

• Time? Space? Usefulness?

TheoryTheory

• A pre-emptive approach

• By aggregating objects into classes first, can do pseudo-mining right away

• PRO: Few landscapes, small, quick, no mining per se

• CON: lost info (e.g., no more indiv objs)

Fuzzy Landscape (multi)Fuzzy Landscape (multi)

Test Image Landscape RO#7, =0

ReferenceObject #7

Background;Empty Space

Theory (cont.)Theory (cont.)

• “Class-class” or “Pixel-Pixel” rule form:

• S & C

),,(),(),(

)()()()(,

yxinDirByinCAxinC

ypBcxpAcyx

NPNPNPSumS BA ),(

yPossibilitC

MeanC

NecessityC

M

N

For any pixel x of class A and any given pixel y of class B, it is implied that y is in direction of x, with some degree of confidence supported by some portion of the (meta) database.

Theory (cont.)Theory (cont.)

• Prev. ex.:

),,0(),(),(

)()()()(,

yxinDirGyinCGxinC

ypGcxpHcyx

%33.83S

%100

%99.99

%59.99

C

C

C

M

N

alpha RC# OC# N M 0.0 H G 0.9959 0.9999 1.0000

Theory (cont.)Theory (cont.)

• More traditional… (aggreg loses obj id)

• Given: relations for all obj pairs in 4 dirs

• 1.

),,(),()()(),()()(,

yxinDirByinCBcyoAxinCAcxoyx

NONONOSumS BA ),(

,,

),,(inDirmax

MNi

NO

yx

CA

Axi

Byi

For any object x of class A, there exists some object y of class B, such that that y is in direction of x, with some degree of confidence supported by some portion of the (meta) database.

Theory (cont.)Theory (cont.)

• Prev. ex. (same source objs):

alpha RO# RC# OO# OC# N M 0.0 1 H 4 G 0.6887 0.7590 0.8233

0.0 1 H 5 G 0.9959 0.9999 1.0000

0.0 2 H 4 G 0.9959 0.9999 1.0000

0.0 2 H 5 G 0.6904 0.7603 0.8242

0.0 3 H 4 G 0.6887 0.7590 0.8233

0.0 3 H 5 G 0.4842 0.5434 0.6019

Theory (cont.)Theory (cont.)

),,0()183,()183()()255,()255()(,

yxinDiryinCcyoxinCcxoyx

%33.83S

%11.94

%96.91

%35.89

C

C

C

M

N

Object based

Theory (cont.)Theory (cont.)

• 2.

),,(),()()(),()()(,

yxinDirByinCBcyoAxinCAcxoyx

formulae C)-(C above usingpixels OR objs with dealcan S''

,,

),,(inDir

MNi

NONO

yx

CBA

Ax Byi

i

Theory (cont.)Theory (cont.)

),,0()183,()183()()255,()255()(,

yxinDiryinCcyoxinCcxoyx

%33.83oS

%54.84

%36.80

%73.75

C

C

C

M

N

object

Time Time Parallel Parallel

• Landscape generation/Relation extraction independent for given RO,

• “Embarrassingly Parallel”

• mpiShell by Wooley shortens development time, allows user to exploit parallel

• Not linear: 16CPU 4; BUT very useful considering min implementation effort…

Experiments:Experiments:Synthetic Data Sample GraphsSynthetic Data Sample Graphs

Scatter plots of rules mined from

synthetic images with a

fuzzy spatial relation extractor, using

Obj-Obj rules

Synthetic DataSynthetic Data

• Synthetic Data Generator to produce images with bias – “loaded” images

• Can we extract rules that reflect the bias?

• Regular

• Extended

• Half

Side EffectsSide Effects

• Edge Effect – image edges

• Counterbias – wrong direction

• “Spillover” - other classes benefit

• Probability – bias is NOT a guarantee

Sample random 2 (6 classes)Sample random 2 (6 classes)

R2 graphR2 graph

Sample 4Sample 4

R=G, A=H of 6 classes, bias=90% Extended, =0

4 graph4 graph

Sample 2Sample 2

R=G, A=H of 6 classes, bias=80% Extended, =0

2 graph2 graph

Sample 10Sample 10

R=G, A=H of 6 classes, bias=95%, =0

10 graph10 graph

R=H, A=I

R=J, A=I

HalfHalf

A=I of 6 classes, bias=85% Half, =0

Half graphHalf graph

SeafloorSeafloor

Seafloor graphSeafloor graph

Seafloor Rule #148Seafloor Rule #148

ConclusionsConclusions

• Fairly recent discovery of Association Rules (1993) has enjoyed much growth. (Agrawal)

• Expansion into categorical, fuzzy , etc. (Srikant, Kuok/Fu/Wong, et al.)

• Many have done work with Spatial Databases – in Object Space (Koperski & Han)

• BUT…

ConclusionsConclusions

• Preliminary investigation on image object co-occurrence rules by Ordonez and Omiecinski aside…

• Very little work done in Association Rule Mining in (raster) Image Space, esp. fuzzy

• We have endeavored to fill this gap

ConclusionsConclusions

• Used Bloch Fuzzy Spatial Relations as tool for meta-data generation

• Used techniques inspired by (not implemented) Kuok, Fu & Wong

• Showed that we can find interesting and useful rules – both “loaded” and unknown

Future WorkFuture Work

• Better exploitation of fuzzy membership interval

• Application of thresholding typical to most AR to prune low fuzzy values

• Addition of a distance measure attribute

• Exploration of different kinds of rules such as Spatial Relation Co-occurence

SummarySummary

• Introduction

• Motivation

• Brief Background– Fuzzy Relative

Position

– Object Co-occurence

• Theory– Aggregate

– Traditional

• Experiments• Conclusion• Future Work• Selected References• Acknowledgements

Selected ReferencesSelected References

Agrawal, R., T. Imielinski, and A. Swami. 1993. Mining associations between sets of items in massive databases. In Proceedings of the 1993 ACM SIGMOD Int’l Conferences on Management of Data held in Washington, DC, May 26-28, 1993, 207-216. New York: ACM Press.

Bloch, I. 1999. Fuzzy relative position between objects in image processing: A morphological approach. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(7):657-664.

Fayyad, U. M., G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy (Eds.). 1996. Advances in knowledge discovery and data mining. Menlo Parks, CA: AAAI/MIT Press.

Knorr, E. M., and R. T. Ng. 1996. Finding aggregate proximity relationships and commonalities in spatial data mining. IEEE Transactions on Knowledge and Data Engineering 8(6):884-897.

Selected References (cont.)Selected References (cont.)

Koperski, K., J. Adhikary, and J. Han. 1996. Knowledge discovery in spatial databases: Progress and challenges. In Proceedings of the 1996 ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD’96) held in Montréal, June 2, 1996, 55-70. IRIS/Precarn.

Kuok, C. M., A. W.-C. Fu, and M. H. Wong. 1998. Mining fuzzy association rules in databases. SIGMOD Record 27(1):41-46.

Luo, J. and S. M. Bridges. 2000. Mining fuzzy association rules and frequency episodes for intrusion detection. International Journal of Intelligent Systems 15(8):687-703.

Ordonez, C. and E. Omiecinski. 1999. Discovering association rules based on image content. Proceedings of the 1999 IEEE Forum on Research and Technology Advances in Digital Libraries held in Baltimore, MD, May 19-21, 1999, 38-49. IEEE.

Selected References (cont.)Selected References (cont.)

Wooley, B. 2000. mpiShell Documentation. http://www.cs.msstate.edu/~bwooley/software/mpiShellDoc.html (Accessed 02 May 2001}.

Zadeh, L.A. 1965. Fuzzy sets. Information and Control 8(3):338-353.

Zimmerman, H.-J. 1996. Fuzzy set theory – and its applications (3rd ed.). Boston: Kluwer Academic Publishers.

AcknowledgementsAcknowledgements

Thanks to…

• Dr. Susan Bridges (Major Professor) for being a great editor of a very long document

• Bruce Wooley for creating mpiShell• Sean Taylor for code review

AcknowledgementsAcknowledgements

• Grants from NAVO Research group based at Stennis Space Center in Bay St. Louis, MS– National Science Foundation Grant

#9818489– ONR EPSCoR Grant N00014-96-1-1276– Naval Oceanographic Office via NASA

Stennis NAS1398033 DO92

URL for Thesis MaterialsURL for Thesis Materials

http://www.cs.msstate.edu/~smithg/thesis/

Includes this presentation, previous presentations (proposal, seminar, etc.), proposal text and thesis

text in PostScript and PDF formats

Questions and Comments?Questions and Comments?