Application of the Dominance-based Rough Set Approach to Case-based Reasoning

25
Application of the Dominance-based Rough Set Approach to Case-based Reasoning Marcin Szeląg Institute of Computing Science, Poznań University of Technology, 60-965 Poznań, Poland 12.5.2010 1

Transcript of Application of the Dominance-based Rough Set Approach to Case-based Reasoning

Page 1: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 1/25

Application of the Dominance-based Rough Set

Approach to Case-based Reasoning

Marcin Szeląg

Institute of Computing Science, Poznań University of Technology,

60-965 Poznań, Poland

12.5.2010

1

Page 2: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 2/25

Outline

1   Introduction

2   CBR-DRSA Methodology

Basic Notions and DefinitionsDecision RulesCertain and Possible Fuzzy Classification

3   Conclusions

2

Page 3: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 3/25

Introduction

3

Page 4: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 4/25

General Motivations

Case-based reasoning  (CBR) is a natural way in which peoplesolve problems. It is a process of solving new problems basedon the solutions of  similar  problems from the past.

When calculating similarity of two objects, main difficulty

consists in  aggregation  of different criteria/attributes; usuallysuch aggregation is performed  arbitrary , using weights oraggregation operators like sum, average or distance metrics.

Therefore, there is a need for multi-criteria/multi-attributemodelling method that allows to include domain knowledge,

can handle possible inconsistencies in data, and avoids anyaggregation operators.

4

Page 5: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 5/25

Information and Decision Table

Information table is defined by a set of objects  .

Objects are described by a set   of criteria and regular attributes.

If the set of criteria and regular attributes    is divided into twodisjoint subsets of conditions    and decisions  , then informationtable is called a decision table.

Criterion is an attribute with values ordered according to a scale of preference introduced (by a decision maker) as a part of domainknowledge.

We distinguish two types of criteria:

ordinal, with values expressed on ordinal scale,

cardinal, with values expressed on interval or ratio scale.

5

Page 6: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 6/25

Multi-attribute Fuzzy Classification Problem Statement

There is given a set of objects    described in terms of condition

attributes from set   , and a set of decision classes  ,   .Each decision class      ,       is a fuzzy set withmembership function       . For given object   ,value    reflects the credibility that object    belongs todecision class   .

There is given a marginal similarity function for each conditionattribute     ,    , and objects from set   are marked as reference objects (“known cases”).

The task is to build a similarity-based model that is capable of fuzzy classification, i.e., which can assign appropriate membershipvalue to each decision class      , for new (test) objects.

6

Page 7: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 7/25

Motivations for Application of DRSA

The problem can be effectively solved using Dominance-based

Rough Set Approach (DRSA), which:can handle inconsistences in data (preprocessing), resulting,e.g., from imprecise of incomplete information,

takes into account domain knowledge:

domains of attributes, i.e., sets of values that an attribute may

take while being meaningful for user’s perception,

division of attributes into condition and decision attributes,

monotonicity constraints between attributes, addressed by the

dominance principle,

works with heterogenous attributes – nominal, ordinal andcardinal (no need of discretization),

enables to infer decision rule model from decision table(disaggregation-aggregation paradigm).

7

f l d l

Page 8: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 8/25

Motivations for Using Decision Rule Model

Advantages of decision rules:

comprehensible form of knowledge representation,

can represent any function (more general than utility functions

or binary relations),resistant to irrelevant attributes,

do not require aggregation operators,

support “backtracing”,

can explain past decisions and predict future decisions.

8

Page 9: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 9/25

CBR-DRSA Methodology

9

M h d l f S l i F Cl ifi i P bl

Page 10: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 10/25

Methodology for Solving Fuzzy Classification Problems

Example for “fuzzy” IRIS problem – part of initial data:

10

CBR DRSA

Page 11: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 11/25

CBR-DRSA

The first step consists in creation of  similarity tables, one for eachreference object    . At this stage chosen marginalsimilarity functions are used to calculate  marginal similarities   w.r.t.chosen   reference objects .

Different  marginal similarity functions  can be used, depending onthe domain      of attribute     ,    . The minimalrequirement that each such function        must

satisfy is that   , where     is a   reference object .

11

CBR DRSA M i l Si il it F ti

Page 12: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 12/25

CBR-DRSA – Marginal Similarity Functions

Numeric attribute    with values on  interval   or ratio scale  –similarity is defined using a  mathematical function, e.g.:

      

  

   

   

. . .

Attribute    with nominal values – similarity is defined using atable , e.g.:

12

CBR DRSA Si il it T bl

Page 13: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 13/25

CBR-DRSA – Similarity Table

Example for “fuzzy” IRIS problem – part of the similarity tablecreated for reference object   no. 36 (5.1,3.4,1.5,0.2|1.0,0.5,0.4),with      

, for   :

13

CBR DRSA cuts of Decision Classes

Page 14: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 14/25

CBR-DRSA –   -cuts of Decision Classes

We consider each decision class (fuzzy set)       separately fromthe other classes. For each such class, we identify possibleupward/downward  -cuts in the following way:

       ,

       .

For an upward  -cut    it is required that       , while

for a downward  -cut   , it is required that       ,

where      denotes a set of membership values to     observed indata.

14

CBR DRSA Dominance Relation for Pairs of Objects

Page 15: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 15/25

CBR-DRSA – Dominance Relation for Pairs of Objects

The dominance relation between pairs of objects    and   ,

w.r.t. set of condition attributes        is defined as:         

15

CBR DRSA Dominance Cones

Page 16: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 16/25

CBR-DRSA – Dominance Cones

Given       and   , let:

 -positive dominance cone        ,

 -negative dominance cone        .

In the pair   ,    is considered to be a reference object, while  

is called a limit object, because it conditions the membership of  in  

   and    .

16

Contextual Approximations of cuts of Decision Classes

Page 17: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 17/25

Contextual Approximations of   -cuts of Decision Classes

Definitions of  contextual   -lower approximations:

          ,

          .

Definitions of  contextual   -upper approximations:

  

    ,

  

    .

Definitions of  contextual   -boundaries:         ,         .

17

Approximations of -cuts of Decision Classes

Page 18: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 18/25

Approximations of   -cuts of Decision Classes

Definitions of   -lower approximations:

  

    ,

  

    .

Definitions of   -upper approximations:  

    ,

  

    .

Definitions of   -boundaries:

         ,         .

18

CBR-DRSA – Decision Rules

Page 19: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 19/25

CBR DRSA Decision Rules

Decision rules are induced in order to identify similarity-basedpatterns in data.

All pairs of objects   , where    belongs to lower approximationof some    or  , are basis for induction of  certain rules.

All pairs of objects   , where    belongs to upper approximationof some    or  , are basis for induction of  possible rules.

19

CBR-DRSA – Decision Rules

Page 20: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 20/25

CBR DRSA Decision Rules

Exemplary certain decision rule, for upward union   ,generated for decision class  setosa:

If  -   -

then    belongs to class  setosa  to degree at least 0.8.

Exemplary possible decision rule, for downward union   ,generated for decision class  versicolor :

If  -

then    could belong to class  versicolor  to degree at most 0.5.

20

CBR-DRSA – Decision Rules

Page 21: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 21/25

CBR DRSA Decision Rules

There is a problem which decision rules resulting fromlower/upper approximations of  -cuts   ,    should beused for classification.

The minimal set of  certain/possible rules is   non-unique . Thechoice of such a set is  arbitrary   and  non-trivial .

On the other hand, (explicit) generation of a set of allcertain/possible  rules is  computationally hard .

21

CBR-DRSA – Decision Rules

Page 22: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 22/25

CBR DRSA Decision Rules

Proposed approach:

Certain knowledge   minimal set of certain rules  (MCR),generated from lower approximations of    and   ,independently for each decision class   .

Possible knowledge  minimal set of possible rules  (MPR),generated from upper approximations of    and   ,independently for each decision class   .

Only minimal certain/possible rules are taken into account.

Decision rules are generated by VC-DomLEM [1,2] algorithm.

22

CBR-DRSA – Fuzzy Classification of New Objects

Page 23: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 23/25

C S u y C ass cat o o e Objects

Fuzzy classification of a new (test) object is performedindependently for each decision class      .

For each such class two suggestions can be calculated:

certain suggestion, resulting from application of MCR,

possible suggestion, resulting from application of MPR.

Each suggestion is obtained using a VC-DRSA classifier [5].

23

Summary and Conclusions

Page 24: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 24/25

y

DRSA is a flexible modelling method that allows to includedomain knowledge and can handle possible inconsistencies in

data by calculating  lower   and  upper approximations  of sets.

DRSA allows to work with heterogenous attributes – nominal,ordinal and cardinal (no need of discretization).

DRSA can be applied to multi-attribute similarity-based fuzzy

classification problems, that employ  pairwise comparisons expressed in  similarity table .

Rule model has many advantages, e.g., comprehensibility,generality, lack of aggregation operators, predictive power,resistance to irrelevant attributes.

Definitions of rough approximations and syntax of decisionrules in CBR-DRSA are based on ordinal properties of similarity relations only – unlike in other methods (e.g., k-NN),no aggregation operators are used.

24

References

Page 25: Application of the Dominance-based Rough Set  Approach to Case-based Reasoning

7/27/2019 Application of the Dominance-based Rough Set Approach to Case-based Reasoning

http://slidepdf.com/reader/full/application-of-the-dominance-based-rough-set-approach-to-case-based-reasoning 25/25

1   J. Błaszczyński, R. Słowiński, M. Szeląg, Sequential Covering Rule Induction Algorithm forVariable Consistency Rough Set Approaches. Submitted to Information Sciences in 2009.

2   J. Błaszczyński, R. Słowiński, M. Szeląg, VC-DomLEM: Rule induction algorithm for variableconsistency rough set approaches. Research Report RA-07/09, Poznań University of Technology,2009.

3   S. Greco, B. Matarazzo, R. Słowiński, Granular Computing for Reasoning About Ordered Data:the Dominance-Based Rough Set Approach. Chapter 15 [in]: W. Pedrycz, A. Skowron, V.Kreinovich (eds.), Handbook of Granular Computing. John Wiley & Sons, Chichester, 2008, pp.347–373.

4   S. Greco, B. Matarazzo, R. Słowiński, Case-based reasoning using gradual rules induced fromdominance-based rough approximations. [In]: G. Wang, T. Li, J. W. Grzymała-Busse, D. Miao,A. Skowron, Y. Yao (eds.), Rough Sets and Knowledge Technology (RSKT 2008). Lecture Notesin Artificial Intelligence, vol. 5009, Springer-Verlag, Berlin, 2008, pp. 268–275.

5   J. Błaszczyński, S. Greco, R. Słowiński, Multi-criteria classification – A new scheme forapplication of dominance-based decision rules. European Journal of Operational Research,181(3), 2007, pp. 1030–1044.

6   S. Greco, B. Matarazzo, R. Słowiński, Dominance-based Rough Set Approach to Case-Based

Reasoning. [In]: V. Torra, Y. Narukawa, A. Valls, J. Domingo-Ferrer (eds.), Modelling Decisionsfor Artificial Intelligence. Lecture Notes in Artificial Intelligence, vol. 3885, Springer-Verlag,Berlin Heidelberg, 2006, pp. 7–18.

25