Methodsfor DiscoveringProcess...

110
Methods for Discovering Process Models and Their Properties in Data Zbigniew Suraj Chair of Computer Science University of Rzeszow Laboratory of Intelligent Decision Support Systems, Institute of Computing Science Poznan University of Technology and Section of Intelligent Decision Support Systems and Soft Computing, Committee on Computer Scienc of the Polish Academy of Sciences January 11, 2011, Poznań

Transcript of Methodsfor DiscoveringProcess...

Page 1: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Methods for Discovering Process

Models and Their Properties in Data

Zbigniew Suraj

Chair of Computer Science

University of Rzeszow

Laboratory of Intelligent Decision Support Systems, Institute of Computing Science

Poznan University of Technology

and

Section of Intelligent Decision Support Systems and Soft Computing,

Committee on Computer Scienc of the Polish Academy of Sciences

January 11, 2011, Poznań

Page 2: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

2

OUTLINE

• Introduction

• Considered Problems

• Data and Knowledge Representations

• Concurrent Models

• Experiments

• Software

• Concluding Remarks and Further Work

Page 3: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

3

INTRODUCTION

• Data Mining and Knowledge Discovery are crucial

and current research problems in the modern

computer sciences

• Discovering hidden relationships in data is a main

goal of machine learning.

• In a lot of cases, data are generated by concurrent

processes. Therefore, discovering concurrent

system models is essential from the point of view

of understanding the nature of modeled systems

as well as explaining their behaviors.

Page 4: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

4

INTRODUCTION

• A notion of concurrent systems can be understood widely.

• In a general case, a concurrent system is a system consisting

of some processes, whose local states can coexist together

and they are partly independent.

• We can treat systems consisting of economic processes,

financial processes, biological processes, genetic processes,

meteorological processes, etc. as concurrent systems.

Page 5: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

5

INTRODUCTION

• This lecture concerns methods of concurrent system modeling on the basis

of observations or specifications of their behaviors given in the form of

different kinds of data tables.

• Data tables can include results of observations or measurements of specific

states of concurrent processes. In this case, created models of concurrent

systems are useful for analyzing properties of modeled systems, discovering

the new knowledge about behaviors of processes, etc.

• Data tables can also include specifications of behaviors of concurrent

processes.

• Then, created models can be a tool for verification of those specifications,

e.g. during designing concurrent systems. Methods presented in this lecture

can be used, i.e., in system designing or analyzing, data analysis,

forecasting.

Page 6: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

6

THE AIM OF RESEARCH

To present a general approach to inductive process

modelling from data.

This work is a continuation of a research

direction concerning relationships between

rough set theory and concurrency.

Page 7: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

7

SPECIFICATION OF CONCURRENT

SYSTEMS BY INFORMATION SYSTEMS

• Pawlak, Z.: Concurrent versus sequential the rough sets

perspective. Bulletin of the EATCS 48 (1992) 178—190

Page 8: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

8

EXAMPLE: LIGHT CONTROL

b

a c200u3

020u2

011u1

cba

uu11, u, u22, u, u33 -- statesstates

a, b, c a, b, c -- movement directionsmovement directions

0, 1, 20, 1, 2 -- light color (red, green, green arrowlight color (red, green, green arrow))

Global states

Local processes

Page 9: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

9

HOW TO DESIGN CONCURRENT

SYSTEMS FROM SPECIFICATIONS BY

INFORMATION SYSTEMS?

• Skowron, A., Suraj, Z.: Rough sets and concurrency. Bull. Pol. Acad. Sci. 41(3)

(1993) 237—254

• Suraj, Z.: Rough set methods for the synthesis and analysis of concurrent

processes. In: L. Polkowski, S. Tsumoto, T.Y. Lin (eds), Rough Set Methods

and Applications Studies in Fuzziness and Soft Computing 56, Springer

(2000) 379—488

• Pancerz, K., Suraj, Z.: Rough Sets for Discovering Concurrent System Models

from Data Tables, A.E. Hassanien, Z. Suraj, D. Slezak, P. Lingras (Eds.), Rough

Computing. Theories, Technologies, and Applications, IGI Global, 2008, pp.

239—268

Page 10: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

10

MAIN IDEA

• Dependencies defined by attributes are conditions for coexistence of local states in global states

• One can use the existing methods for generating rules representing such dependencies (e.g., based on reducts ornot)

• The set of rules is treated as knowledge representation for a given information system and defines its maximal extension, i.e., the set of global states consistent with all rules

• One can develop algorithms for designing, e.g., Petri Netsdefining maximal extensions of information systems (data tables)

Page 11: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

11

FORMALLY:

• Given information system A

• Define a theory Th(A) of A (consisting of a set of

rules describing dependencies in A)

• Th(A) defines the maximal extension of A

• Construct a Petri net consistent with Th(A)

Page 12: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

12

ADVANTAGES

• Complex Petri Nets can be generated automatically

from their specification by data tables

• Petri Net can be adaptively modified with changes of

data

Page 13: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

13

IMPORTANT QUESTIONS

• Which kinds of rules should be used (e.g., also non-deterministic, probabilistic)?

• How to characterize the expressibility of different rule sets?

• How to extend the approach by adding information on transition relation or temporal dependencies?

1. Moshkov, M., Skowron, A., Suraj, Z.: Maximal Consistent Extensionsof Information Systems Relative to Their Theories, InformationSciences , Elsevier, 178/12 (2008), 2600--2620.

2. Delimata, P., Moshkov, M., Skowron, A., Suraj, Z.: Inhibitory Rules in Data Analysis . A Rough Set Approach, Springer, 2009

Page 14: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

14

OUR METHODOLOGY

It is based on:

• rough set theory (Z. Pawlak 1982),

• Boolean reasoning (G. Boole, XIX cent. ),

• Petri nets (C.A. Petri, 1962).

1. Pawlak, Z.: Rough sets, International Journal of Computer and Information Sciences, 11, pp. 341-356, 1982.

2. Wegener, I. The complexity of Boolean functions. Wiley&B.G. Teubner, 1987.

3. Petri, C.A.: Kommunikation mit Automaten (Communication withautomata). Bonn: Institut für Instrumentelle Mathematik, Schriften des IIM Nr. 2, 1962.

Page 15: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

15

ROUGH SETS

• Rough set theory introduced by prof. Zdzisław

Pawlak (1982) from Poland provides advanced

and efficient methods of data analysis and

knowledge extraction.

Page 16: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

16

THE CREATOR OF ROUGH SETS

PawlakZdzislaw

Pawlak, Z.: Rough sets, International Journal of Computer and Information

Sciences, 11, pp. 341-356, 1982.

Page 17: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

17

More information about publications, software

and biographies of distinctive researchers in the

rough set theory and applications can be found

in the Rough Set Database under address:

http://rsds.univ.rzeszow.pl

THE RSDS

Page 18: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

18

Page 19: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

19

SCHEME OF BOOLEAN REASONING

Formulate a given problem P

Construct a Boolean function fPcorresponding to P

Interpret prime implicants

as solutions of P

Compute prime implicants of fP

Boolean reasoning

makes a base for solving

a lot of decision and

optimization problems.

Especially, it plays a

special role during

generation of decision

rules.

Page 20: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

20

DATA AND KNOWLEDGE

• In our approach:

� the data are represented in the form of data tables (information/ decision systems, specialized tables) that are used as the specification of the structure and behaviour of modelled systems,

� the knowledge about the structure and behaviour of the modelled system is extracted from a given data table and represented in the form of IF...THEN rules.

Page 21: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

21

CONCURRENT MODEL - PETRI NETS

� Petri nets are the graphical and mathematical tool

for modeling of different kinds of phenomena,

especially those, where actions executed

concurrently play a significant role.

� General net theory proposed by C.A. Petri in 1962.

� It is a generalisation of automata theory such that

the concept of concurrently occurring events can be

expressed.

Page 22: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

22

THE CREATOR OF GENERAL NET

THEORYCarl Adam Petri - German mathematician and computer scientist,

a honorary professor at the University of Hamburg.

Kommunikation mit Automaten (Communication with automata).

Bonn: Institut für Instrumentelle Mathematik, Schriften des IIM Nr.

2, 1962

His PhD thesis:

Page 23: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

23

WHY DO WE USE COLOURED PETRI NETS?

• In particular, coloured Petri nets have:

� solid mathematical backgrounds

� intuitive graphical (visual) representation

� the possibility to refine models (hierarchical representation)

� many practical applications

� available computer tools for the users (e.g. CPN Tools, CPNetwork)

Page 24: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

24

THE CREATOR OF COLOURED PETRI

NETS

First article:

Coloured Petri Nets and the Invariant Method. Theoretical ComputerScience 14 (1981), North-Holland, 317-336.

Kurt Jensen (1950 - ) – Danish computer scientist, professor of

Aarhus University.

Page 25: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

25

THE PROBLEMS

1. The synthesis problem

2. The decomposition problem

3. The reconstruction problem

4. The prediction problem

Page 26: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

26

1. THE SYNTHESIS PROBLEM

INPUT: A given data generated by a system of concurrent

processes.

OUTPUT: A concurrent model of the system discovered and

constructed on the base of knowledge extracted

from a given data in such a way that model global

states are consistent with the extracted knowledge

from the data.

An information system can include the knowledge about global states of a given concurrent system,

understood as vectors of local states of processes making up the concurrent system, whereas a

dynamic information system can include additionally the knowledge about transitions between global

states of the concurrent system. Specialized matrices are designed for specifying undesirable states of

a given concurrent system (i.e. those states, which cannot hold together) and undesirable transitions

between its states.

Page 27: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

27

2. THE DECOMPOSITION PROBLEM

INPUT: A given data (an information system) generated by a system

of concurrent processes, and concurrent model constructed

on the base the given data (as a result of the synthesis

algorithm).

OUTPUT: A family of components (subsystems) together with the links

(rules) binding those components which are sufficient to

build the original concurrent model such that model global

states are consistent with the extracted knowledge

from the given data.

Decomposition of data tables into smaller subtables connected by suitable rules is also possible.

Those subtables make up modules of a system. Local states of processes represented in a given

subtable are linked by means of a functional dependency.

Page 28: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

28

3. THE RECONSTRUCTION PROBLEM

INPUT: A given data table representing the specification of

concurrent process model obtained by the

synthesis algorithm, and a new data representing

a new specification of the modelled system.

OUTPUT: A plan (algorithm) of the reconstruction of a given

concurrent process model satisfying the new

specification represented by a new data.

In this lecture, the problems of reconstruction of models and prediction of their changes in time are

also taken up. Those problems occur as a result of appearing the new knowledge about modeled

systems and their behaviors. The new knowledge can be expressed by appearing new global states,

new transitions between states, new local states of individual processes or new processes in

modeled systems.

Page 29: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

29

4. THE PREDICTION PROBLEM

INPUT: A given concurrent model described by

temporal data (ordered in time).

OUTPUT: A set of prediction rules which can be used

to predict future changes of the model.

A prediction method proposed in this lecture points at the character of model changes

in time. For representing prediction rules, both prediction matrices and Pawlak’s flow

graphs are used.

Page 30: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

30

A DIAGRAM OF PROBLEM SOLVING

DATA REPRESENTATION KNOWLEDGE REPRESENTATION

CONCURRENT MODEL

Rough set methods

Structure Dynamics

Page 31: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Scheme 1: The synthesis problem

A real system

A description /

specification

Knowledge

Concurrent model

New knowledge

on the system

Observations / Measurements / Designing

Transformations

Coloured Petri nets

Analysis

Information system / Dynamic information system /

Decomposed information system / Specialized data tables

Rough set methods

Rules

Page 32: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Scheme 2: The reconstruction problem

A real system

at time t1

A description /

specification

Knowledge

Concurrent model

Reconstruction

A real system

at time t2

A description /

specification

Knowledge

New concurrent model

Page 33: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

33

RECONSTRUCTION PROBLEM• In our approach, the net model can be built on the basis of a decomposed

information system S describing a given concurrent system.

• If the description of a concurrent system changes (i.e., a new information

system S* appears), we have to reconstruct the net model representing the

concurrent system.

• The structure of a constructed net is determined on the basis of

components of an information system. So, changing reducts and

components in S can lead to a change in the structure of a net model. In that

case, we would like to know how the reducts and components change when

the new information about the system behavior appears.

• The idea of the reconstruction of a net model constructed for S can be

presented graphically using a block diagram as in Figure.

Page 34: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

34

Fig. RECONSTRUCTION PROBLEM

Page 35: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

35

RECONSTRUCTION PROBLEM(Remarks)

• The renewed computation of reducts and components of information

systems is time-consuming, because algorithms are NP-hard.

• So, it is important to compute new reducts and components in an efficient

way, i.e., without the necessity of renewed computations. Some method

has been proposed in:

Suraj, Z., & Pancerz, K. (2005). Restriction-Based Concurrent System Design

Using the Rough Set Formalism. Fundamenta Informaticae, 67(1-3), 233-

247.

• In the approach presented there, a particular case has been considered,

when the new description (in the form of an information system S*) of a

modeled system includes one new object (global state) with relation to the

old description (in the form of an information system S).

Page 36: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Scheme 4: The prediction problem

Observations / Measurements

Analysis

Temporal information system

A real system

changing in time

A description

Knowledge on system behaviour

in consecutive time windows

Concurrent models in consecutive

time windows

Prediction of model changes

Transformation

Rough set methods

Reducts, components,

rules

Coloured Petri nets

Page 37: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

37

PREDICTION RULES(Algorithm)

• Split a given temporal information system into time windows and obtain a set of all the

time windows.

• For each time window from the set and each attribute, compute a set of all functional

{a}-reducts of A – {a} and obtain a temporal table of functional reducts) whose columns

are labeled with attributes from A whereas rows, with consecutive time windows from S.

The cells of such a table contain sets of functional relative reducts.

• For each attribute a in A, build a temporal decision system. Attributes of this system are

labeled with the consecutive time windows (the last attribute is treated as a decision).

Each row represents a sequence of sets of functional relative reducts which appeared

in consecutive time windows.

• For each attribute a in A, compute prediction rules from the temporal decision system.

In order to represent such rules, use e.g. flow graphs proposed by Z. Pawlak.

See: Suraj, Z., Pancerz, K., & Swiniarski R.W.: Prediction of Model Changes of Concurrent Systems

Described by Temporal Information Systems. The 2005 World Congress in Applied Computing,

Las Vegas, Nevada, USA, 2005, CSREA Press, pp. 51--57.

Page 38: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

38

DATA REPRESENTATIONS

1. A data table (Pawlak’s information system)

2. A composed data table (dynamic information

system)

3. Specialized data tables (forbidden state/transition

matrices)

Page 39: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

39

Interpretation:

Global states

of S

Interpretation: Processes of S

02

20

01

10

baU\A

3u2u1u

4u

Interpretation:

A local state

of a given

process

Data Table (1)

Information system S = (U,A)

Page 40: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

40

INTERPRETED DATA TABLE

- A Communication System -

Device a

sender/receiver

Device b

sender/receiver

Bus

0 – sending

1 – receiving

2 – disconnecting

U\A a b

0 1

1 0

0 2

2 0

u1

u4

u2

u3

ACTIONS:

PROCESSES:

INTERPRETED DATA TABLE

Page 41: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

41

A DATA TABLE(REMARKS)

Advantages:

- Simple and intuitive specification of concurrent system

- Easy for interpretation into concurrent system concepts

Disadvantages:

- Partial information on dynamics of concurrent systems (only a set

of global states). Lack of information about transition relation.

Page 42: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

42

U\E

02

20

01

10

baU\A

3u2u1u

4u1u

2u

4u

3u

1e 4e3e2e

An underlying system S of DS A transition system TS of DS

Interpretation:

Global states of SInterpretation:

Local processes of S

Interpretation:

A next global state

Composed Data Table (2)

Dynamic information system DS = (U, A, E, T, u0)

A sequence: u1 -> u4 -> u2 -> u3 –> u1

Page 43: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

43

0

1

2

0

b’

1

0

0

2

a’

02

20

01

10

ba

3u2u1u

4u

'\ AAU ∪

Previous states(conditional attributes)

Next states(decisions)

Composed Data Table (2’)

• Dynamic information system

– weak specification )',( AAUST ∪=

It represents onlypositive informationabout a transitionrelation T.

A sequence: u1 -> u4 -> u2 -> u3 –> u1

Page 44: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

44...

0

0

0

1

d

...............

0

1

2

0

b’

1

0

0

2

a’

10

10

10

10

ba

3u2u1u

4u

}{'\ dAAU ∪∪

Previous states Next states

(conditional attributes) (decision)

Composed Data Table (2”)

• Dynamic information system

– strong specification

}){',( dAAUST ∪∪=

It representspositive andnegativeinformationabout a transitionrelation T.

YES

NO

NO

NO

Page 45: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

45

COMPOSED DATA TABLE (REMARKS)

Advantages:

- Simple and intuitive specification of concurrent system, easy for interpretation intoconcurrent system concepts

- Full information on dynamics of concurrent systems (a set of global states togetherwith a transition relation).

- Opportunity for different representations of the transition relation.

- Two structures of concurrent system models are considered: synchronous and asynchronous. A synchronous model enables us to generate the so-called maximal consistent extension of a given information system. Such an extension includes all possible global states consistent with all rules extracted from the original data table. An asynchronous model enables us to find all possible transitions between global states of a given concurrent system, for which only one process changes its local state.

Disadvantages:

- A rise of the tabular representation size.

Page 46: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

46

000110(b,2)

0

0

1

1

0

(b,2)

0

0

1

1

0

(b,1)

0110(b,1)

0

0

0

0

(b,0)

0

0

0

0

(a,2)

00(b,0)

00(a,2)

00(a,1)

00(a,0)

(a,1)(a,0)

Interpret. : A local state of process a

Specialized Data Tables (3)

Forbidden state matrix FSM

0 - YES

1 - NO

Interpret.: A local state of process b

Interpret.: Process a (= 1) can

coexist with process b (=0)

Page 47: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

47

Specialized Data Table (3)

101110(b,2)

0

0

1

0

1

(b’,2)

1

0

1

1

1

(b’,1)

1110(b,1)

0

0

1

0

(b’,0)

0

1

1

0

(a’,2)

00(b,0)

01(a,2)

10(a,1)

11(a,0)

(a’,1)(a’,0)

Interpret.: A current local state of a given process

Interpret.: A next local state of a

given process

Forbidden transition matrix FTM

Page 48: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

48

SPECIALIZED DATA TABLE(REMARKS)

Advantages:

- Precise information about structure and dynamics of concurrent

systems.

- Specialized matrices are designed for specifying undesirable

states of a given concurrent system (i.e. those states, which

cannot hold together) and undesirable transitions between its

states.

Disadvantages:

- A considerable rise of sizes of the tabular representations.

Page 49: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

49

Extracting Knowledge from Data

Page 50: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

50

KNOWLEDGE REPRESENTATIONS

• Deterministic rules

• Inhibitory rules

Page 51: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

51

DETERMINISTIC RULES

IF (expression) THEN (action)

or

... ⇒ attribute = value

• Example: IF (a = 1) THEN (b = 0)

Theory of information system based on deterministic rules can have nonstandard models:

See: Skowron, A., & Suraj, Z. (1993). Rough Sets and Concurrency. Bulletin of the Polish Academy of Sciences, 41(3), 237-254.

Page 52: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

52

• IF (expression) THEN (no action)

or

... ⇒ attribute ≠ value (inhibitory rule)

• Example: IF (a = 0) THEN (b ≠≠≠≠ 1)

Inhibitory rules do not allow nonstandard models

See: Moshkov, M., Skowron, A., Suraj, Z.: On Testing Membership to Maximal Consistent Extensions of Information Systems, in: Proc. of the 5th Int. Conf. on Rough Sets and Current Trends in Computing, RSCTC 2006, Kobe, Japan, LNAI 4259, Springer, 2006, pp. 85—90.

INHIBITORY RULES

Page 53: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

53

Maximal Consistent Extensions(intuitively)

Page 54: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

54

Example: Maximal Consistent Extensions

02

20

01

10

a2a1

00

12

21

22

11 a1 = 1 → a2 = 0 a1 = 2 → a2 = 0 a2 = 1 → a1 = 0 a2 = 2 → a1 = 0

hidden global state

All true and realizable

rules for S:

All true and realizable rules for S are true for (0, 0).

Page 55: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

55

Example: Maximal Consistent Extensions

02200110a2a1

0012212211

a1 = 0 →→→→ a2 ≠≠≠≠ 0,a1 = 1 → a2 ≠ 1,a1 = 1 → a2 ≠ 2,a1 = 2 → a2 ≠ 1,a1 = 2 → a2 ≠ 2,

hidden global state

All true and realizableinhibitory rules for S:

Not all true and realizable inhibitory rules for S are true for (0, 0).

a2 = 1 → a1 ≠ 1a2 = 1 → a1 ≠ 2a2 = 0 →→→→ a1 ≠≠≠≠ 0a2 = 2 → a1 ≠ 1a2 = 2 → a1 ≠ 2

Page 56: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

56

It can represent essentially more information encoded in

information systems than deterministic ones.

This fact was a reason to use inhibitory rules in classifiers as

well as in the concurrent system design.

INHIBITORY RULES(Remarks)

Page 57: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

57

Maximal Consistent Extensions(Remarks)

• There exist information systems S = (U, A) for which |Ext(S) \ U| = exp(|U|)

• There are no polynomial algorithms for construction of the set Ext(S)

• There exists polynomial algorithm which for a given object v ∈ V recognizes if v belongs to Ext(S) or not

See: Moshkov, M., Skowron, A., Suraj, Z.: Maximal ConsistentExtensions of Information Systems Relative to Their Theories, Information Sciences, Elsevier, 178/12 (2008), 2600--2620.

Page 58: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

58

SYSTEM DECOMPOSITION

),( iiii YXUS ∪=

}}{ and

a .t.reduct w.r a is

:{

aX

X

AaY

i

i

i

∈=

A given information system S:

Decomposition w.r.t. a reduct R of the system S

A normal component of S:

),( iiii YXUS ∪=

Degenerated component:

Si = (Ui, {a}) and the attributea does not appear inany normal component

Page 59: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

C-covering of S = (U,A) with links C:

A set of components:

where:

AYYXX kk =∪∪∪∪∪ ...... 11

),(

...

),( 1111

kkkk YXUS

YXUS

∪=

∪=

Covering of Information Systems

Page 60: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

A set of links C includes:

• rules corresponding dependencies between attribute values

of component Si , where i = 1,...,k, called internal links ofcomponent Si,

• rules corresponding dependencies between attribute values

of component Si and attribute values from outside of Si, where

i = 1,...,k, called external links of component Si

Links between Components

Page 61: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Example: Decomposition of Information Systems

2

0

1

1

c

11

11

01

20

baU\A

3u2u1u

4u1

1

0

a

1

0

2

bU’\A

3u2u1u

Normal component S1

2

01cU”\A

3u1u

4u

Degenerated component S2Reducts of S:

R1 = {b,c} R2 = {a,c}

A given information system S = (U,A)

with A = {a,b,c}:

Decompositionw.r.t. reduct R1:

A - R1= {a}

Page 62: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

62

Descriptive Set of Attributes

A set of attributes B ⊆⊆⊆⊆ A is called a descriptive set for

S if there exists a set of rules Q ⊆⊆⊆⊆ Rul(S) constructed

over the attributes from B only such that Ext(S)

coincides with the set of all tuples (objects) from V for

which all rules from Q are true.

Page 63: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

63

Irreducible Descriptive Set

• A descriptive set B for S is called irreducible if each

proper subset of B is not a descriptive set for S.

Page 64: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

64

Remarks

• We showed that for any information system S there exists only one irreducible descriptive set of attributes, and create a polynomial algorithm for this set construction.

• We proposed a polynomial algorithm recognizing if there exists a cover ofthe irreducible descriptive set by reducts of S.

• The obtained results will be useful in applications of information systems

connected with analysis and design of concurrent systems.

See: Moshkov, M., Skowron, A., Suraj, Z.: On Irreducible Descriptive Sets of Attributes for Information Systems. In: Transactions on Rough Sets XI, LNCS 5946, Springer, 2010, 92--105.

Moshkov, M., Skowron, A., Suraj, Z.: On Covering Attribute Sets by Reducts, in: Proceedings of the International Conference on Rough Sets and Intelligent Systems Paradigms, RSEISP’07, Warsaw, Poland, June 28-30, 2007, LNAI 4585, Springer, 2007, pp. 175--180.

Page 65: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

Concurrent Models- coloured Petri nets -

Σ – a nonempty finite set of types

P – a finite set of places, T – a finite set of transitions

A – a finite set of arcs, N – a node function

C – a colour function, G – a guard function

E – an arc expression, I – an initialization function

),,,,,,,,( IEGCNATPCPN Σ=

Definition of CP-net:

Page 66: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

66

Example: Coloured Petri Nets

Page 67: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

67

Solving Synthesis Problem

Page 68: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

68

TRANSFORMATION OF DATA

INTO CONCURRENT MODEL

Input: A given data table and a set of rules extracted from the data table.

Output: A resulting concurrent model (coloured Petri net).

Step 1. Construct a net representing the set of processes of a given data table.

Step 2. Add to the net obtained in Step 1 a net defined by the set of rules of a given data table, corresponding to all nontrivial dependencies (connections) between the values of attributes belonging to different processes of the data table.

The connections between processes represent constraints which must be satisfied when these processes in the system.

Step 3. Describe the elements (places, transitions and arcs) of the net defined in steps 1-2 according to the definition of a coloured Petri net.

Page 69: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

69

A Scheme of the Transformation

Information

system/decomposed

system/specialized

tables

Set of all minimal

rulesBoolean

expressions

Net model in the

form of CP-net

Net structure Guard expressions

Description

Concurrent model

Page 70: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

70

Computing rules for eachreduct separately

Computing reducts

Information system

Computing of inhibitorrules directly from a data

table

Computing all inhibitorrules

Creating asynchronousnet model

Creating synchronousnet model

Asynchronous model inthe form of a CP-net

Synchronous modelin the form a CP-net

Rough setmethods

(discernibilitymatrix,

discernibilityfunction)

Methods for Constructing Model

Page 71: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

71

Computing rules with respect to reducts

For each reduct we compute internal and external rules.

Internal rules. For each attribute a from R we compute rules

corresponding to a dependency:

External rules. For each attribute a from outside R we compute rules

corresponding to a dependency:

}{}){( aaR →−

}{aR→

Methods for Generating Rules

Page 72: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

72

Computing rules directly from a data table

For each attribute a of S=(U,A) we compute rules corresponding to a dependency:

}{}){( aaA →−

Methods for Generating Rules

Page 73: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

73

Rough setmethods

(discernibilitymatrix,

discernibilityfunction)

Computing underlyinginhibitor rules

Dynamic information system- weak specification

Computing transitioninhibitor rules

Creating synchronousnet model

Synchronous model inthe form of a CP-net

Dynamic information system- strong specification

Creating asynchronousnet model

Asynchronous model inthe form of a CP-net

Methods for Constructing Model

Page 74: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

74

Computing a Guard Expression

The guard expression

xxxxxx

yxyx

⇔∧⇔∨•∨¬⇔⇒•

,

][][

The Boolean expression (minimal

disjunctive normal form)

The set of all minimal rules in S

Page 75: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

75

Example 1 IF a1 THEN b0

IF a2 THEN b0

IF b1 THEN a0

IF b2 THEN a0

(a0 AND b0) OR (a0 AND b1) OR (a0 AND b2)

OR (a1 AND b0) OR (a2 AND b0)

SYNCHRONOUS CONCURRENT MODEL:

02u4

20u3

01u2

10u1

baU\A

Page 76: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

76

02u4

20u3

01u2

10u1

baU\A IF a1 THEN b0

IF a2 THEN b0

IF b1 THEN a0

IF b2 THEN a0

(a0 AND b0) OR (a0 AND b1) OR (a0 AND b2)

OR (a1 AND b0) OR (a2 AND b0)

ASYNCHRONOUS CONCURRENT MODEL:

Example 2

Page 77: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

The Synthesis ProblemInformationsystem

Booleanexpressions

Model in theform of CP-net

Net structure Guard expression

System components

Decomposition Internal andexternal links for components

(with decomposition)

Page 78: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

78

NET MODEL RECONSTRUCTION

Information

system

(old) S

New requirements

New knowledge

Net model

1

Net model

2

Decomposition

Comparison

Decomposition

Components

and linkings

Components

and linkings

Modification of a net

structure and/or guard

expressions

Information

system

(new) S*

Page 79: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

A temporal

information system

Split a given system

into time windows

Compute functional

relative reducts

Construct decision systems

(attribute values: families

of functional relative reducts

Construct prediction matrix

or flow graphs

Prediction rules

PREDICTION OF MODEL PROPERTY CHANGE

Page 80: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

PREDICTION OF MODEL PROPERTY CHANGEtime t time t+1 time t+n

A Pawlak’s flow graph expressive prediction rules

Page 81: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

EXAMPLE 1: A GENETIC SYSTEM

CCGACGGCACAAACACGCAGA

CCCAg1

CCAGGACGg3g2U\A

3u2u1u

4u

7u6u5u

8u

11u10u9u

},,{

},,{

},...,{

321

321

111

CGAVVV

gggA

uuU

ggg ===

=

=

Attribute values represent allele (states of genes):

A – Adenine

C – Cytosine

G - Guanine

Global states are interpreted as chromosomes (gene chains), attributesas genes.

Page 82: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

PROBLEMS

Define on the base of the knowledge extracted from a given data tablecoming from observations:

- all global states consistent with the knowledge extracted from a givendata table,

- all possible transitions between global states of the system definingpointed mutation (only one of genes changes its value).

Page 83: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

SYNCHRONOUS CONCURRENT MODEL

Page 84: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

ANALYSIS RESULTS OF THE GENETIC SYSTEM

New global state consistentwith all rules generated fromthe given genetic system

CCGACGGCACAAACACGCAGA

CCCAg1

CCAGGACGg3g2U\A

3u2u1u

4u

7u6u5u

8u

11u10u9u

CCA12u

Page 85: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

ASYNCHRONOUS CONCURRENT MODEL

Page 86: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

TRANSITION GRAPH OF ASYNCHRONOUS TRANSTITIONS BETWEEN GLOBAL STATES

(only one process changes its local state)

New state u12 andits relationshipswith another states

Page 87: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

87Our task is to design a traffic signals control for this crossroads

)}4,(),3,(),2,(),1,(

),4,(),3,(),2,(),1,(

),4,(),3,(),2,(),1,{(),(

}4,3,2,1{

},,{

cccc

bbbb

aaaaVADESC

VVVV

cbaA

cba

=====

=

The meaning of attribute values:

1 – red

2 – green arrow (left turn)

3 – green arrow (right turn)

4 - green

EXAMPLE 2: TRAFFIC SIGNALS

Page 88: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

88

THE FORBIDDEN STATE MATRIX

Page 89: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

89

THE FORBIDDEN TRANSITION MATRIX

Page 90: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

90

THE NET MODEL

Page 91: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

91

THE REACHABLE STATES OF THE

DESIGNED SYSTEM

Page 92: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

92100

100

000

-201

-111

111

100

-1

1

0

-1

dollar

-11

00

-10

10

oileuroU\A

3u2u1u

4u

7u6u5u

8u

11u10u9u )}1,(),0,(),1,(),2,(

),1,(),0,(

),1,(),0,(),1,{(

),(

},,{

},...,{ 111

oiloiloiloil

euroeuro

dollardollardollar

VADESC

oileurodollarA

uuU

−−

−==

=

=

EXAMPLE 2: ECONOMIC PROCESSESEconomic processes (exchange rates, oil price)

Page 93: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

93

The meaning of local states of processes:

0 – no change, i.e. [-0.5%,0.5%]

-1 – decrease [-1.5%,-0.5%)

-2 – decrease [-2.5%,-1.5%)

1 – increase (0.5%,1.5%]

2 – increase (1.5%,2.5%]

The problem is the following:

On the basis of possessed knowledge (deriving from observations)

determine:

- all states in which the processes can hold

- all transitions between states which can appear

THE PROBLEMS AND COMMENTS

Objects represent data from consecutive days.

Page 94: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

941

1

0

-2

-1

1

1

-1

0

-1

oil’

0

0

0

0

1

1

0

1

0

0

euro’

0

0

0

1

1

1

0

-1

1

0

dollar’

100

000

-201

-111

111

100

-1

1

0

-1

dollar

-11

00

-10

10

oileuroU\A

3u2u1u

4u

7u6u5u

8u

10u9u

INFORMATION SYSTEM: A WEAK

SPECIFICATION

Page 95: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

95

RULES WITH PARAMETERS

Page 96: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

96

INHIBITOR TRANSITION RULES

Page 97: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

97

NET MODEL

Page 98: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

98

OCCURRENCE GRAPH

Page 99: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

99

10-1100

1

1

0

-2

-1

1

1

-1

0

-1

oil’

0

0

0

0

1

1

0

1

0

0

euro’

0

0

0

1

1

1

0

-1

1

0

dollar’

100

000

-201

-111

111

100

-1

1

0

-1

dollar

-11

00

-10

10

oileuroU\A

3u2u1u

4u

7u6u5u

8u

10u9u

11uNew transitions between global states consistent with all transition rules

extracted from a dynamic information system DS

ANALYSIS RESULTS

Page 100: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

100

UseTool

Discovering concurrent models from data tables

http://rsds.univ.rzeszow.pl (Software)

ROSECON

Design and simulation of coloured Petri net models

http://rsds.univ.rzeszow.pl (Software)

CPNetwork

Design, analysis and simulation of coloured Petri net modelsDESIGN/CPN

Data analysis (among others: computation of reducts, rules,

classification, construction of composed classificators)

RSES

Data analysis (among others: computation of reducts, rules,

classification, discretization)

ROSETTA

SOFTWARE

In the today’s computer science development, the usefulness of proposed methods and algorithms

for real-life data is conditioned by existing suitable computer tools automating computing

processes. Therefore, in the lecture the ROSECON system is presented.

Page 101: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

101

ROSECON

ROSECON can be used to support automated:

• discovering concurrent models from experimental data,

• predicting model changes for temporal data,

• data analysis by using rough set methods

ROSECON is a computer tool supporting users in automatized discovering

net models from data tables as well as predicting their changes in time.

Page 102: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

102

ROSECON ARCHITECTURE

Page 103: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

103

CONCLUDING REMARKS

• The presented methodology can be applied for automaticfeature extraction. The processes and connections betweenthem can be interpreted as new features of the modelledsystem

• Constructed models can be useful for designers and analysts to:

- determine some properties (concerning structures and

behaviours) of modelled systems

- extract new knowledge about systems

- verify their descriptions or specifications

Page 104: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

104

• to consider the prediction problem of property changingnet models in non-stationary data systems

• to discover modular and hierarchical concurrent models

• discovery of dynamical models from data based on roughgranular calculus of changes and interactions

FURTHER WORKS

Page 105: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

105

A NEW SYNTHESIS PROBLEM

• Given the hierarchical information system A

• Define theory Th(A) of A (consisting of a set of rules

describing temporal and spatial dependencies in A)

• Th(A) defines the maximal extension of A

• Construct a concurrent system consistent with Th(A)

Page 106: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

106

More information about „Discovering

Concurrent Models from Data” can be found in

collective book,

dedicated to Professor Zdzisław Pawlak

Page 107: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

107

This book offers the most comprehensive

coverage of key rough computing research,

surveying a full range of topics from

granular computing to systems theory.

In particular:

Chapter XII: Rough Sets for Discovering

Concurrent System Models from Data

Tables

by Krzysztof Pancerz and Zbigniew Suraj

Information Science REFERENCE, Hershey – New York, 2008

Page 108: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

108

More information about inhibitory rules can be

found in our book

Page 109: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

109

Page 110: Methodsfor DiscoveringProcess ModelsandTheirPropertiesinDataidss.cs.put.poznan.pl/site/fileadmin/seminaria/2010/ZSuraj_PP_110111_ang.pdf · 4 INTRODUCTION • A notion of concurrent

110

Thank you!