The Integrated Enterprise Data Warehouse Engineering

28
1 The Integrated Enterprise The Integrated Enterprise Data Warehouse Engineering Data Warehouse Engineering Software Engineering Conference (Russia) (SEC(R)-2008) October 21–25, 2008, Moscow, Russia Dr. Sergey V. Zykov, Ph.D. TEKAMA Ltd.

description

The Integrated Enterprise Data Warehouse Engineering. Dr. Sergey V. Zykov , Ph.D. TEKAMA Ltd. Software Engineering Conference (Russia) (SEC(R)-2008) October 21–25, 2008, Moscow, Russia. The Integrated Enterprise Data Warehouse Engineering. - PowerPoint PPT Presentation

Transcript of The Integrated Enterprise Data Warehouse Engineering

Page 1: The Integrated Enterprise  Data Warehouse Engineering

1

The Integrated Enterprise The Integrated Enterprise Data Warehouse EngineeringData Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Dr. Sergey V. Zykov, Ph.D.TEKAMA Ltd.

Page 2: The Integrated Enterprise  Data Warehouse Engineering

2

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Heterogeneous EDW engineering – a challenging SE task The methodology combines formal model and SDK for class-level

association-based relationships. Problem domain features: heavy data burden - in 2005 total data size of Intel Corp. exceeded

3.2 petabytes (over 120,000 employees in 57 countries) - high object classes complexity - incomplete information on the structure of certain instantiations

of the classes; - the set of class attributes and operations can be determined

rigorously. Reasons for methodology application: - variety of heterogeneous classes, - importance of association-based inter-class relationships - class inference possible even under certain % of weak-structured

class instancesSoftware Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 3: The Integrated Enterprise  Data Warehouse Engineering

3

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

The methodology vs. ontology-based approaches (OBA):

OBA (e.g. Cyc) efficiency is comparable only under a total class-level uncertainty, which is a different problem domain than ECM

Thesaurus needed for the OBA to meet the relevance required

The methodology uses similar foundations and tools as OBA (UML and XML-based tools, predicate calculus-based CycL, “conceptual model” etc.) for data modelling and integration

OBA lack a balanced combination of formal models and industry-level SDKs (incl. visualization) for ECM lifecycle, resulting in low scalability and non-suitability for the major enterprise-level tasks

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 4: The Integrated Enterprise  Data Warehouse Engineering

4

Objective, tasks, theoretical background

Objective: to make a software development methodology, which supports entire lifecycle of the enterprise software in the global computational environment

Tasks: - formalizing stages and levels of the methodology; - mathematical modeling; - creating CASE- and RAD-tools ; - implementing the methodology (prototype, full-scale).

Background: finite sequence, category, computation (D.Scott), semantic networks.

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 5: The Integrated Enterprise  Data Warehouse Engineering

5

Innovations – the integrate methodology includes :

1. a set of data models for problem domain objects and for computational environment (CM, AMCM);

2. algorithm of the new component integration into the software implemented;

3. personalization procedure for enterprise content access;

4. SDKs: ConceptModeller, Content Management System

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 6: The Integrated Enterprise  Data Warehouse Engineering

6

EDW problems solved by the methodology

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 7: The Integrated Enterprise  Data Warehouse Engineering

7

Enterprise software lifecycle support by the methodology

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 8: The Integrated Enterprise  Data Warehouse Engineering

8

Problem domain modeling

Data object modeling: “class object value” Class – collection of data objects of the integrated problem domain; Object – class instantiation by CMS template (metadata partial evaluation); Value – static HTML page generated by CMS (full evaluation).

Benefits: - evolves from the object-oriented approach; - develops the existing models ([V.E.Wolfengagen’s CM] et al.) in relation to global computational environment

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 9: The Integrated Enterprise  Data Warehouse Engineering

9

Modeling classes of data objects

Classes C of problem domain data objects are modeled by domains: C = Iw:[D] v:D (w(v)  ) = {v:D|}, where: 1) C and D are in a partial order relation (C ISA D); 2) is a criterion of data object w belonging to class C from the viewpoint of a problem domain expert.

Class of “n-dimensional” data objects is modeled by an n-arity relation:Rn = Iw: [V1,..,Vn] v1:V1 … vn:Vn (w [v1,…,vn]  ) = {[ v1:V1 , …, vn:Vn ] | }, where: – “n-dimensional” criterion of data object w belonging to class Rn

Class is a collection of ordered pairs (vi,Vi), where vi  is its i-th attribute (either of data or of metadata); Vi – attribute type.

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 10: The Integrated Enterprise  Data Warehouse Engineering

10

From problem domain to computational environment (1)

Under class C instantiation with assignment a1 and template k of CMS HTML page, evaluation of the template collection M sets into “true” value its element mi, which index (k) equals the template number:

M = (m1,…, mk,…, mN), i=1,…,N mi{0,1};

[M|k] = (m1*,…, mk*,…, mN*), где mi*=1, i=k и mi*=0, ik.

Certain attributes of class metadata v1,…,vn are evaluated according to ti conditions of template:

[(v1:V1,…,vn:Vn)]ti= ([v1]|(t1),…, [vn]|(tn)) = (v1’:V1’,…,vn’:Vn’),

причем V1’ ISA V1,…, Vn’ ISA Vn.

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 11: The Integrated Enterprise  Data Warehouse Engineering

11

From problem domain to computational environment (2)

The second assignment a2 instatiates non-evaluated

template elements (v1’,…,vn’) of CMS HTML-page

by content values (c1,…, cn) :

[(v1’:V1’,…,vn’:Vn’)]c = (v1’/c1,…, vn’/cn),

where c1:С1,…, cn:Сn,

and C1 ISA V1’,…, Cn ISA Vn’.

Сi class template is Ti = (i,(t1,…,tn)),

where (t1,…,tn) is the vector of the evaluated class metadata

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

a1

Value (portal )

Class (UML)

-Name : char-ColorDepth : int-Resolution : int-ID : long-Width : int-Height : int-TemplMask : long double

Photo

Object (CMS )

a2

Page 12: The Integrated Enterprise  Data Warehouse Engineering

12

Formal syntax of the CMS abstract machine

Let us collect all the CMS abstract machine language identifiers into Ide domain, commands – into Com domain, and expressions – into Exp domain:

Ide ={I | I – identifier}; Com ={C | C – command}; Exp ={E | E – expression}.

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 13: The Integrated Enterprise  Data Warehouse Engineering

13

Formal semantics of the CMS abstract machine (1)Order of construction:

– standard domains (most often used);– finite domains (including explicitly enumerable elements);– domain constructors – operations of building new domains out of

existing ones;– composite domain formalization based on standard domains and

domain constructors.

Domain constructors :

- functional space: [D1D2];

- Cartesian product: [D1D2…Dn]; - sequence: D*;

- disjunctive sum:[D1+  D2+… +Dn].

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 14: The Integrated Enterprise  Data Warehouse Engineering

14

Formal semantics of the CMS abstract machine (2)

State = Memory  Input  Output; Memory = Ide  [Value + {unbound}];

Input = Value*; Output = Value*; Value = Type1 + Type2 + …

Constant denotate: <variable, value>

Identifier denotate: <variable_in_memory, identifier, state>

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 15: The Integrated Enterprise  Data Warehouse Engineering

15

Formal semantics of the CMS abstract machine (3)

Semantic function for expression:E: Exp  [ State [[Value State] + {error}]];

Semantic function for command:С : Com[State[State+{error}]].

Semantic statement for identifier:E [I] s = (m, I = unbound) error, (m, I, s).

Semantic statement for assignment command:C [I=E] = E [E] * v (m , i, o) . (m [v/I], i, o).

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 16: The Integrated Enterprise  Data Warehouse Engineering

16

Bi-directional software development

in ConceptModeller CASE-toolkit

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

FO

RM

AL

IZA

TIO

NNATURAL

LANGUAGE

Business situations

in terms of

natural language

FORMAL LANGUAGEOF FRAME

DESCRIPTION(XML)

Visual frame

representation

C#.NET

FRAMETRANSLATION

INTO UML DIAGRAMS

Mapping

function from

frames to UML

diagrams

C#.NET

FORMAL LANGUAGEOF DIAGRAMS

(XML / RATIONAL)

UML diagrams

visualization

C#.NET

FORMAL LANGUAGEOF DIAGRAMS

(IBM RATIONAL / MS VISIO)

IBM RATIONAL,ORACLE DEVELOPER,

MS VISUAL STUDIO

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

VIS

UA

LIZ

AT

ION

TR

AN

SL

AT

ION

VIS

UA

LIZ

AT

ION

Business situations

in terms of

UML diagrams

Page 17: The Integrated Enterprise  Data Warehouse Engineering

17

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Solution Arcitecture

в

Oracle InterOffice

Oracle Appls

Media

ArchiveOracleDBMS Cluster

WWW server

SD

Catalyst8500

Power Supply 0CISCO YSTEMSS Power Supply 1

SwitchProcessor

SERIES

Content DB

Notebook

Mobile phone

PocketPC /PDA

Home desktop PC

RS CS TR RD TD CDTALK / DATA

TALK

Enterprise LAN

Oracle Portal/Intranet server

Content DB

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 18: The Integrated Enterprise  Data Warehouse Engineering

18

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

CMS logical structure

Keywords

IM

ImagesReferences, keywords

Image parameters

Special section parameters,

Message status

KeywordsModulePress Release

Module

Media News

NCNews Columns

Module

MenuModulePages

CMConfiguration management

ModuleEvents

ModuleSpeeches

SSСпециальные разделы

ModuleSynchronize

ModulePublish Cycle

ADAdministration

Menu parameters

Keywords

Keywords

Page status, publishing cycle

params

Params and states for news pages

Page status, publishing cycle params

References, keywords

Menu items paramsMedia news parameters

(menu items, etc.)

Keywords

Image state

parameters

Special section parameters

(menu items etc.)

Page status, publishing cycle params

Page status, publishing cycle params

Params and states for menus and messages

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 19: The Integrated Enterprise  Data Warehouse Engineering

19

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Structure of the integrated enterprise program system

Deductions and Bonuses

Module

VacanciesModule

LeavesModule

Personnel Movement

ModuleAssets

Responsibility

Module

Appraisal and Testing

Module

Personal Data

Module

TrainingsModule

PR

Payroll Information System

FAFixed Assets Management

Information System

AP

Accounts Payable IS

DC

Document Control Information System

AR

Accounts Receivable IS

Deductions, payments

Address book , organization

structure

Asset Features

Параметры аттестации

Appraisal params

Appraisal reports

Параметры выплат

Responsibility Reports

Assets Refixed

Leave params Testing

params

NecessaryTrainings

Appraisal params

Appraisal params

Appraisal params

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 20: The Integrated Enterprise  Data Warehouse Engineering

20

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Comparing the software development methodology

to the commercial methodologies available

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Methodology Mathematical model

Integrated methodology

Page 21: The Integrated Enterprise  Data Warehouse Engineering

21

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Implementation features comparison

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Software

ITERA CMS

Mu

lti-

lan

gu

age

pu

blic

atio

n

Java

ser

vlet

s

.NE

T w

eb s

ervi

ces

UM

L d

iag

ram

s

WY

SIW

YG

-map

pin

g

Inte

gra

ted

ER

P

info

rmat

ion

sys

tem

re

po

rts

Inte

gra

tio

n w

ith

leg

acy

info

rmat

ion

sys

tem

s

Sm

art,

tem

pla

te-

bas

ed d

esig

n

Co

mp

lex

dat

a o

bje

ct

emb

edd

ing

Page 22: The Integrated Enterprise  Data Warehouse Engineering

22

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

TCO comparison results

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

/person

CMS + ConceptModeller

Page 23: The Integrated Enterprise  Data Warehouse Engineering

23

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

ROI comparison results

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

CMS + ConceptModeller

, yrs

Page 24: The Integrated Enterprise  Data Warehouse Engineering

24

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Implementation terms comparison results

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

CMS +

Conce

ptM

odell

er

Optimistic scenario

Pessimistic scenario

Page 25: The Integrated Enterprise  Data Warehouse Engineering

25

Theoretical results:

1) A system of formal models for problem domain and computational environment (rigorous semantics, entire lifecycle support, content management orientation);

2) Algorithm of integrating new components to the enterprise software system (problem-oriented, heterogeneous software architecture support);

3) personalization procedure for accessing enterprise content

(flexible, reliable)

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 26: The Integrated Enterprise  Data Warehouse Engineering

26

Engineering results:

1) CASE- and RAD-toolkits:  a)  ConceptModeller (rigorous semantics; compatible to up-to-

date CASE-tools, ERP and и legacy systems; re-engineering; XML/BPR/UML standard support);

b) ITERA CMS (rigorous semantics; rapid publishing of complex content; WYSIWYG interface; office products integration).

2) Architecture (environment unification of heterogeneous enterprise

applications; role personalization with situation dynamics)

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 27: The Integrated Enterprise  Data Warehouse Engineering

27

Practical value of the results obtained:

1) implementation term-and-cost reduction (TCO, ROI) as compared to commercially available

software by 30% (average);

2) major enterprise software features improvement:- scalability; - reliability; - ergonomics.

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia

Page 28: The Integrated Enterprise  Data Warehouse Engineering

28

The Integrated Enterprise Data Warehouse EngineeringThe Integrated Enterprise Data Warehouse Engineering

Research results approbation:

Over 30 presentations on international conferences,over 50 papers (incl. 4 books).

Research grants from MSR (2002-2003) and RFBR (1996-2006).

ITERA implementation (150 companies, 10,000 employees): CMS (2002); Internet-portal (2003); Intranet-portal (2004)

Other implementations: ICP (RAS), Sterkh Foundation, Ashihara Karate Association, Russian Orthodox Church, etc.

Curricula (MEPhI, MSUFI, INTUIT, LANIT, SoftLine) – over 3000 graduates

Software Engineering Conference (Russia) (SEC(R)-2008)

October 21–25, 2008, Moscow, Russia