DataEd Webinar: Unlocking Business Value Through Data Modeling and Data Architecture (Part 2 of 2)

Welcome: Data Modeling & Data Architecting for Business Value pt. 2

Date: February 12, 2013
Time: 2:00 PM ET
Presented by: Peter Aiken, PhD

When asked why they are architecting data, many in the practice answer: "Because that is what must be done." However, a better approach to this question is to speak in terms that are understood in the executive suite – business results! All of our organizations are faced with various organizational challenges that require analysis. Building new systems is just one example. This webinar describes the use of data architecting as a basic analysis method (one of many that good analysts should keep in their "toolbox"). I will demonstrate various uses of data architecting to inform, clarify, understand, and resolve aspects of a variety of business problems. As opposed to showing how to architect data, I will show how to use data architecting to solve business problems. The goal is for you to be able to envision a number of uses for data architectures that will raise the perceived utility of this analysis method in the eyes of the business.

Produced by Data Blueprint, 10124-C W. Broad St, Glen Allen, VA 23060. Classification: Education. © Copyright this and previous years by Data Blueprint - all rights reserved!

Transcript of DataEd Webinar: Unlocking Business Value Through Data Modeling and Data Architecture (Part 2 of 2)



Get Social With Us!

Live Twitter Feed
Join the conversation!
Follow us: @datablueprint, @paiken
Ask questions and submit your comments: #dataed

Like Us on Facebook
www.facebook.com/datablueprint
Post questions and comments
Find industry news, insightful content and event updates.

Join the Group
Data Management & Business Intelligence
Ask questions, gain insights and collaborate with fellow data management professionals

Meet Your Presenter: Dr. Peter Aiken

• Internationally recognized thought-leader in the data management field - 30 years of experience
  – Recipient of multiple international awards
  – Founder, Data Blueprint (http://datablueprint.com)
• 7 books and dozens of articles
• Experienced w/ 500+ data management practices in 20 countries
• Multi-year immersions with organizations as diverse as the US DoD, Deutsche Bank, Nokia, Wells Fargo, the Commonwealth of Virginia and Walmart


Upcoming Special Event

Leading the Data Asset Management Team: CDO or Top Data Job?

Join Peter Aiken, Ph.D. and Micheline Casey for this interactive discussion on the role of Chief Data Officer (CDO) or Top Data Job (TDJ).

Attendees will be presented with big ideas and alternative ways not only for how to think about the role of CDO/TDJ but also how to plan and establish a CDO/TDJ position at their organizations. This webinar is intended to provide viewers with deep insights from two data management thought leaders.

March 19, 2013 @ 2:00 PM ET/11:00 AM PT


Peter Aiken: Data Modeling & Data Architecting for Business Value pt. 1

Data Modeling & Data Architecting for Business Value pt. 2

Agenda

1. What is Data Management/DAMA/DM BoK/CDMP?
2. Brief review of Part 1
3. What is Data/Information Architecture?
4. Why is Data/Information Architecture Important?
5. Data Engineering/Leverage
6. Example: Software Package Implementation
7. Example: Donation Center Processing
8. Example: Text Mining/Analytics
9. Take Aways, References & Q&A


Five Integrated DM Practices


• Data Program Coordination: Manage data coherently.
• Organizational Data Integration: Share data across boundaries.
• Data Stewardship: Assign responsibilities for data.
• Data Development: Engineer data delivery systems.
• Data Support Operations: Maintain data availability.

Data Management Practices Hierarchy (after Maslow)

• 5 Data Management Practices Areas / Data Management Basics
• Are necessary but insufficient prerequisites to organizational data leveraging applications (that is Self Actualizing Data or Advanced Data Practices)

Basic Data Management Practices
  – Data Program Management
  – Organizational Data Integration
  – Data Stewardship
  – Data Development
  – Data Support Operations

Advanced Data Practices
• Cloud
• MDM
• Mining
• Analytics
• Warehousing
• Big

Image: http://3.bp.blogspot.com/-ptl-9mAieuQ/T-idBt1YFmI/AAAAAAAABgw/Ib-nVkMmMEQ/s1600/maslows_hierarchy_of_needs.png

DAMA DM BoK & CDMP

• Published by DAMA International
  – The professional association for Data Managers (40 chapters worldwide)
  – DMBoK organized around
    – Primary data management functions focused around data delivery to the organization (more at dama.org)
    – Organized around several environmental elements
• CDMP
  – Certified Data Management Professional
  – DAMA International and ICCP
  – Membership in a distinct group made up of your fellow professionals
  – Recognition for your specialized knowledge in a choice of 17 specialty areas
  – Series of 3 exams
  – For more information, please visit:
    • http://www.dama.org/i4a/pages/index.cfm?pageid=3399
    • http://iccp.org/certification/designations/cdmp

[Figure: Data Management Functions]


Data Modeling for Business Value (REVIEW)

• Goal must be shared IT/business understanding
  – No disagreements = insufficient communication
• Data sharing/exchange is largely and highly automated and thus dependent on successful engineering
  – It is critical to engineer a sound foundation of data modeling basics (the essence) on which to build advantageous data technologies
• Modeling characteristics change over the course of analysis
  – Different model instances may be useful to different analytical problems
• Incorporate motivation (purpose statements) in all modeling
  – Modeling is a problem defining as well as a problem solving activity - both are inherent to architecture
• Use of modeling is much more important than selection of a specific modeling method
• Models are often living documents
  – The more easily it adapts to change, the resource utilization
• Models must have modern access/interface/search technologies
  – Models need to be available in an easily searchable manner
• Utility is paramount
  – Adding color and diagramming objects customizes models and allows for a more engaging and enjoyable user review process

Inspired by: Karen Lopez http://www.information-management.com/newsletters/enterprise_architecture_data_model_ERP_BI-10020246-1.html?pg=2


Data Architecture Management


from The DAMA Guide to the Data Management Body of Knowledge © 2009 by DAMA International


• Models more downward facing - detail
• Architecture is higher level of abstraction - integration
• In the past architecture attempted to gain complete (perfect) understanding
  – Not timely
  – Not feasible
• Focus instead on architectural components
  – Governed by a framework
  – More immediate utility
• http://www.architecturalcomponentsinc.com


Levels of Abstraction, Completeness and Utility


Architecture


Architecture is both the process and product of planning, designing and constructing space that reflects functional, social, and aesthetic considerations. A wider definition may comprise all design activity from the macro-level (urban design, landscape architecture) to the micro-level (construction details and furniture). In fact, architecture today may refer to the activity of designing any kind of system and is often used in the IT world.


Architecture Representation


• Architectures are the symbolic representation of the structure, use and reuse of resources

• Common components are represented using standardized notation

• Are sufficiently detailed to permit both business analysts and technical personnel to separately read the same model, and come away with a common understanding and yet they are developed effectively

• A specific definition – 'Understanding an architecture'

– Documented and articulated as a digital blueprint illustrating the commonalities and interconnections among the architectural components

– Ideally the understanding is shared by systems and humans


Understanding


Typically Managed Architectures

• Process Architecture
  – Arrangement of inputs -> transformations = value -> outputs
  – Typical elements: Functions, activities, workflow, events, cycles, products, procedures
• Systems Architecture
  – Applications, software components, interfaces, projects
• Business Architecture
  – Goals, strategies, roles, organizational structure, location(s)
• Security Architecture
  – Arrangement of security controls in relation to IT Architecture
• Technical Architecture/Tarchitecture
  – Relation of software capabilities/technology stack
  – Structure of the technology infrastructure of an enterprise, solution or system
  – Typical elements: Networks, hardware, software platforms, standards/protocols
• Data/Information Architecture
  – Arrangement of data assets supporting organizational strategy
  – Typical elements: specifications expressed as entities, relationships, attributes, definitions, values, vocabularies

Information Architectures

• The underlying (information) design principles upon which construction is based
  – Source: http://architecturepractitioner.blogspot.com/
• … are plans, guiding the transformation of strategic organizational information needs into specific information systems development projects
  – Source: Internet
• A framework providing a structured description of an enterprise's information assets (including structured data and unstructured or semistructured content) and the relationship of those assets to business processes, business management, and IT systems.
  – Source: Gene Leganza, Forrester 2009
• "Information architecture is a foundation discipline describing the theory, principles, guidelines, standards, conventions, and factors for managing information as a resource. It produces drawings, charts, plans, documents, designs, blueprints, and templates, helping everyone make efficient, effective, productive and innovative use of all types of information."
  – Source: Information First by Roger & Elaine Evernden, 2003 ISBN 0 7506 5858 4 p.1.
• Defining the data needs of the enterprise and designing the master blueprints to meet those needs
  – Source: DM BoK


What do you use an information architecture for?

Illustration by murdock23 @ http://designfestival.com/information-architecture-as-part-of-the-web-design-process/


Data Architecture – Better Definition

• Common vocabulary expressing integrated requirements ensuring that data assets are stored, arranged, managed, and used in systems in support of organizational strategy*
• All organizations have information architectures
• Some are better understood and documented (and therefore more useful) than others

*Source: Aiken 2010

Vocabulary is Important - Tank, Tanks, Tankers, Tanked

How one inventory item proliferates data throughout the chain

• 555 Subassemblies & subcomponents
• 17,659 Repair parts or Consumables

• System 1: 18,214 Total items, 75 Attributes/item, 1,366,050 Total attributes
• System 2: 47 Total items, 15+ Attributes/item, 720 Total attributes
• System 3: 16,594 Total items, 73 Attributes/item, 1,211,362 Total attributes
• System 4: 8,535 Total items, 16 Attributes/item, 136,560 Total attributes
• System 5: 15,959 Total items, 22 Attributes/item, 351,098 Total attributes

Total for the five systems shown above: 59,350 Items, 179 Unique attributes, 3,065,790 values
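The arithmetic behind this slide can be checked with a short script. The sketch below is only an illustration, not part of the webinar: the dictionary layout and function name are assumptions, and System 2's "15+" is treated as a lower bound of 15. With these figures the total attribute values match the slide's 3,065,790, while the per-system item counts sum to 59,349, one fewer than the 59,350 shown on the slide.

```python
# Per-system figures as read from the slide.
# Assumption: System 2's "15+" attributes/item is treated as a lower bound of 15.
systems = {
    "System 1": {"items": 18_214, "attrs_per_item": 75, "total_attrs": 1_366_050},
    "System 2": {"items": 47, "attrs_per_item": 15, "total_attrs": 720},
    "System 3": {"items": 16_594, "attrs_per_item": 73, "total_attrs": 1_211_362},
    "System 4": {"items": 8_535, "attrs_per_item": 16, "total_attrs": 136_560},
    "System 5": {"items": 15_959, "attrs_per_item": 22, "total_attrs": 351_098},
}


def check_system(row):
    """Return True when items x attributes/item reproduces the stated total."""
    return row["items"] * row["attrs_per_item"] == row["total_attrs"]


# Which systems have internally consistent figures?
consistent = {name: check_system(row) for name, row in systems.items()}

# Roll the five systems up into the slide's grand totals.
total_items = sum(row["items"] for row in systems.values())
total_values = sum(row["total_attrs"] for row in systems.values())

for name, ok in consistent.items():
    print(f"{name}: items x attributes/item matches stated total -> {ok}")
print(f"Total items: {total_items:,}")
print(f"Total attribute values: {total_values:,}")
```

System 2 is the only row that does not multiply out exactly, which is expected given its open-ended "15+" attribute count.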


Business Value

• Agency units are carrying $1.5 billion worth of expired inventory
  – Generates unnecessary costs and negative impacts on operations, including:
    • Mission Readiness
    • Storage
    • Handling
    • Opportunity
    • Systemic
    • Maintenance


Why Architectural Models?


• Would you build a house without an architecture sketch?

• Model is the sketch of the system to be built in a project.

• Would you like to have an estimate how much your new house is going to cost?

• Your model gives you a very good idea of how demanding the implementation work is going to be!

• If you hired a set of constructors from all over the world to build your house, would you like them to have a common language?

• Model is the common language for the project team.

• Would you like to verify the proposals of the construction team before the work gets started?

• Models can be reviewed before thousands of hours of implementation work will be done.

• If it was a great house, would you like to build something rather similar again, in another place?

• It is possible to implement the system to various platforms using the same model.

• Would you drill into a wall of your house without a map of the plumbing and electric lines?

• Models document the system built in a project. This makes life easier for the support and maintenance!


Architecture Examples: Bad


Poor Quality Foundation


What they think they are purchasing!


Polling Question #1

Do you believe that your organization has a solid data architectural foundation on which to build their IT projects?

a) Yes
b) No

Polling Question #2

Do you believe that your organization is capable of building a solid data architectural foundation on which to build their IT projects?

a) Yes
b) No

Context Diagrams Show System Boundaries


Too Much Detail

Web Developers Understand IA
http://www.jeffkerndesign.com

Database Architecture Focus

[Figure labels: Program D, Program E, Program F, Program G, Program H, Program I; Application domain 2, Application domain 3]

Data Architecture Focus has potentially greater Business Value

A Model Specifying Relationships Among Important Terms
[Built on definition by Dan Appleton 1983]

[Figure labels: Fact, Meaning, Data, Information, Request, Intelligence, Use]

1. Each FACT combines with one or more MEANINGS.
2. Each specific FACT and MEANING combination is referred to as a DATUM.
3. An INFORMATION is one or more DATA that are returned in response to a specific REQUEST.
4. INFORMATION REUSE is enabled when one FACT is combined with more than one MEANING.
5. INTELLIGENCE is INFORMATION associated with its USES.

Wisdom & knowledge are often used synonymously
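The five rules above can be sketched as a small data structure. This is a hypothetical illustration rather than anything shown in the webinar: the class names mirror the slide's terms, while the `answer` helper, the sample facts, and the sample meanings are all assumptions made up for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Datum:
    """Rule 2: one specific FACT and MEANING combination."""
    fact: str
    meaning: str


@dataclass
class Information:
    """Rule 3: one or more DATA returned in response to a specific REQUEST."""
    request: str
    data: list


@dataclass
class Intelligence:
    """Rule 5: INFORMATION associated with its USES."""
    information: Information
    uses: list = field(default_factory=list)


def answer(request, store, meaning):
    """Hypothetical lookup: return the data whose meaning matches the request."""
    return Information(request, [d for d in store if d.meaning == meaning])


# Rule 1 and rule 4: the fact "42" is combined with two different meanings.
store = [
    Datum("42", "unit price in USD"),
    Datum("42", "quantity on hand"),
    Datum("17", "quantity on hand"),
]

info = answer("How many units are in stock?", store, "quantity on hand")
intel = Intelligence(info, uses=["reorder planning"])

# Facts that enable information reuse: those paired with more than one meaning.
meanings_by_fact = {}
for datum in store:
    meanings_by_fact.setdefault(datum.fact, set()).add(datum.meaning)
reused_facts = {fact for fact, meanings in meanings_by_fact.items() if len(meanings) > 1}

print(info.data)
print(reused_facts)
```

Keeping the fact and its meaning as separate fields is what makes rule 4 visible: the same stored value can serve several requests without being duplicated.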


How are data structures expressed as architectures?

• Details are organized into larger components

• Larger components are organized into models

• Models are organized into architectures

[Figure: diagrams built from components A, B, C and D, one of them with a component marked "?"]

Architectures Comprise a Network of Networks


How are Data Models Expressed as Architectures?

• Attributes are organized into entities/objects
  – Attributes are characteristics of "things"
  – Entities/objects are "things" whose information is managed in support of strategy
  – Examples
• Entities/objects are organized into models
  – Combinations of attributes and entities are structured to represent information requirements
  – Poorly structured data constrains organizational information delivery capabilities
  – Examples
• Models are organized into architectures
  – When building new systems, architectures are used to plan development
  – More often, data managers do not know what existing architectures are and - therefore - cannot make use of them in support of strategy implementation
  – Why no examples?
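The nesting described above (attributes into entities, entities into models, models into architectures) can be sketched in a few lines. This is a minimal illustration under assumptions: the entity names, attribute names, and the `shared_entities` helper are hypothetical and do not come from the webinar.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """A "thing" whose information is managed, described by its attributes."""
    name: str
    attributes: list


@dataclass
class Model:
    """A structured combination of entities representing information requirements."""
    name: str
    entities: list = field(default_factory=list)


@dataclass
class Architecture:
    """A collection of models."""
    name: str
    models: list = field(default_factory=list)

    def shared_entities(self):
        """Map each entity name used by more than one model to those model names."""
        seen = {}
        for model in self.models:
            for entity in model.entities:
                seen.setdefault(entity.name, set()).add(model.name)
        return {name: sorted(models) for name, models in seen.items() if len(models) > 1}


# Hypothetical entities and models, for illustration only.
customer = Entity("Customer", ["customer_id", "name"])
order = Entity("Order", ["order_id", "customer_id", "order_date"])
invoice = Entity("Invoice", ["invoice_id", "order_id", "amount"])

sales = Model("Sales", [customer, order])
billing = Model("Billing", [customer, order, invoice])
architecture = Architecture("Enterprise data architecture", [sales, billing])

print(architecture.shared_entities())
```

Once models are held together at the architecture level, questions such as "which entities do several models depend on?" become simple lookups, which is one way an existing architecture can be put to use once it is known and documented.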


How do data structures support organizational strategy?

• Consider the opposite question?
  – Were your systems explicitly designed to be integrated or otherwise work together?
  – If not then what is the likelihood that they will work well together?
  – In all likelihood your organization is spending between 20-40% of its IT budget compensating for poor data structure integration
  – They cannot be helpful as long as their structure is unknown
• Two answers
  – Achieving efficiency and effectiveness goals
  – Providing organizational dexterity for rapid implementation

[Figure labels: Computers; Human resources; Communication facilities; Software; Management responsibilities; Policies, directives, and rules; Data]


What Questions Can Architectures Address?

• How and why do the components interact?
• Where do they go?
• When are they needed?
• Why and how will the changes be implemented?
• What should be managed organization-wide and what should be managed locally?
• What standards should be adopted?
• What vendors should be chosen?
• What rules should govern the decisions?
• What policies should guide the process?


Data Architectures produce and are made up of information models that are developed in response to organizational needs

[Figure: Organizational Needs become instantiated and integrated into a Data/Information Architecture, which authorizes and articulates Information System Requirements, which satisfy specific organizational needs]


Polling Question #3

• Our organization is using enterprise data modeling to achieve integration
  – a) Yes
  – b) No


Polling Question #4

• Our organization should be using enterprise data modeling to achieve integration
  – Yes
  – No


Data Leverage

• Data Leverage permits organizations to better manage their most powerful yet under-utilized, poorly managed, durable asset - data
  – within the system and
  – with organizational data exchange partners.
• Leverage is obtained by implementation of data-centric technologies, processes, and human skill sets.
• Leverage is increased by elimination of data ROT (redundant, obsolete, or trivial)
• Treating data more asset-like simultaneously
  1. lowers organizational IT costs and
  2. increases organizational knowledge worker productivity

[Figure labels: Less ROT, Technologies, Process, People]

Architecture Evolution

[Figure labels: Conceptual, Logical, Physical; Validated, Not Validated]

Application-Centric Development

Strategy -> Goals/Objectives -> Systems/Applications -> Network/Infrastructure -> Data/Information

• In support of strategy, the organization develops specific goals/objectives
• The goals/objectives drive the development of specific systems/applications
• Development of systems/applications leads to network/infrastructure requirements
• Data/information are typically considered after the systems/applications and network/infrastructure have been articulated
• Problems with this approach:
  – This ensures that data is formed around the application and not the organizational information requirements
  – Processes are narrowly formed around applications
  – Very little data reuse is possible

Original articulation from Doug Bagley @ Walmart

Data-Centric Development Flow

Strategy -> Goals/Objectives -> Data/Information -> Network/Infrastructure -> Systems/Applications

• In support of strategy, the organization develops specific goals/objectives
• The goals/objectives drive the development of specific data/information assets with an eye to organization-wide usage
• Network/infrastructure components are developed to support organization-wide use of data
• Development of systems/applications is derived from the data/network architecture
• Advantages of this approach:
  – Data/information assets are developed from an organization-wide perspective
  – Systems support organizational data/information needs and complement organizational process flows
  – Data/information reuse is maximized

Original articulation from Doug Bagley @ Walmart


Why is Data Architecture Important?

• Poorly understood
  – Data architecture asset value is not well understood
• Inarticulately explained
  – Little opportunity to obtain learning and experience
• Indirectly experienced
  – Cost organizations millions each year in productivity/redundant and siloed efforts
  – Example: Poorly thought out software purchases

Architectural Work Product Components may be defined as:

• The intersection of common business functionality and the subsets of the organizational technology and data architectures used to implement that functionality

• Component definition is an important activity because CM2 component engineering is focused on an entire component as an analysis unit. A concrete example of a component might be the business processes, the technology, and the data supporting organizational human resource benefits operations. This same component could be described simply as the "PeopleSoft™ version 7.5 benefits module implemented on Windows 95." The example illustrates the integration of the three primary PeopleSoft metadata structures describing the business processes used to organize the workflow, the menu navigation required to access system functionality, and the data which, when combined with meanings provided by the panels, provided information to the knowledge workers.


Engineering Standards


Hierarchical System Functional Decomposition

System
• Process 1
– Subprocess 1.1
– Subprocess 1.2
– Subprocess 1.3
• Process 2
• Process 3


A three-level decomposition of the model views from the governmental pay and personnel scenario

Level 1: Pay and personnel
• Level 2: Employment
– Level 3: Recruitment and selection
• Level 2: Personnel administration
– Level 3: Employee relations; Employee compensation changes; Salary planning; Classification and pay; Job evaluation; Benefits administration; Health insurance plans; Flexible spending accounts; Group life insurance; Retirement plans
• Level 2: Payroll
– Level 3: Payroll administration; Payroll processing; Payroll interfaces
• Level 2: Development
– Level 3: N/A
• Level 2: Training administration
– Level 3: Career planning and skills inventory; Work group activities
• Level 2: Health and safety
– Level 3: Accidents and workers compensation; Health and safety programs


A relatively complex model view decomposition

Health care system
1 Patient administration
– 1.1 Registration
– 1.2 Admission
– 1.3 Disposition
– 1.4 Transfer
– 1.5 Medical record
– 1.6 Administration
– 1.7 Patient billing
– 1.8 Patient affairs
– 1.9 Patient management
2 Patient appointments and scheduling
– 2.1 Create or maintain schedules
– 2.2 Appoint patients
– 2.3 Record patient encounter
– 2.4 Identify patient
– 2.5 Identify health care provider
3 Nursing
– 3.1 Patient care
– 3.2 Unit management
4 Laboratory
– 4.1 Results reporting
– 4.2 Specimen processing
– 4.3 Result entry processing
– 4.4 Laboratory management
– 4.5 Workload support
5 Pharmacy
– 5.1 Unit dose dispensing
– 5.2 Controlled drug inventory
– 5.3 Outpatient
6 Radiology
– 6.1 Scheduling
– 6.2 Exam processing
– 6.3 Exam reporting
– 6.4 Special interest and teaching
– 6.5 Radiology workload reporting
7 Clinical dietetics
– 7.1 Establish parameters
– 7.2 Receive diet orders
8 Order entry and results
– 8.1 Reporting
– 8.2 Enter and maintain orders
– 8.3 Obtain results
– 8.4 Review patient information
– 8.5 Clinical desktop
9 System management
– 9.1 Logon and security management
– 9.2 Archive run management
– 9.3 Communication software
– 9.4 Management
– 9.5 Site management
10 Facility quality assurance
– 10.1 Provider credentialing
– 10.2 Monitor and evaluation


Polling Question #5

• My organization uses the following approaches to achieve organizational integration:
A) Organizational data modeling
B) Creates point-to-point interconnectivity
C) Distributed systems implementation
D) My organization is not following a programmatic approach to integration


Agenda
1. What is Data Management/DAMA/DM BoK/CDMP?
2. Brief review of Part 1
3. What is Data/Information Architecture?
4. Why is Data/Information Architecture Important?
5. Data Engineering/Leverage
6. Example: Software Package Implementation
7. Example: Donation Center Processing
8. Example: Text Mining/Analytics
9. Take Aways, References & Q&A


Package Implementation Example

Challenge

• "Green screen" legacy system to be replaced with Windows Icons Mice Pointers (WIMP) interface; and
• Major changes to operational processes
– 1 screen to 23 screens
• Management didn't think workforce could adjust to simultaneous changes
– Question: "How big a change will it be to replace all instances of person_identifier with social_security_number?"
• Answer:
– (from "big" consultants) "Not a very big change."


PeopleSoft Process Metadata

Home Page Name
(relates to one or more)
Business Process Name
(relates to one or more)
Business Process Component Name
(relates to one or more)
Business Process Component Step Name
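The one-to-many chain on this slide can be sketched as a small nested data structure. This is only an illustration, assuming each level simply holds a list of the level below it; the class names, the `count_steps` helper, and the sample names are hypothetical and are not taken from PeopleSoft.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Step:
    """A business process component step (lowest level on the slide)."""
    name: str


@dataclass
class Component:
    """A business process component; relates to one or more steps."""
    name: str
    steps: List[Step] = field(default_factory=list)


@dataclass
class BusinessProcess:
    """A business process; relates to one or more components."""
    name: str
    components: List[Component] = field(default_factory=list)


@dataclass
class HomePage:
    """A home page; relates to one or more business processes."""
    name: str
    processes: List[BusinessProcess] = field(default_factory=list)


def count_steps(page: HomePage) -> int:
    """Count every step reachable from a home page by walking the hierarchy."""
    return sum(len(c.steps) for p in page.processes for c in p.components)


# Hypothetical sample data, invented purely for illustration.
page = HomePage(
    "Administer Workforce",
    [
        BusinessProcess(
            "Hire Employee",
            [
                Component("Enter Applicant Data",
                          [Step("Open applicant panel"), Step("Save applicant")]),
                Component("Approve Hire", [Step("Route for approval")]),
            ],
        )
    ],
)

print(count_steps(page))
```

Walking a structure like this is what makes it possible to count how many processes, components, and steps a proposed change would touch.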


Example Query Outputs

PeopleSoft Metadata Structure

Metadata object counts shown in the diagram:
• processes (39)
• homepages (7)
• menugroups (8)
• components (180)
• stepnames (822)
• menunames (86)
• panels (1421)
• menuitems (1149)
• menubars (31)
• fields (7073)
• records (2706)
• parents (264)
• reports (347)
• children (647)

Additional counts shown in the diagram: 41, 8, 182, 847, 949, 86, 281, 1259, 1916, 5873, 264, 647, 708, 647, 25906, 347

Process metadata hierarchy:
• Home Page Name
• Business Process Name
• Business Process Component Name
• Business Process Component Step Name


Business Value - Better Decisions


Quantity  System Component                  Time to make change  Labor Hours
1,400     Panels                            15 minutes           350
1,500     Tables                            15 minutes           375
984       Business process component steps  15 minutes           246
Total                                                            971

X $200/hour: $194,200

X 5 upgrades: $1,000,000 (rounded up from $971,000)
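The arithmetic behind this table can be reproduced in a few lines. The sketch below uses only the figures from the slide (quantities, 15 minutes per change, $200/hour, 5 upgrades); the variable and function names are illustrative.

```python
# Figures taken from the slide; names are illustrative.
MINUTES_PER_CHANGE = 15
HOURLY_RATE = 200   # dollars per labor hour
UPGRADES = 5

components = {
    "Panels": 1400,
    "Tables": 1500,
    "Business process component steps": 984,
}


def labor_hours(quantity, minutes_per_change=MINUTES_PER_CHANGE):
    """Convert a count of items to labor hours at a fixed time per change."""
    return quantity * minutes_per_change / 60


hours = {name: labor_hours(qty) for name, qty in components.items()}
total_hours = sum(hours.values())
cost_per_upgrade = total_hours * HOURLY_RATE
cost_all_upgrades = cost_per_upgrade * UPGRADES

for name, value in hours.items():
    print(f"{name}: {value:.0f} hours")
print(f"Total: {total_hours:.0f} hours")
print(f"Cost per upgrade: ${cost_per_upgrade:,.0f}")
print(f"Cost over {UPGRADES} upgrades: ${cost_all_upgrades:,.0f}")
```

The five-upgrade figure works out to $971,000, which the slide rounds to $1,000,000.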


• This Virginia cancer center is a leader in shaping the fight against cancer

• Over 500 researchers and staff tend to over 12,000 patients annually

• This requires robust information management and analytical services

• The problem: It takes 1 month to run a report showing all touch points for an incident, i.e., a patient’s hospital visit


A National Cancer Institute


Current State Assessment

[Diagram: current-state data flows. Sources: Cancer Registry, Claims Database, Hospital Claims text files, Physician Invoices. Data sets: Patient (Hospital, Physician, Registry); Billing Data (Hospital, Physician); Diagnoses (Hospital, Physician, Registry); Physicians (Hospital, Physician). Tools and transfer mechanisms: SQL, SAS, Access, Excel, Word, file export, text files, FTP, email. Other Departments also appear in the diagram.]


Conceptual Target Architecture

[Diagram: target-state data flows. Sources: Cancer Registry, Hospital Claims, Physician Invoices. Staging: Patient Demographics; Billing Data (Hospital, Physician); Diagnoses (Hospital, Physician, Registry); Physicians (Hospital, Physician). Consolidated/Sandbox: Patient (Consolidated), Physicians (Consolidated), Diagnoses (Consolidated). Integration, reporting, and delivery: SSIS, SSAS, SSRS, SharePoint, Excel, email. Other Departments also appear in the diagram.]

[Chart: one-off reports vs. reusable reports, current vs. improved, on a 0 to 100 scale.]


Business Value - Improving Productivity

• Currently:
– Analysts spend 80% of their time manipulating data and 20% of their time analyzing data
– Used to take 1 month to produce key reports
• After rearchitecting:
– Analysts spend 20% of their time manipulating data and 80% of their time analyzing data
– Two days to produce key reports

[Chart legend: Manipulation, Analysis]


Rough Estimates on Improvements to Modeling Analyses

• Modelers/analysts are expensive knowledge workers

• 80% of their time is spent searching for information

• 20% of their time is spent acting on the retrieved information

• An improvement of 25% (from 80% search to 60%) could yield a doubling of modeler/analyst productivity - a 10 to one payoff

• A 75% improvement (from 80% search to 20%) could yield a 5 X improvement ...

• ... and a similar multiplier implying the opportunity for 10X (OOM) improvements in systems development time
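The rough estimates above can be checked with a simple time-share model. The sketch assumes that productivity scales with the share of time spent on analysis and that all time not spent searching goes to analysis; that assumption, and the function name `analysis_multiplier`, are illustrative and not part of the slide.

```python
def analysis_multiplier(search_pct_before, search_pct_after):
    """Ratio of analysis time after vs. before, given search-time percentages.

    Assumes all non-search time is spent on analysis.
    """
    analysis_before = 100 - search_pct_before
    analysis_after = 100 - search_pct_after
    return analysis_after / analysis_before


# Search time reduced from 80% to 60%: analysis time goes from 20% to 40%.
print(analysis_multiplier(80, 60))  # 2.0

# Search time reduced from 80% to 20%: analysis time goes from 20% to 80%.
print(analysis_multiplier(80, 20))  # 4.0
```

Under this simple model the first case gives the doubling quoted on the slide, and the second gives 4x, in the same range as the slide's rough 5X figure.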

[Chart: share of time spent searching vs. analysis, with searching at 80%, 60%, 40%, and 20%.]


Engineering/Architecting Relationship

• Architecting is used to create and build systems too complex to be treated by engineering analysis alone
• Architects require technical details as the exception
• Engineers develop the technical designs
• Craftsmen deliver components supervised by:
– Building Contractor
– Manufacturer

USS Midway & Pancakes


What is this?

• It is tall
• It has a clutch
• It was built in 1942
• It is still in regular use!


Text Mining/Analytics Example


• Challenge
– Millions of NSN/SKUs maintained in a catalog
– Key and other data stored in clear text/comment fields
– Original suggestion was manual approach to text extraction
– Left the data structuring problem unsolved
• Solution
– Proprietary, improvable text extraction process
– Converted non-tabular data into tabular data
– Saved a minimum of $5 million
– Literally person centuries of work

Week #  Unmatched Items (% Total)  Ignorable Items (% Total)  Items Matched (% Total)
1       31.47%                     1.34%                      N/A
2       21.22%                     6.97%                      N/A
3       20.66%                     7.49%                      N/A
4       32.48%                     11.99%                     55.53%
…       …                          …                          …
14      9.02%                      22.62%                     68.36%
15      9.06%                      22.62%                     68.33%
16      9.53%                      22.62%                     67.85%
17      9.50%                      22.62%                     67.88%
18      7.46%                      22.62%                     69.92%
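The extraction process on this slide is proprietary and is not described in the source. As a generic illustration of the idea only, the sketch below shows rule-based extraction that turns free-text comments into tabular rows and reports the unmatched, ignorable, and matched shares tracked in the table above. The rules, column names, and sample comments are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical extraction rules: column name -> regex with one capture group.
RULES = {
    "length_in": re.compile(r"LENGTH[:\s]+(\d+(?:\.\d+)?)\s*IN\b", re.IGNORECASE),
    "material": re.compile(r"MATERIAL[:\s]+([A-Z]+)", re.IGNORECASE),
}

# Hypothetical pattern for comments that carry no usable data.
IGNORABLE = re.compile(r"^\s*(N/A|NONE|SEE DRAWING)?\s*$", re.IGNORECASE)


def classify(comment):
    """Return (status, row); status is 'ignorable', 'matched', or 'unmatched'."""
    if IGNORABLE.match(comment):
        return "ignorable", {}
    row = {}
    for column, pattern in RULES.items():
        match = pattern.search(comment)
        if match:
            row[column] = match.group(1)
    return ("matched", row) if row else ("unmatched", {})


def summarize(comments):
    """Percentage of comments in each status, as in the weekly table."""
    counts = Counter(classify(c)[0] for c in comments)
    total = len(comments)
    return {s: 100.0 * counts[s] / total
            for s in ("unmatched", "ignorable", "matched")}


# Hypothetical sample comments, invented for illustration.
comments = [
    "BOLT, LENGTH: 2.5 IN, MATERIAL: STEEL",
    "N/A",
    "misc part per vendor spec",
    "",
]

print(classify(comments[0]))
print(summarize(comments))
```

Adding or refining rules and rerunning the summary is one way to picture how the matched share could climb week over week while the unmatched share falls.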


The Business Value of Diminishing Returns


Time needed to review all NSNs once over the life of the project:
NSNs: 2,000,000
Average time to review & cleanse (in minutes): 5
Total time (in minutes): 10,000,000

Time available per resource over a one-year period:
Work weeks in a year: 48
Work days in a week: 5
Work hours in a day: 7.5
Work minutes in a day: 450
Total work minutes/year: 108,000

Person-years required to cleanse each NSN once prior to migration:
Minutes needed: 10,000,000
Minutes available per person per year: 108,000
Total person-years: 92.6

Resource cost to cleanse NSNs prior to migration:
Average salary for SME per year (not including overhead): $60,000.00
Projected years required to cleanse/total DLA person-years saved: 93
Total cost to cleanse/total DLA savings to cleanse NSNs: $5.5 million
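The person-year and cost figures follow directly from the inputs on the slide. The sketch below reproduces the calculation; all numbers come from the slide, while the variable names are illustrative.

```python
# Inputs from the slide.
nsn_count = 2_000_000
minutes_per_nsn = 5
work_weeks_per_year = 48
work_days_per_week = 5
work_hours_per_day = 7.5
sme_salary_per_year = 60_000  # dollars, not including overhead

# Total effort to review every NSN once.
total_minutes_needed = nsn_count * minutes_per_nsn

# Capacity of one person over one year.
minutes_per_day = work_hours_per_day * 60
minutes_per_person_year = work_weeks_per_year * work_days_per_week * minutes_per_day

# Effort expressed in person-years, and its salary cost.
person_years = total_minutes_needed / minutes_per_person_year
total_cost = person_years * sme_salary_per_year

print(f"Minutes needed: {total_minutes_needed:,}")
print(f"Minutes available per person-year: {minutes_per_person_year:,.0f}")
print(f"Person-years: {person_years:.1f}")
print(f"Total cost: ${total_cost:,.0f}")
```

The result is about 92.6 person-years and roughly $5.56 million, which the slide reports as 93 person-years and $5.5 million.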

Business Value - Quantitative Benefits


Take Aways


• What is an information architecture?
– A structure of data-based information assets supporting implementation of organizational strategy (or strategies)
– Most organizations have data assets that are not supportive of strategies, i.e., information architectures that are not helpful
– The really important question is: how can organizations more effectively use their information architectures to support strategy implementation?
• What is meant by use of an information architecture?
– Application of data assets towards organizational strategic objectives
– Assessed by the maturity of organizational data management practices
– Results in increased capabilities, dexterity, and self-awareness
– Accomplished through use of data-centric development practices (including taxonomies, stewardship, and repository use)
• How does an organization achieve better use of its information architecture?
– Continuous re-development; the starting point isn't the beginning
– Information architecture components must typically be reengineered
– Using an iterative, incremental approach, typically focusing on one component at a time and applying formal transformations


Questions?


It’s your turn! Use the chat feature to submit your questions to Peter now.


Upcoming Events


March Webinar: Building the Case for the Top Data Job
March 12, 2013 @ 2:00 PM ET/11:00 AM PT

April Webinar: Unlock Business Value through Data Governance
April 9, 2013 @ 2:00 PM ET/11:00 AM PT

Sign up here:
• www.datablueprint.com/webinar-schedule
• www.Dataversity.net


Upcoming Special Event


Leading the Data Asset Management Team: CDO or Top Data Job?

Join Peter Aiken, Ph.D. and Micheline Casey for this interactive discussion on the role of Chief Data Officer (CDO) or Top Data Job (TDJ).

Attendees will be presented with big ideas and alternative ways not only for how to think about the role of CDO/TDJ but also how to plan and establish a CDO/TDJ position at their organizations. This webinar is intended to provide viewers with deep insights from two data management thought leaders.

March 19, 2013 @ 2:00 PM ET/11:00 AM PT
