Data Stewardship and Governance: how to reach global adoption and systematic monitoring of data...
-
Upload
pieter-de-leenheer -
Category
Business
-
view
550 -
download
2
description
Transcript of Data Stewardship and Governance: how to reach global adoption and systematic monitoring of data...
Data Governance and Data Stewardship on how to reach global adoption and systematic monitoring of data policy through software
Dr. Pieter De Leenheer Co-founder !
What we talk about when we talk about no Data Governance
Who approved this?!
I wish these guys spoke our language !
I can’t understand this report !!
I’ve never seen this code! Who introduced this ?!
This doesn’t seem right. Are we sure this data is correct ?!
The Problem!
This rule is different in our country !!
This is an exception !to the rule !!
Data Management Challenges
• Data Service = data sharing agreement across organization silos, policies, regulations, semantic assumptions
• No clear balance between data ownership and control: • responsibilities are not set • for each data point : increasing exposure to risk regarding quality
and policy compliance ! ask Alice, she knows
Regulatory+compliance+risks+con3nue+to+persist+and+remain+a+solid+driver+for+governance,+risk+and+compliance+technologies.+However,+more+hype+is+being+generated+by+external+risks+posed+by+third+par3es,+suppliers+and+customers.+
(Gartner!Hype!Cycle!on!Risk!and!Compliance!Tech,!2013)!!
Flanders Research Information Space
• Providing Scientific Research Information and Services
• Easy
• Transparent
• Open
• Timely
• Unambiguous
• Supported by Data Governance
• Qualitative meta data: e.g., definition for project, funding codes, mappings, classifications, etc.
• Roles and responsibilities for Information Providers and Stiweto
• Collaborative workflows between Information Providers and Stiweto
By courtesy of G. Van Grootel, EWI
FRIS’ Data-driven Innovation Engine
By courtesy of G. Van Grootel, EWI
Context & Necessity
• Services are increasingly • knowledge-intensive relying on millions of data points from
• Partners • Third parties • Customers
• co-produced in federated, decentralised, multi-tier settings
• multi-disciplinary:
• Algorithm: e.g., Big Data Analytics • Infrastructure: e.g., Internet of Things
• Service Innovation Methods: e.g, Living Labs
• Marketing: e.g., Service-dominant Logic
• sufficient…..? No
Defining Data Stewardship & Governance
• Ownership + => Power + Control
Data Stewardship & Governance
• Ownership + Responsibility => Power + Control 1. (global) data stewardship
! Requirement 1: people who define data policy ! E.g., multilingualism policy
2. (systematic) data governance ! Requirement 2: processes that enforce data policy ! E.g., every project abstract must be in English and Dutch
• Now let’s build software for it…
“New+Informa3on+infrastructure+technologies+must+enable+organiza3ons+to+define,+organize,+share,+integrate+and+govern+data+and+content+to+create+business+value”++
(Gartner!Hype!Cycle!on!Informa@on!Infrastructure!Tech!2013).!
Yet contradicting forces… Borrowed from Dirk Coutuer (ING)
. .and not all data points are create equal
Critical Data Elements
Auditors, Clients,
Counterparties ...
External
Risk, Compliance, Finance ...
Critical Data Elements
Critical Data Elements
Equities, Fixed Income, Wealth Management ...
Dodd Frank Act, Basel III, FATCA ...
Critical Data Elements
Business Lines
Corporate Functions
Regulations
Borrowed from Predrag Dizdarevic (Element 22 NYC)
Can technology globalise and systematise data policy scoping, definition and enforcement which is by nature a human process?
Process-driven Data Governance
Tools Policy
Multilingualism
Business RuleAbstract must be
in English
Business RuleAbstract must be
in Dutch
Code Value4250
Code Value4.3
Code ValueG3
Business TermResearcher
Business TermPublication
Business TermProject
Business TermActie er Onder...
?!
?!
?!
?!
Funding Community
Generation 1 Funding Codes (Codelist)Funding Sources Glossary
Business termActie ter
ondersteuning van de
Strategische prioriteiten van
de Federale overheid
Generation 2 Funding Codes (Codelist)
Code SetGeneration 2 Funding Codes
Code Value368
contains
code
Code SetGeneration 1 Funding Codes
Code Value4250
containscode
Code Value4.3
contains
Funding Stream Codes (Codelist)
Code SetFunding Stream Codes
Code ValueG3
Accounting Codes (Codelist)
Code SetAccounting Codes
Code Valuexxxx
Business termPOD
wetenschapsbeleid -
Federale Impulsprogra
mma's
?!?!
?!
?!
Tools Policy
Multilingualism
Business RuleAbstract must be
in English
Business RuleAbstract must be
in Dutch
Code Value4250
Code Value4.3
Code ValueG3
Business TermResearcher
Business TermPublication
Business TermProject
Business TermActie er Onder...
Funding Community
Generation 1 Funding Codes (Codelist)Funding Sources Glossary
Business termActie ter
ondersteuning van de
Strategische prioriteiten van
de Federale overheid
Generation 2 Funding Codes (Codelist)
Code SetGeneration 2 Funding Codes
Code Value368
contains
code
Code SetGeneration 1 Funding Codes
Code Value4250
containscode
Code Value4.3
contains
Funding Stream Codes (Codelist)
Code SetFunding Stream Codes
Code ValueG3
Accounting Codes (Codelist)
Code SetAccounting Codes
Code Valuexxxx
Business termPOD
wetenschapsbeleid -
Federale Impulsprogra
mma's
?!
?!
?!
?!
Data Governance Council
Example for Funding Source Terms and Codes
Funding Community
Generation 1 Funding Codes (Codelist)Funding Sources Glossary
Business termActie ter
ondersteuning van de
Strategische prioriteiten van
de Federale overheid
Generation 2 Funding Codes (Codelist)
Code SetGeneration 2 Funding Codes
Code Value368
contains
code
Code SetGeneration 1 Funding Codes
Code Value4250
containscode
Code Value4.3
contains
Funding Stream Codes (Codelist)
Code SetFunding Stream Codes
Code ValueG3
Accounting Codes (Codelist)
Code SetAccounting Codes
Code Valuexxxx
Business termPOD
wetenschapsbeleid -
Federale Impulsprogra
mma's
Load, Define & Enforce Data Governance Council: Governance Operating Model
Roles & Responsibilities
Processes & Workflow
Asset Types & Traceability
Data Governance Organization
Data Stewardship Activities
Data Quality Development
IT / Operational Data Management Activities
Data Modeling
Metadata Lineage
Establishes & drives
Aligns & Coordinates
Reports & Escalates
Monitors & Remediates
Metadata Scanning
Reference Data Authoring
Data Integration
Collibra Business Semantics Glossary (BSG)
Collibra Reference Data Accelerator (RDA)
Hierarchy Management
Business & Data Definitions
Business Traceability
Semantic Modeling
Mapping Specifications
Policy Management
BusinessRules
Data Quality Rules
Data Quality Reporting
Issue Management
Reference Data Crosswalks
Master Data Stewardship
Data Quality Profiling
DQ Defect Resolution
Collibra Data Stewardship Manager (DSM)
Collibra Platform
Other Data Management Vendor products
...
Load…!
Scope,!select,!define!
enforce!
5 Modeling Concepts in DGC Operating Model
Community Name
Domain
Assetrelation Attribute
Assets are fundamental building blocks or resources for which you want to capture information. An asset
belongs to exactly one domain. An asset has a unique name within its domain..
E.g., Personal Privacy Policy, Customer, ISO 3166, CRM, Customer Gender Disclosure Issue
Attributes are literal values such as strings or numbers that do not form an asset on their own right. E.g., the Description attribute for asset “Customer” is “Person that placed at least one order for at least one product with Bank and Insurance”
Relations semantically relate 2 assetsE.g., between assets “Customer” and “CRM”: “Customer has system of
record / is system of record for CRM”E.g., between assets “Customer” and “Gender”: “Customer has gender /
gender of Gender”
Domains logically group assets (according to their function, project, or knowledge area) and are owned by exactly one community. It has a domain type that specifies which asset types can be created in the domain.E.g., Customer Domain groups all assets related to customer relationship managementE.g., Enterprise Rules and Policies Domain collects all valid policies and rules in the organisation
Communities are groups of people. They often correspond to functional divisions in a company and should be aligned with
the company's governance organization. A community can control/own various domains.
E.g., Finance Community includes relevant people in the finance function, and controls the Customer Domain.
DGC Asset Types
Asset
Business Asset
Data AssetTechnology
AssetGovernance
AssetIssue
Asset Types allow you to formally specify what type an asset is, as a kind of template. They are assigned to one or more Domain Types.E.g., Business Term is type for “Customer” and “Gender”E.g., Code Value is type for “CG_NA”;E.g., System is type for “CRM”
subsumes asset types such as Business Term, KPI, and Report
includes asset types such as Policy and Rule
subsumes asset types such as Code Value
includes asset types such as System and Database
We distinguish between 4 main types of asset, and 1
special type called Issue
Traceability of Assets across Domains
Assigning types to assets, relations, domains gives meaning; and brings a better understanding of different viewpoints on DG !
Enterprise Architecture
Finance
Working Group on Rules and Policies
Application Assets CRM Application Reference Data
Enterprise Rules and Policies
Customer Domain
Business Term Customer
System CRM
has system of record
"Person or […] and Insurance"
Business Term Genderhas gender
Code ValueCG_MA
allowed value
Code ValueCG_FE
Code ValueCG_NA
PolicyPersonal
Privacy Policy
governs / complies to
Issue Gender
Disclosure Issueviolates
description
Use-cases
Business Glossary
DG in Cloud Provider
Data Dictionary
Business Glossary at the #1 Chocolate Factory
Reference Data Reference Data
FWO Disciplines
IWETO Disciplines
ECOOM Hasselt
Issue Management Issue Management
Data Governance Council
Funding Source Not
Found
Reference Data & Issue Mgt at Health Insurance Co.
• http://prezi.com/ve1ws8jmpqcn/workflow/
Policy Management Policy Management
PolicyMultilingualism
Business RuleAbstract must be
in Dutch
Business RuleAbstract must be
in English
Business TermProject
Data EntitycfProj
FRIS Data Governance: Funding Sources Glossary Scenario
…!
…!
Funding Sources Glossary (FSG)
Data Governance Council
ECOOMUGent VUB
data governance officers in the Council are delegated by each institute
FRIS Data Governance: Funding Sources Glossary Scenario (2)
• 5 (fictional) workflows for different phases in the lifecycle of a term:
candidate > proposed > draft > in-review > accepted
Funding Sources Glossary (FSG)
approving in-review term
Data Governance Council
ECOOM UGent VUB
Ticket Request
Create
Import
Discover
delegating proposed FSG term
mapping accepted FSG terms
on-boarding candidate FSG term
approving in-review termapproving in-review term
5
1
4 4 4
2
draft term3
on-boarding, delegating and drafting a Funding Source term
candidate > proposed > draft > in-review > accepted
Approving Funding Source Glossary term
candidate > proposed > draft > in-review > accepted
Demonstration in the DGC Software Tool
• 5 workflows for different phases in the lifecycle of a term:
Funding Sources Glossary (FSG)
approving in-review term
Data Governance Council
VUB UGent ECOOM
Ticket Request
Create
Import
Discover
delegating proposed FSG term
mapping accepted FSG terms
on-boarding candidate FSG term
approving in-review termapproving in-review term
5
1
4 4 4
2
draft term3
candidate > proposed > draft > in-review > accepted
1. Start-user who requests: Bob Brown 2. DGO Secretary motivates request: Mike Jones
6. Subject Matter Expert reviews: John West
8. Co-Stewards vote : Mary Smith
7. Stakeholder comments: Judy Clarke
5. Steward drafts term: Pieter DL
3. Officers vote onboarding: John Fisher
4. DGO Secretary moves the onboarded term: Mike Jones
Conclusion • FRIS Service = Qualitative Data Sharing
• Qualitative => Unambiguous, Timely, Accurate, Open, Complete, Consistent, Valid, etc.
• Data Stewardship highlights Responsibility aspect of Data Ownership
• Data Governance programs enforces Data Quality Policy and Regulations
• Data Governance Technologies are promising to handle these issues that hamper service innovation
Conclusions • Services are data-intensive
• Their coproduction requires data sharing across organisation policies / modelling assumptions / regulations
• Data Stewardship highlights responsibility aspect of Data Power
• Data Governance programs enforces data policy and regulations
• Data Governance Technologies are promising to overcome these issues that hamper service innovation
Questions & Feedback?