Attachez vos ceintures et écoutez le Data Steward

51
@Djeepy1 | @Fleid_bi | @GUSS_France Fasten your seatbelt and listen to the (Data) Steward Jean-Pierre Riehl http://blog.djeepy1.net | @djeepy1

Transcript of Attachez vos ceintures et écoutez le Data Steward

@Djeepy1 | @Fleid_bi | @GUSS_France

Fasten your seatbelt and

listen to the (Data) Steward

Jean-Pierre Riehl

http://blog.djeepy1.net | @djeepy1

@Djeepy1 | @Fleid_bi | @GUSS_France

Sponsors Gold

@Djeepy1 | @Fleid_bi | @GUSS_France

Sponsors Silver et Bronze

@Djeepy1 | @Fleid_bi | @GUSS_France

Jean-Pierre RiehlMembre du Board

http://blog.djeepy1.net

@djeepy1

Pure-Player Microsoft

• Practice Collaboration

• Practice Data & Business Intelligence

• Practice Infrastructure

• Practice Développement

MVP SQL Server

MCSE : SQL Server 2012

MCPD : Enterprise Application

Microsoft Certified Trainer

@Djeepy1 | @Fleid_bi | @GUSS_France

Jean-Pierre Riehl

Practice Manager Data & BI – AZEO

MVP SQL Server

President at GUSS

Florian Eiden

Managing Consultant, Data & Analytics - Cellenza

MVP SQL Server

Board Member at GUSS

Who are we ?

@Djeepy1 | @Fleid_bi | @GUSS_France

GUSS : PASS France chapter

Webcasts, Conferences, Afterworks

.Pro

Next event :

SQLSaturday Paris 2014September 13th

Tour Montparnasse, Paris

English-speaking track

@Djeepy1 | @Fleid_bi | @GUSS_France

THE CONTEXT

@Djeepy1 | @Fleid_bi | @GUSS_France

Self-Service BI

Corporate BI

• Managed

• DatawareHouse

• Company-wide

Team BI

• Shared

• Models

• Department-wide

Self-Service BI

• Quick & Easy

• Personal data

• Document-centric

@Djeepy1 | @Fleid_bi | @GUSS_France

Empower UsersRelease Data, Release usages

@Djeepy1 | @Fleid_bi | @GUSS_France

Governance noun.

« Leading the conduct of things

or persons »

@Djeepy1 | @Fleid_bi | @GUSS_France

THE ISSUES

@Djeepy1 | @Fleid_bi | @GUSS_France

Writer’s block

also known as

White Worksheet Syndrom

Issue #1

?

@Djeepy1 | @Fleid_bi | @GUSS_France

Too much Data !« I want the Employee’s List »

– Duplicates

– Wrong sources

– Bad Data

– Poor or Bad description

– Etc.

Issue #2

@Djeepy1 | @Fleid_bi | @GUSS_France

Compliance– Security

– Encryption

– Anonymization

Issue #3

@Djeepy1 | @Fleid_bi | @GUSS_France

How does it scale ?

@Djeepy1 | @Fleid_bi | @GUSS_France

POWER BIThe Microsoft Way

@Djeepy1 | @Fleid_bi | @GUSS_France

Features and tools

Analyze

Visualize

Share

Question

Q&A

Mobility

DiscoverSearch, access, and transform

public and internal data sources

with Power Query

Share datasets and workbooks refreshable from on-premises and cloud based data sources, with Power BI Sites

Easy data modeling and lightning fast in-memory analytics with Power Pivot

Bold new interactive data visualizations with Power View and Power Map

Ask questions and get immediate answers with natural language query

Mobile access through HTML5 and touch optimized apps

Scalable | Manageable | Trusted

@Djeepy1 | @Fleid_bi | @GUSS_France

Power BI - Big Picture

Power BI O365 Tenant

Power BI

Admin

Center

SQL

Data

Catalog

External

Data

Q&A

Cloud On-Prem

Oracle …

Excel

Power BI Sites Power Query Power Pivot

Power View Power Map

Cloud

Power Query

Data

Refresh

Index

Search Data

Management

Gateway

@Djeepy1 | @Fleid_bi | @GUSS_France

Ideas of costs

Q&A

All Inclusive (ie. including Office 2013 ProPlus Licences)

@Djeepy1 | @Fleid_bi | @GUSS_France

Power BI* On-Prem

Data Sources Gateways & Data SourcesShared Data Sources (SSRS)

Office Data Connection

Datasets QueriesShared Datasets (SSRS)

Power Pivot for SharePoint

ModelsPower Pivot

Data Management GatewayPower Pivot for SharePoint

Dashboards Power ViewPower View (BISM)

SSRS over Power Pivot

There was some SSBI before Power BI

Power BI vs. On-Prem

* Many more features to come

@Djeepy1 | @Fleid_bi | @GUSS_France

@Djeepy1 | @Fleid_bi | @GUSS_France

• Tools are only a part of the solution

• Good formula : People + Processes + Tools– « Data governance is between 80 and 95%

communication » - Dec 2006 Data Governance Conference

• We have the tools, let’s talk about the rest…

If all you have is hammer…

@Djeepy1 | @Fleid_bi | @GUSS_France

THE DATA STEWARD

@Djeepy1 | @Fleid_bi | @GUSS_France

• Wikipedia: Stewardship is an ethic that

embodies the responsible planning and

management of resources.

A Steward?

Data Steward of Gondor…

Not our idea, see Matthew Roche for complaints

@Djeepy1 | @Fleid_bi | @GUSS_France

IT : Information Technology

My pretty typical organization

Piercing items

Slashing items

Bludgeoning items

Armors & Shield

Business Divisions Functional Units

Finance HR Legal

@Djeepy1 | @Fleid_bi | @GUSS_France

Where are stewards needed in the org?

Piercing items

Slashing items

Bludgeoning items

Armors & Shield

Business Divisions Functional Units

Finance HR Legal

IT : Information Technology

@Djeepy1 | @Fleid_bi | @GUSS_France

My organization : actual perception

Piercing items

Slashing items

Bludgeoning items

Armors & Shield

Business Divisions Functional Units

Finance HR Legal

IT

@Djeepy1 | @Fleid_bi | @GUSS_France

Well, let’s be honest about what it looks like

Piercing items

Slashing items

Bludgeoning items

Armors & Shield

Business Divisions Functional Units

Finance HR Legal

IT

@Djeepy1 | @Fleid_bi | @GUSS_France

For maximum results: local initiatives

Piercing items

Slashing items

Bludgeoning items

Armors & Shield

Business Divisions Functional Units

Finance HR Legal

IT

@Djeepy1 | @Fleid_bi | @GUSS_France

• Why : – Specific to your company, to be defined in your master plan

• How : – “Responsible planning and management of resources”

• What :– Elect data stewards that will enable, teach, police

Let’s get back to our steward

Slashing items

@Djeepy1 | @Fleid_bi | @GUSS_France

• Skills– Interpersonal skills

– Good personal organization

– Data-awareness

• Data lifecycle specific to the company

• General understanding of BI/data technologies

• Data merging, cleaning, metadata maintenance

– Training in tools used in the company

• A chosen career path– It’s an actual job, usually part time

– But not just an additional task in the schedule!

Required skills

@Djeepy1 | @Fleid_bi | @GUSS_France

The Journey of a Data Steward

@Djeepy1 | @Fleid_bi | @GUSS_France

The Journey of a Data Steward

• Help to find data– Manage the Data Lake

– Create Data Sources

– Facilitate exploration

– Manage metadata

@Djeepy1 | @Fleid_bi | @GUSS_France

The Journey of a Data Steward

• Manage new data– Find new Data Sources

– Find new Datasets

• Verify new datasets– Check for Accuracy

– Check for duplicates

– Fix sources and queries

• Use of Workflows

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Workflows

Create

DeriveApprove

Data HubModels, OData, Reports, DWH, MDM, etc.

Publish

Sandox

Enhance

Discovery

Data

Steward

Analyst

Developer

@Djeepy1 | @Fleid_bi | @GUSS_France

The Journey of a Data Steward

• Certify– Ensure Corporate Policies

• Train & Teach – Help for modeling

– Help for analysis

@Djeepy1 | @Fleid_bi | @GUSS_France

@Djeepy1 | @Fleid_bi | @GUSS_France

Information Management Platform

IT

Developers

Data

Steward

Importance of relations

Business

Users

Tools Tools Tools

@Djeepy1 | @Fleid_bi | @GUSS_France

Information Management Platform

SalesIT

And reality is more complex

Mktg Production

@Djeepy1 | @Fleid_bi | @GUSS_France

DATA(source) LIFECYCLE MANAGEMENT

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Lifecycle Management

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Lifecycle Management

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Lifecycle Management

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Lifecycle Management

@Djeepy1 | @Fleid_bi | @GUSS_France

Data Source Lifecycle Management

Manage, enable

Teach, assist

Analyse, merge

Data

Steward

@Djeepy1 | @Fleid_bi | @GUSS_France

• Data asset administration

– Create, Delete

– Update, Maintain

– Give/revoke access

– Refresh schedule

– Monitor

– …

• Business metadata

understanding

• Data manipulation

• Applied to : data artifacts

– Data sets

• Files: CSV, XLSX

– Metadata

• Data Models

• Documentation

– Data sources

– Queries

– …

Data Source Lifecycle Management

@Djeepy1 | @Fleid_bi | @GUSS_France

CONCLUSION

@Djeepy1 | @Fleid_bi | @GUSS_France

• Tools are nothing without people and

processes

• Governance is different in every company– Decided and sponsored by the executives, inscribed in

a global strategy

– Adapted to your organization

– The Data Steward as the local implementation of it

A matter of governance

@Djeepy1 | @Fleid_bi | @GUSS_France

1. Build an Information Management Platform

2. Identify your processes & Org Chart

3. Write the Data Steward « Job Profile »

4. Identify the right people for the job

5. Leverage Self-Service BI

• Ask your local experts

How to start tomorrow ?

@Djeepy1 | @Fleid_bi | @GUSS_France

• A Data Culture– See Satya Nadella

April 15th 2015 presentation in SF

• To at last step up in the

knowledge pyramid!– Machine learning \o/

All this for what?

@Djeepy1 | @Fleid_bi | @GUSS_France

Any questions ?

Thank You !