Empowering the Data-Driven Organization - SAS · Patterns of using SAS with Hadoop for Analytics &...

Post on 25-Apr-2020

Copyright © 2014, SAS Institute Inc. All rights reserved.

Empowering the Data-Driven Organization

Jeroen Dijkxhoorn, SAS

Lars Slagboom, ABN AMRO

In 5 years from now…

Elephants will rule the world

Acting on predictive Decisions will be standard

Real Time Analytics is to blame for a crash

Mobile User Interfacing will be the Standard

Data will be everywhere and Nobody knows where exactly

Trends: Big Data, Storage, Hadoop & In-Memory Technology

[Bar chart, 2009 vs. today: Vertica, Teradata, Greenplum, Oracle, Microsoft PDW, Hadoop; axis from $0 to $100,000]

Cost of Storage, Memory, Computing

• In 2000, a GB of disk cost $17; today it costs < $0.07

• In 2000, a GB of RAM cost $1,800; today it costs < $10

• In 2009, a TB of RDBMS cost $70K; today it costs < $20K
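The price drops quoted above can be turned into rough decline factors. The following is a minimal sketch that uses only the figures on the slide; the dictionary keys and the function name are illustrative, and because the slide gives upper bounds for today's prices, the computed factors are lower bounds.

```python
# Price points quoted on the slide, in USD: (earlier price, upper bound on today's price).
# The key names are illustrative labels, not terms from the deck.
PRICES = {
    "disk_per_gb": (17.0, 0.07),           # 2000 vs. today
    "ram_per_gb": (1800.0, 10.0),          # 2000 vs. today
    "rdbms_per_tb": (70_000.0, 20_000.0),  # 2009 vs. today
}


def decline_factor(then: float, now: float) -> float:
    """Return how many times cheaper the resource has become."""
    return then / now


# Today's prices are upper bounds, so each factor is a lower bound.
factors = {name: decline_factor(then, now) for name, (then, now) in PRICES.items()}

for name, factor in factors.items():
    print(f"{name}: at least {factor:.1f}x cheaper")
```

On these numbers, disk and RAM prices fell by two orders of magnitude, while the cost of a terabyte in a relational database fell by a much smaller factor.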

Cost per Terabyte

Technology Push: storage costs and CPU speed

To enable analytics in this changing environment, you need to:

Bring the Analytics to the Data…

…and run it in a distributed mode

Business pull: two eras, two mindsets

Process-centric

• Everything is forbidden unless it is permitted

• Focus on cost control

• Technology constrained

Discovery-centric

• Everything is permitted unless it is forbidden

• Focus on value

• Technology empowered

To enable analytics in this changing environment, you need to:

Provide self-service analytic capabilities…

…and automate the decision making process

Data-Driven with Analytics as the main enabler

From Data to Decision

[Diagram: Manage Data, Explore Data, Develop Models, Deploy & Monitor]

Challenges:

• Growth in Demand

• Growth of Data

• Access to Talent

• Controlling Cost

Needs:

• Scale the Process

• Avoid Replication

• Increase Productivity

• Decouple Cost & Growth

SAS Directions to address these needs

Scale the Process

SPEED UP THE DATA TO DECISION LIFECYCLE

1. Event Stream Processing

2. High Performance Analytics

3. Decision Management

Avoid Replication

MOVE SAS PROCESSING TO THE DATA

1. In-Database Processing

2. Scoring Accelerators

3. Code Accelerators

Increase Productivity

PROVIDE INTERACTIVE, SELF-SERVICE INTERFACES

1. Data Loader for Hadoop

2. Visual Analytics, Visual Statistics & In-Memory Statistics

3. Move to responsive web-apps based on HTML5

Decouple Cost & Growth

SUPPORT IT COST EFFICIENCY EFFORTS

1. Span data and processing across a Grid or Cluster

2. Virtual Apps to deploy in Private, Public or Hybrid Cloud

3. On-premise deployment within 3 hours

Platform Strategy, Automotive Engineering

• 19 models on a single platform

• €15 billion annual savings

• −30% production time

Platform strategy: Basis of the Analytics Factory

Risk, Sales, Partners, Fraud, Controlling, Marketing, Logistics, Purchasing, IT, Production

• 50% reduction in costs for BI/Analytics

• Double the value of BI/Analytics projects per year

Standardization Consolidation Industrialization

3 steps towards an Analytics Factory

Standardization

• Coming together by agreeing what capabilities to use

Consolidation

• Keeping together by centralizing the platform

Industrialization

• Working together by scaling and speeding up the process

3 steps towards an Analytics Factory

Data and Information at ABN AMRO

Introduction

• ABN AMRO

• Enterprise Data & Information

Standardization Consolidation Industrialization

Standardization

Characteristics

• Focus on the system landscape

• Everyone has their own preference

• Data is decentralized

Success factors

• External pressure

• Company-wide theme

• Policy

Consolidation

Characteristics

• Focus shifts to the user

• The value of integrated data is recognized

• Waiting times in your data warehouse development

Success factors

• Introduce user teams

• Market your data warehouse and BI environment

Industrialization

Characteristics

• Focus on usage

• Data grows faster than systems

• More demand than supply

• Data is a chain

Success factors

• Include business processes in your change

• Organize source systems

Marc Lammers:

“50 times 2% is also 100%”

Back to the elephant…

What is Hadoop being used for?

• Hadoop as a Data Platform

• Hadoop as a core component of the next-generation analytical platform

[Diagram: Manage Data, Explore Data, Develop Models, Deploy & Monitor]

Usage 1: Hadoop as Data Platform

Initiator

• This paradigm is mostly driven by IT

Drivers

• Increasing costs of data storage

• Increasing volume of data

• Latency to deliver information

Benefits

• Large-scale distributed storage and batch processing

Usage 1: Hadoop as data platform

Ingest/Load Data

Cleanse & Transform Data

Load Data To Other Sources / Memory

Metadata Documentation

• SAS/ACCESS

• SAS Data Management

• SAS Event Stream Processing

• SAS Federation Server

• SAS Data Loader for Hadoop

• SAS Data Quality Accelerator for Hadoop

• SAS Code Accelerator for Hadoop

• SAS/ACCESS

• SAS Data Management

• SAS Federation Server

• SAS Metadata Server

Usage 2: Hadoop as core of next generation analytical platform

[Diagram: Manage Data, Explore Data, Develop Models, Deploy & Monitor]

Initiator

• This paradigm is mostly driven by business

Drivers

• Increasing demand for a variety of different and additional information

• The need for a flexible data platform to store, process, and analyze data at any scale

Benefits

• The business can start thinking big again when it comes to data

Usage 2: Hadoop as core of next generation analytical platform

[Diagram: Manage Data, Explore Data, Develop Models, Deploy & Monitor]

• SAS/ACCESS

• SAS Data Management

• SAS Event Stream Processing

• SAS Federation Server

• SAS Data Loader for Hadoop

• SAS Data Quality Accelerator for Hadoop

• SAS Code Accelerator for Hadoop

• SAS Visual Analytics

• SAS In-Memory Statistics for Hadoop

• SAS HPA Products

• SAS Visual Statistics

• SAS In-Memory Statistics for Hadoop

• SAS Decision Manager

• SAS Scoring Accelerator for Hadoop

Patterns of using SAS with Hadoop for Analytics & reporting

SAS with Hadoop

Hive

Extract from Hadoop, pushing some SAS pre-processing to Hadoop

SAS in Hadoop

Score A, Code A, Impala

Embedded Process: push SAS data processing to Hadoop with MapReduce

SAS on Hadoop

HPA, LASR

In-Memory Analytics: use Hadoop for storage persistence and commodity computing
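The difference between extracting data for processing and pushing processing to the data can be illustrated with a small sketch that is independent of SAS and Hadoop. Everything in it is hypothetical: the partition layout stands in for rows stored on separate cluster nodes, and the function names are made up for illustration. It shows only that both approaches yield the same result while moving very different amounts of data.

```python
from functools import reduce

# Hypothetical data: each inner list stands for the rows held by one cluster node.
partitions = [
    [12.0, 7.5, 3.0],
    [8.0, 1.5],
    [20.0, 4.0, 6.0, 2.0],
]


def extract_then_aggregate(parts):
    """Copy every row to the client, then compute the mean there."""
    rows = [value for part in parts for value in part]  # all rows are moved
    return sum(rows) / len(rows), len(rows)


def push_down_aggregate(parts):
    """Compute (sum, count) next to the data; move only one pair per node."""
    partials = [(sum(part), len(part)) for part in parts]  # map step, per node
    total, count = reduce(
        lambda a, b: (a[0] + b[0], a[1] + b[1]), partials
    )  # reduce step, on the client
    return total / count, len(partials)


mean_extract, items_moved_extract = extract_then_aggregate(partitions)
mean_pushed, items_moved_pushed = push_down_aggregate(partitions)

print(f"extract:   mean={mean_extract:.3f}, items moved={items_moved_extract}")
print(f"push-down: mean={mean_pushed:.3f}, items moved={items_moved_pushed}")
```

In the push-down variant the amount of data moved grows with the number of nodes rather than the number of rows, which is the motivation behind bringing the processing to the data.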

SAS for Hadoop directions

DIRECTIONAL THEMES

• Continuity of Business

• Bring SAS processing to the Data

• Leverage Hadoop for new Technology offerings

• Breadth and depth of modern analytic methods in Hadoop

Information on breakouts: Analytical platform

13.30 Parallel Sessions

• Big Data and Visual Analytics – Rabobank

• Business Analytics – SAS

• Data Management – Ziekenhuis Gelderse Vallei

• Visual Analytics – Mercachem

13.30 Guided Tours

• Visual Analytics

14.30 What’s Hot Sessions

• Big Data Analytics with Hadoop

• Data Management 3.0: What about Hadoop?

• What’s hot in Data Governance

• Modernization: more possibilities, fewer risks

• Advanced modeling with SAS

• What’s new in SAS Visual Analytics 7.1

• Best Practices in Visualization and Dashboard Design

14.30 Roundtables (max. 20 people)

• The Analytical Bank

• Data monetization

15.45 Parallel Sessions

• Big Data and Visual Analytics – Belastingdienst

• Business Analytics – iBridge/Randstad

• Data Management – DSM

• Visual Analytics – H@nd
