The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL...

16

Transcript of The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL...

Page 1: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,
Page 2: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

The Evolution of the Data Landscape

Page 3: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

IDSIMS

System R (SQL)IngresOracle

InformixDB2Sybase ASETeradataRed Brick

JADEInterSystemsVersant InformixDB2 OracleMS SQL ServerMySQLPostgreSQL

HadoopClouderaMapRMongoDBExadataNetezzaHP NeoviewGreenplum

Hortonworks, Cloudera, MapR, Snowflake, S3, MariaDB, Amazon Redshift, DynamoDBDocumentDB, Amazon Neptune, Amazon Timestream, Oracle, DB2, ArongoDB, MS Azure SqlServer, PostreSQL, MySQL, Neo4J

1960 1970 1980 1990 2000 2010 Today

Page 4: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Customers / Partners

Cloud Platform

Social Media

Mobility / Devices & IoT

Page 5: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Data inconsistencies

Data SilosData

Protection/Security

Skill shortage

Data Wrangling

Data LatencyDuplicate

Data

Moving Data to the Cloud

An evolving Data Landscape introduces challenges

Page 6: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

What is a Data Fabric?

“Data Fabric is an architecture and set of data services that provide consistent capabilities across a choice of endpoints spanning on-premises and multiple cloud environments”

- NetApp

Page 7: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

What is a Data Fabric?

“Data fabric is a combination of architecture and technologythat is designed to ease the complexities of managing many

different kinds of data, using multiple database management systems, and deployed across a variety of platforms”

- Eckerson Group

Page 8: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

What is a Big Data Fabric?

“Big data fabric is an emerging platform which accelerates business insights “by automating ingestion, curation, discovery,

preparation and integration from data silos”

- Forrester Research

Page 9: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Why Data “Fabric”?Fabrics are interconnected structures where multiple

nodes appear as a single logical unit

What makes up a Data Fabric?1. Technology / Data Services2. Architecture

Page 10: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Data Fabric

Data Lake

Data Streams

EDW

Cloud DW

New Data

Cloud Applications

Data Management

On-Premises Off-Premises

Data Management

Page 11: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Forrester Data Fabric Reference Architecture

Data Security

Governance

Metadata

Search

Data Quality

Lineage

Data Management

Caching, In-Memory, Embedded, Self-Service

Data modeling, Preparation, Curation, Virtualization

Transformation, Integration, Cleansing

Ingestion, machine learning, streaming, data movement

NoSQL

RDBMS

Hadoop

Data access

Data discovery

Orchestration

Processing and persistence

Data Ingestion

On-premises sources Cloud sources

Page 12: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Data Scientists

Challenges

• Understanding what data is available

• Getting access

• Putting the data to use• Data preparation

• Analytics

• Decisioning

• Connected, always-on data

“Is this the best data available?”

Page 13: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Data Engineers

Challenges

• Modernize & simplify data infrastructure

• Optimize use of storage & compute

• Leverage new technologies

• Safe movement of data to the cloud

“We can make this run faster”

Page 14: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

CIO and CDO

Challenges

• Simplify and Modernize Data infrastructure

• Protect the data

• Compliance

• Data Governance

• Reduce Infrastructure Costs

“Data is secure and protected”

Page 15: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

HYPEREALITY

Page 16: The Evolution of the Data Landscape - Sas Institute · 2019-05-20 · MS SQL Server MySQL PostgreSQL Hadoop Cloudera MapR MongoDB Exadata Netezza HP Neoview Greenplum Hortonworks,

sas.com

Copy r i g ht © S A S I ns t i t ut e I nc. A l l r i g ht s r e se r v e d.

Thank You