Big Data Wonderland: Two Views on the Big Data Revolution
-
Upload
mark-madsen -
Category
Technology
-
view
105 -
download
1
description
Transcript of Big Data Wonderland: Two Views on the Big Data Revolution
Big Data Wonderland:Two Views on the Big Data Revolution
Mark MadsenThird Nature, [email protected]@markmadsen
Marc DemarestNoumenal, [email protected]
Strata Santa ClaraFebruary 2013
2 Third Nature, Inc. || Noumenal, Inc.
PreambleTwenty Years On
• We came up together in this industry in the early 1990s, as pointy-headed advocates of star schema design, trained by the deity himself, Ralph Kimball
• Back then, it was a simpler world...big iron, big DBMS, hand-coded ETL, star schema, a thousand rinky-dink query tools
• Mostly, conversation was dominated by ETL and schema design
• “There will never be a decisional database larger than 10 GB...”
St. Ralph
Our Alma Mater
3 Third Nature, Inc. || Noumenal, Inc.
PreambleTwenty Years On
• Twenty years on, we find ourselves with opposing view on what is either the biggest con, or the biggest sea-change, in our data warehousing odyssey
• Question: Is the big data revolution big, or a revolution?
• Question: do we have to change? and if so, how?
• Not a round table. A slugfest....
Demarest asShana Alexander?
Madsen asJack Kilpatrick?
4 Third Nature, Inc. || Noumenal, Inc.
Regular Programming Is Suspended
Demarest Madsen
5 Third Nature, Inc. || Noumenal, Inc.
Compromise
Demarest Madsen
You take the blue pill. The story ends, you wake up in your bed and believe whatever you want to believe.
You take the red pill, you stay in Wonderland,
and I show you how deep the rabbit hole
goes.
Remember, all I am offering is the truth:
nothing more.
6 Third Nature, Inc. || Noumenal, Inc.
The Issues1. Data As A Factor of Production
RED BLUEAmen.
This change has been in process for more
than a decade. Social media leads the way, but we’re all affected.
7 Third Nature, Inc. || Noumenal, Inc.
The Issues1. Data As A Factor of Production
RED BLUEAmen.
This change has been in process for more
than a decade. Social media leads the way, but we’re all affected.
Hype.
For most companies, data remains an
asset, but not a factor in the production of its products or services.
8 Third Nature, Inc. || Noumenal, Inc.
The Issues2. The Reality of Big Data
RED BLUEFew companies
transformed.
No quantification of benefits, right now.
Leverage? Maybe.
9 Third Nature, Inc. || Noumenal, Inc.
The Issues2. The Reality of Big Data
RED BLUENo company escapes.
Text, social, sensors, streaming -- the
instrumentation of the real world transforms company decision-making processes.
Few companies transformed.
No quantification of benefits, right now.
Leverage? Maybe.
10 Third Nature, Inc. || Noumenal, Inc.
The Issues3. Merchant DBMSs
RED BLUEIncreasingly irrelevant.
We’ve been over-structured and under-
resourced for 20 years.
CSV is still the international standard.
11 Third Nature, Inc. || Noumenal, Inc.
The Issues3. Merchant DBMSs
RED BLUEIncreasingly irrelevant.
We’ve been over-structured and under-
resourced for 20 years.
CSV is still the international standard.
Will rise to the challenge.
Any worthwhile innovation will be absorbed by the merchant DBMS
players.
12 Third Nature, Inc. || Noumenal, Inc.
The Issues4. Query, Reporting & Dashboarding Tools
RED BLUEWill rise to the
challenge.
We have two generations of
analysts trained to feed using these tools.
13 Third Nature, Inc. || Noumenal, Inc.
The Issues4. Query, Reporting & Dashboarding Tools
RED BLUEIneffective, now and in
the future.
Can’t do real-time, can’t visualize large
data sets, can’t support discovery and
exploration.
Will rise to the challenge.
We have two generations of
analysts trained to feed using these tools.
14 Third Nature, Inc. || Noumenal, Inc.
The Issues5. The Commodity Hardware Revolution & Radical Scale-Out
RED BLUEThe new topology.
Cheap compute, unintelligent direct-attach storage and free comms make
large scale-out grids the future.
15 Third Nature, Inc. || Noumenal, Inc.
The Issues5. The Commodity Hardware Revolution & Radical Scale-Out
RED BLUEThe new topology.
Cheap compute, unintelligent direct-attach storage and free comms make
large scale-out grids the future.
The current topology is alive and well.
These commodity building blocks are, after all, just SMP
platforms.
16 Third Nature, Inc. || Noumenal, Inc.
The Issues6. Structured Query Language
RED BLUETried-and-True.
Powerful, expressive language for complex analytical problems.
That’s why the noSQLvendors reinvent it all
the time.
17 Third Nature, Inc. || Noumenal, Inc.
The Issues6. Structured Query Language
RED BLUEToast.
Too complex, too hard to code, too hard to
debug. A way of ensuring dependency on merchant DBMSs.
Tried-and-True.
Powerful, expressive language for complex analytical problems.
That’s why the noSQLvendors reinvent it all
the time.
18 Third Nature, Inc. || Noumenal, Inc.
The Issues7. New Programming Models
RED BLUESay hello to Pig.
New analytical problems (decisioning, discovery, exploration)
require new languages, new tools
and new programming models.
19 Third Nature, Inc. || Noumenal, Inc.
The Issues7. New Programming Models
RED BLUESay hello to Pig.
New analytical problems (decisioning, discovery, exploration)
require new languages, new tools
and new programming models.
Say hello to Java.
Open source doesn’t mean free. Or easy.
The skills gap here is huge. And there are
few truly new analytical problems.
20 Third Nature, Inc. || Noumenal, Inc.
The Issues8. Conventional DW Architecture
RED BLUEPerfectly viable.
No need to change anything. Some new
technologies may play roles in the existing
architecture, but we’re good to go, generally.
21 Third Nature, Inc. || Noumenal, Inc.
The Issues8. Conventional DW Architecture
RED BLUEA relic.
Overly complex. Difficult to implement.
Controlled by the supply side of the market, anyway.
Perfectly viable.
No need to change anything. Some new
technologies may play roles in the existing
architecture, but we’re good to go, generally.
22 Third Nature, Inc. || Noumenal, Inc.
The Issues9. The Cloud
RED BLUEWe all go there.
Most of the interesting data is there; it’s more effective to move our
data, and our analyses, to where the
data is, already.
23 Third Nature, Inc. || Noumenal, Inc.
The Issues9. The Cloud
RED BLUEWe all go there.
Most of the interesting data is there; it’s more effective to move our
data, and our analyses, to where the
data is, already.
Don’t go there.
Public cloud security is an oxymoron.
Your inside-the-firewall apps remain the core
information asset.
24 Third Nature, Inc. || Noumenal, Inc.
The Issues10. New Technologies
RED BLUEDistract Us.
We’ve already seen what best-of-breed gives us: a circus.
25 Third Nature, Inc. || Noumenal, Inc.
The Issues10. New Technologies
RED BLUESave Us.
Best of breed integration led by in-house designers ins
back, with a vengeance.
Distract Us.
We’ve already seen what best-of-breed gives us: a circus.
26 Third Nature, Inc. || Noumenal, Inc.
What We Really Think1. Data As A Factor of Production
2. The Reality of Big Data
3. Merchant DBMSs
4. Query, Reporting & Dashboarding Tools
5. The Commodity Hardware Revolution
6. Structured Query Language
7. New Programming Models
8. Conventional DW Architecture
9. The Cloud
10. New Technologies