Big Data Wonderland: Two Views on the Big Data Revolution

26
Big Data Wonderland: Two Views on the Big Data Revolution Mark Madsen Third Nature, Inc. [email protected] @markmadsen Marc Demarest Noumenal, Inc. [email protected] Strata Santa Clara February 2013

description

To kick off the Big Data for Enterprise IT Day, we present two views of big data. Is it truly something new, or just an evolution of what we have already? Join us for an interesting and entertaining talk that will help frame your thinking on big data. We take on the roles of former bosses: the techno-lustful and the luddite, and debate the key talking points put forth in the market. An earlier video of this talk can be seen at http://www.youtube.com/watch?v=qnHHOWz5uvM

Transcript of Big Data Wonderland: Two Views on the Big Data Revolution

Page 1: Big Data Wonderland: Two Views on the Big Data Revolution

Big Data Wonderland:Two Views on the Big Data Revolution

Mark MadsenThird Nature, [email protected]@markmadsen

Marc DemarestNoumenal, [email protected]

Strata Santa ClaraFebruary 2013

Page 2: Big Data Wonderland: Two Views on the Big Data Revolution

2 Third Nature, Inc. || Noumenal, Inc.

PreambleTwenty Years On

• We came up together in this industry in the early 1990s, as pointy-headed advocates of star schema design, trained by the deity himself, Ralph Kimball

• Back then, it was a simpler world...big iron, big DBMS, hand-coded ETL, star schema, a thousand rinky-dink query tools

• Mostly, conversation was dominated by ETL and schema design

• “There will never be a decisional database larger than 10 GB...”

St. Ralph

Our Alma Mater

Page 3: Big Data Wonderland: Two Views on the Big Data Revolution

3 Third Nature, Inc. || Noumenal, Inc.

PreambleTwenty Years On

• Twenty years on, we find ourselves with opposing view on what is either the biggest con, or the biggest sea-change, in our data warehousing odyssey

• Question: Is the big data revolution big, or a revolution?

• Question: do we have to change? and if so, how?

• Not a round table. A slugfest....

Demarest asShana Alexander?

Madsen asJack Kilpatrick?

Page 4: Big Data Wonderland: Two Views on the Big Data Revolution

4 Third Nature, Inc. || Noumenal, Inc.

Regular Programming Is Suspended

Demarest Madsen

Page 5: Big Data Wonderland: Two Views on the Big Data Revolution

5 Third Nature, Inc. || Noumenal, Inc.

Compromise

Demarest Madsen

You take the blue pill. The story ends, you wake up in your bed and believe whatever you want to believe.

You take the red pill, you stay in Wonderland,

and I show you how deep the rabbit hole

goes.

Remember, all I am offering is the truth:

nothing more.

Page 6: Big Data Wonderland: Two Views on the Big Data Revolution

6 Third Nature, Inc. || Noumenal, Inc.

The Issues1. Data As A Factor of Production

RED BLUEAmen.

This change has been in process for more

than a decade. Social media leads the way, but we’re all affected.

Page 7: Big Data Wonderland: Two Views on the Big Data Revolution

7 Third Nature, Inc. || Noumenal, Inc.

The Issues1. Data As A Factor of Production

RED BLUEAmen.

This change has been in process for more

than a decade. Social media leads the way, but we’re all affected.

Hype.

For most companies, data remains an

asset, but not a factor in the production of its products or services.

Page 8: Big Data Wonderland: Two Views on the Big Data Revolution

8 Third Nature, Inc. || Noumenal, Inc.

The Issues2. The Reality of Big Data

RED BLUEFew companies

transformed.

No quantification of benefits, right now.

Leverage? Maybe.

Page 9: Big Data Wonderland: Two Views on the Big Data Revolution

9 Third Nature, Inc. || Noumenal, Inc.

The Issues2. The Reality of Big Data

RED BLUENo company escapes.

Text, social, sensors, streaming -- the

instrumentation of the real world transforms company decision-making processes.

Few companies transformed.

No quantification of benefits, right now.

Leverage? Maybe.

Page 10: Big Data Wonderland: Two Views on the Big Data Revolution

10 Third Nature, Inc. || Noumenal, Inc.

The Issues3. Merchant DBMSs

RED BLUEIncreasingly irrelevant.

We’ve been over-structured and under-

resourced for 20 years.

CSV is still the international standard.

Page 11: Big Data Wonderland: Two Views on the Big Data Revolution

11 Third Nature, Inc. || Noumenal, Inc.

The Issues3. Merchant DBMSs

RED BLUEIncreasingly irrelevant.

We’ve been over-structured and under-

resourced for 20 years.

CSV is still the international standard.

Will rise to the challenge.

Any worthwhile innovation will be absorbed by the merchant DBMS

players.

Page 12: Big Data Wonderland: Two Views on the Big Data Revolution

12 Third Nature, Inc. || Noumenal, Inc.

The Issues4. Query, Reporting & Dashboarding Tools

RED BLUEWill rise to the

challenge.

We have two generations of

analysts trained to feed using these tools.

Page 13: Big Data Wonderland: Two Views on the Big Data Revolution

13 Third Nature, Inc. || Noumenal, Inc.

The Issues4. Query, Reporting & Dashboarding Tools

RED BLUEIneffective, now and in

the future.

Can’t do real-time, can’t visualize large

data sets, can’t support discovery and

exploration.

Will rise to the challenge.

We have two generations of

analysts trained to feed using these tools.

Page 14: Big Data Wonderland: Two Views on the Big Data Revolution

14 Third Nature, Inc. || Noumenal, Inc.

The Issues5. The Commodity Hardware Revolution & Radical Scale-Out

RED BLUEThe new topology.

Cheap compute, unintelligent direct-attach storage and free comms make

large scale-out grids the future.

Page 15: Big Data Wonderland: Two Views on the Big Data Revolution

15 Third Nature, Inc. || Noumenal, Inc.

The Issues5. The Commodity Hardware Revolution & Radical Scale-Out

RED BLUEThe new topology.

Cheap compute, unintelligent direct-attach storage and free comms make

large scale-out grids the future.

The current topology is alive and well.

These commodity building blocks are, after all, just SMP

platforms.

Page 16: Big Data Wonderland: Two Views on the Big Data Revolution

16 Third Nature, Inc. || Noumenal, Inc.

The Issues6. Structured Query Language

RED BLUETried-and-True.

Powerful, expressive language for complex analytical problems.

That’s why the noSQLvendors reinvent it all

the time.

Page 17: Big Data Wonderland: Two Views on the Big Data Revolution

17 Third Nature, Inc. || Noumenal, Inc.

The Issues6. Structured Query Language

RED BLUEToast.

Too complex, too hard to code, too hard to

debug. A way of ensuring dependency on merchant DBMSs.

Tried-and-True.

Powerful, expressive language for complex analytical problems.

That’s why the noSQLvendors reinvent it all

the time.

Page 18: Big Data Wonderland: Two Views on the Big Data Revolution

18 Third Nature, Inc. || Noumenal, Inc.

The Issues7. New Programming Models

RED BLUESay hello to Pig.

New analytical problems (decisioning, discovery, exploration)

require new languages, new tools

and new programming models.

Page 19: Big Data Wonderland: Two Views on the Big Data Revolution

19 Third Nature, Inc. || Noumenal, Inc.

The Issues7. New Programming Models

RED BLUESay hello to Pig.

New analytical problems (decisioning, discovery, exploration)

require new languages, new tools

and new programming models.

Say hello to Java.

Open source doesn’t mean free. Or easy.

The skills gap here is huge. And there are

few truly new analytical problems.

Page 20: Big Data Wonderland: Two Views on the Big Data Revolution

20 Third Nature, Inc. || Noumenal, Inc.

The Issues8. Conventional DW Architecture

RED BLUEPerfectly viable.

No need to change anything. Some new

technologies may play roles in the existing

architecture, but we’re good to go, generally.

Page 21: Big Data Wonderland: Two Views on the Big Data Revolution

21 Third Nature, Inc. || Noumenal, Inc.

The Issues8. Conventional DW Architecture

RED BLUEA relic.

Overly complex. Difficult to implement.

Controlled by the supply side of the market, anyway.

Perfectly viable.

No need to change anything. Some new

technologies may play roles in the existing

architecture, but we’re good to go, generally.

Page 22: Big Data Wonderland: Two Views on the Big Data Revolution

22 Third Nature, Inc. || Noumenal, Inc.

The Issues9. The Cloud

RED BLUEWe all go there.

Most of the interesting data is there; it’s more effective to move our

data, and our analyses, to where the

data is, already.

Page 23: Big Data Wonderland: Two Views on the Big Data Revolution

23 Third Nature, Inc. || Noumenal, Inc.

The Issues9. The Cloud

RED BLUEWe all go there.

Most of the interesting data is there; it’s more effective to move our

data, and our analyses, to where the

data is, already.

Don’t go there.

Public cloud security is an oxymoron.

Your inside-the-firewall apps remain the core

information asset.

Page 24: Big Data Wonderland: Two Views on the Big Data Revolution

24 Third Nature, Inc. || Noumenal, Inc.

The Issues10. New Technologies

RED BLUEDistract Us.

We’ve already seen what best-of-breed gives us: a circus.

Page 25: Big Data Wonderland: Two Views on the Big Data Revolution

25 Third Nature, Inc. || Noumenal, Inc.

The Issues10. New Technologies

RED BLUESave Us.

Best of breed integration led by in-house designers ins

back, with a vengeance.

Distract Us.

We’ve already seen what best-of-breed gives us: a circus.

Page 26: Big Data Wonderland: Two Views on the Big Data Revolution

26 Third Nature, Inc. || Noumenal, Inc.

What We Really Think1. Data As A Factor of Production

2. The Reality of Big Data

3. Merchant DBMSs

4. Query, Reporting & Dashboarding Tools

5. The Commodity Hardware Revolution

6. Structured Query Language

7. New Programming Models

8. Conventional DW Architecture

9. The Cloud

10. New Technologies