Big Data - What is it Really About?
© TCC 2014, Confidential and Proprietary
BIG DATA: WHAT’S IT REALLY ABOUT?
Rich Brueckner, President, InsideBIGDATA
AGENDA
• Proper Intro
• It’s Not the Data. It’s What You Use it for.
• How Big Data Got Me Here
• Case Studies
• HPC and Big Data
• Trends - What’s Next?
• Call to Action
THE PROPER INTRODUCTION
insideBIGDATA.com
WHAT BIG DATA IS NOT
• Size matters not.
• It’s not about the Data, it’s what you do with it.
• Deriving insight from Data for purposes for which it was never intended.
FOR TODAY’S DISCUSSION:
BIG DATA = HIGH PERFORMANCE DATA ANALYSIS
BIG DATA IS ABOUT TWO THINGS
First it’s about Money, Lots of Money
SO WHAT’S THE SECOND THING?
“Bayesian probability provides a rational method for updating beliefs.”
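As a rough illustration of what "updating beliefs" means in practice, the sketch below applies Bayes' rule repeatedly. The prior of 0.5 and the likelihoods 0.6 and 0.2 are hypothetical numbers chosen for the example, and `bayes_update` is an illustrative helper name, not something from the talk.

```python
def bayes_update(prior: float, p_evidence_if_true: float, p_evidence_if_false: float) -> float:
    """Return P(H | E) from P(H), P(E | H) and P(E | not H) via Bayes' rule."""
    numerator = p_evidence_if_true * prior
    evidence = numerator + p_evidence_if_false * (1.0 - prior)
    return numerator / evidence


# Hypothetical setup: start undecided (50/50), then observe the same kind of
# evidence three times. Each observation is three times as likely if the
# hypothesis is true (0.6) as if it is false (0.2).
belief = 0.5
for _ in range(3):
    belief = bayes_update(belief, p_evidence_if_true=0.6, p_evidence_if_false=0.2)
    print(round(belief, 3))  # 0.75, then 0.9, then 0.964
```

Each piece of evidence moves the degree of belief; none of them makes it certain.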
Big Data is about Degrees of Belief.
…even about how we feel about data itself.
Can I prove that Big Data is about Degrees of Belief?
These 12 words just cost Facebook $18 billion of value:
"We did see a decrease in daily users specifically among younger teens."
BIG DATA - WHAT’S CHANGED?
http://bit.ly/1gZ7ZbD
• The nature of the Data: From Sampling to Full Datasets
• Rise of unstructured data
• Acceptance of messiness in the data
• N=All
THE BIG DATA FRONTIER
There is no Eminent Domain
SO WHERE ARE WE HEADED IN THIS TALK?
BEWARE THE BIG DATA NAYSAYERS: WHAT IS THEIR AGENDA?
IF BIG DATA IS SO POWERFUL, WHY CAN’T IT PREDICT THE ECONOMY?
The Stock Market is all about Degrees of Belief!
• The economic system is non-linear.
• Therefore, even a small stimulus can create a completely unexpected result.
• It’s a complex dynamical system where you don’t have clear knowledge of the initial conditions or the conditions of the stimulus.
• Big Data can help you see averages, find the needle in the haystack, and help identify accurate models for predicting what the market is really going to do.
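To make the point about non-linearity and unknown initial conditions concrete, here is a minimal sketch using the logistic map, a standard textbook example of a chaotic system. It is a stand-in chosen for illustration, not a model of the economy, and the starting value 0.2 and the perturbation of one part in a billion are arbitrary assumptions.

```python
def logistic_trajectory(x0: float, r: float = 4.0, steps: int = 50) -> list:
    """Iterate the logistic map x -> r * x * (1 - x), returning every state visited."""
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs


# Two runs whose starting points differ by only one part in a billion.
a = logistic_trajectory(0.2)
b = logistic_trajectory(0.2 + 1e-9)

# Print how far apart the two runs are at selected steps.
for step in (0, 10, 20, 30, 40, 50):
    print(step, abs(a[step] - b[step]))
```

The gap between the two runs grows from negligible to as large as the values themselves, which is why a tiny error in the initial conditions can ruin a long-range forecast.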
HOW BIG DATA GOT ME HERE
• REACH – Measure of total audience size.
• RESONANCE – How much activity someone creates when he/she publishes.
• RELEVANCE – How relevant someone is to a topic.
CASE STUDY: SUMO FRAUD
• Match-Fixing Scandal in 2011
• Discovered through Big Data analysis.
• Proven by Text Messages.
• $200 tickets and $1 Million Champions.
CASE STUDY: PREDICTION MACHINE
• Tool simulates each and every game 50,000 times before making a pick.
• 32 million Americans spend an average of $467 a year playing, or about $15 billion in total.
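The idea of simulating a game many times before making a pick can be sketched as a simple Monte Carlo estimate. The scoring model below (each team's score drawn from a normal distribution) and all of its numbers are hypothetical assumptions for illustration; the slide does not describe how the actual tool models a game.

```python
import random


def simulate_matchup(mean_a: float, mean_b: float, spread: float,
                     runs: int = 50_000, seed: int = 0) -> float:
    """Estimate P(team A beats team B) by simulating the game `runs` times."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    wins_a = 0
    for _ in range(runs):
        # Hypothetical model: each score is an independent normal draw.
        score_a = rng.gauss(mean_a, spread)
        score_b = rng.gauss(mean_b, spread)
        if score_a > score_b:
            wins_a += 1
    return wins_a / runs


# Assumed inputs: team A averages 24 points, team B 21, both with a spread of 10.
p = simulate_matchup(mean_a=24.0, mean_b=21.0, spread=10.0)
print(f"Estimated win probability for team A: {p:.3f}")
```

The pick is then whichever side wins more of the simulated games, and the fraction of wins doubles as a degree of belief in that pick.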
CASE STUDY: MORTGAGE FRAUD
LexisNexis Risk Analysis
• Developed HPCC technology - an HPC alternative to Hadoop
• Database has 270 Million Individuals in the US Alone
• They know you’re that John Smith
• Graph Analysis spots Relationships
• Ability to spot mortgage fraud rings that were previously undetectable
Annual Fraud Estimates:
• California at $864 million
• New York at $278 million
• Florida at $273 million
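One simple way graph analysis can surface relationships is to link people who appear on the same transaction and then look for connected groups. The sketch below does that with a plain depth-first search; it is a toy illustration under assumed data, not the LexisNexis or HPCC method, and `find_rings` and the sample names are hypothetical.

```python
from collections import defaultdict


def find_rings(links, min_size=3):
    """Group people who are connected, directly or indirectly, by shared links."""
    neighbors = defaultdict(set)
    for a, b in links:
        neighbors[a].add(b)
        neighbors[b].add(a)

    seen = set()
    rings = []
    for start in sorted(neighbors):
        if start in seen:
            continue
        # Depth-first walk collects everyone reachable from `start`.
        component, stack = set(), [start]
        while stack:
            person = stack.pop()
            if person in component:
                continue
            component.add(person)
            stack.extend(neighbors[person] - component)
        seen |= component
        if len(component) >= min_size:
            rings.append(sorted(component))
    return rings


# Hypothetical links: each pair appeared together on one transaction.
links = [
    ("buyer_1", "appraiser_1"),
    ("appraiser_1", "broker_1"),
    ("broker_1", "buyer_2"),
    ("buyer_3", "broker_2"),
]
print(find_rings(links))
# [['appraiser_1', 'broker_1', 'buyer_1', 'buyer_2']]
```

No single transaction ties buyer_1 to buyer_2, yet the graph shows they belong to the same group, which is the kind of indirect relationship that record-by-record review tends to miss.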
CASE STUDY: BIG BROTHER WATCHES BIG BROTHER
WORLDS CONVERGING: HPC AND BIG DATA
Venus (HPC)
Mars (Big Data)
PAYPAL CASE STUDY: HPC IN THE ENTERPRISE
“Examples of large organizations using HPC include PayPal, which IDC estimates has saved over $700 million by adopting HPC for real-time detection of online consumer fraud.”
- Steve Conway, IDC
HPC - WHAT’S CHANGED?
2013: Tianhe-2, 3,120,000 cores, 33.86 Petaflops (10^15)
1986: CRAY X-MP/4, 4 Vector Processors, 800 Mflops (10^6)
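Taking the slide's two performance figures at face value, a short calculation shows the size of the jump. The variable names are illustrative, and the doubling-time figure assumes smooth exponential growth between the two dates, which is a simplification.

```python
import math

tianhe2_flops = 33.86e15   # 33.86 Petaflops (2013)
cray_xmp_flops = 800e6     # 800 Mflops (1986)

speedup = tianhe2_flops / cray_xmp_flops
years = 2013 - 1986

# If growth were a steady exponential, how long would each doubling take?
doubling_time_years = years / math.log2(speedup)

print(f"Speedup: about {speedup / 1e6:.0f} million times")   # about 42 million times
print(f"Implied doubling time: {doubling_time_years:.2f} years")
```

On these numbers, performance doubled roughly every year or so for 27 years running.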
THE HPC FRONTIER
TRENDS
• Rise of Real-Time Analytics through in-memory technologies
• Enterprises adopt HPC technologies into workflow
• Internet of Things feeds the Big Data phenomenon and ends up swallowing Big Data as a meme.
• bitly.com/theObserverEffect
SUMMARY
Big Data is about two things:
• Money! Making More and Keeping it
• Degrees of Belief
CALL TO ACTION
• Check out insideBIGDATA.com
• Buy this book:
• Big Data: A Revolution That Will Transform How We Live, Work, and Think by Viktor Mayer-Schonberger and Kenneth Cukier
Read my SCI-FI Original:
The Observer Effect
http://bit.ly/theobservereffect
POLL: SO HOW DID I DO TONIGHT?
PLEASE LET ME KNOW HOW I DID!
“Big Data: What’s It Really About?”
1) WITH YOUR MOBILE DEVICE: In the TCCLive mobile app, go to the “Agenda” section, then tap “Surveys”
- OR -
2) FILL OUT THE PAPER VERSION given to you at registration
Thank You For Attending!