Strata Online_road_to_enterprise_data_2011
-
Upload
lynn-langit -
Category
Technology
-
view
978 -
download
2
description
Transcript of Strata Online_road_to_enterprise_data_2011
@LynnLangitPractioner, Author, Instructor
The Road for Enterprise DataFrom Traditional BI to Big Data
BI = ‘Current State’ Questions
• What did we sell?• When did we sell it?• Where did we sell it?• What did we sell with it?
Collecting Transactional
data
BTW…Do you use Data Mining?
BI Data Landscape
StorageProcessing
Query
Presentation
Mix-in #1 -- the Cloud and…
• Host Data in the Cloud• Process & Query Data in the Cloud– Click to query and (data) mine– Return the data locally– Use Self-service BI visualizers
• Mash-up Cloud data – Combine with local data
NoSQL and the Cloud
• The Elephant in the room…Hadoop• Over 120+ types of noSQL databases– http://nosql-database.org/
Can’t We All Play Together?
Data in the Cloud - Microsoft
Windows Azure DataMarket
Amazon AWS
Google App Engine Data
New on Google – MySQL++
Comparing RDBMS and MapReduce
Traditional RDBMS MapReduce
Data Size Gigabytes (Terabytes) Petabytes (Hexabytes)
Access Interactive and Batch Batch
Updates Read / Write many times Write once, Read many times
Structure Static Schema Dynamic Schema
Integrity High (ACID) Low
Scaling Nonlinear Linear
DBA Ratio 1:40 1:3000
Reference: Tom White’s Hadoop: The Definitive Guide
BTW…NoSQL is 50x CHEAPER
BigData = ‘Next State’ Questions
• What could happen?• Why didn’t this happen?• When will the next new thing
happen?• What will the next new thing
be?
Collecting behavioral
data
Splunk
Mining Log Files
Presenting the results
Freebase
Mix-in #2 - Data Scientists
• Who asks the ‘right’ questions now? • Who understands the languages? • Who can understand the results?
Is Data Science your next Career?
Becoming a Data Scientist
• Conferences– Strata – Data Scientist
Summit– CloudCamps
• Practice– here
Mix-in #3 - Presentation
• New Devices – iPad, Kindle Fire• New User Experiences – touch, Kinect• EVERYTHING on the phone
HortonWorks, Cloudera…
Karmasphere Studio for Amazon Elastic MapReduce
More PowerPivot
Cloud-based Data Mining Predixion
QlikView
QlikView on iPad
BI >BigData ‘To Do ListStore some (more) data on the cloud• Relational and non-relational• Transaction AND Behavioral
Process some data in the cloud• Try data mining• Learn about Data Science
Update your client tools• New UI (touch, gestures)• Click to Query• New form factors (phone, tablet)
Hadoop Connector to Excel - Demo
www.TeachingKidsProgramming.org
• Do a Recipe Teach a Kid (Ages 10 ++)• Microsoft SmallBasic Free Courseware (recipes)
Keep up with Big Data
Follow me @LynnLangit
RSS my blog www.LynnLangit.com
Hire me• To help build your BI/Big Data solution• To teach your team next gen BI