The Big Deal about Big Data - CIO Summits Big... · •20 years in Large scale Data Warehouse •7...
Transcript of The Big Deal about Big Data - CIO Summits Big... · •20 years in Large scale Data Warehouse •7...
The Big Deal about Big Data
Agile principles to drive Adoption of Advanced Analytics
Oliver Ratzesberger
VP Information Analytics & Innovation
@ratzesberger
Oliver Ratzesberger – VP Analytics & Innovation
• 20 years in Large scale Data Warehouse
• 7 years at eBay – Analytics Platform
Teradata
Hadoop
200PB of infrastructure – largest commercial database sized for >50PB of raw data
• At Sears Holdings/MetaScale since October 2011
Transforming a legacy icon into an Analytical Competitor.
What is BigData?
PetaBytes of information
Hundreds of Millions of Customers
Complex/Semi/Unstructured Data
NoSQL/MapReduce/MPP/Hadoop
Data Science & Data Visualization
Advanced Algorithms & Predictive Technologies
Natural Language & Image Processing
Sensor Data
Sentiment Analysis
BigData at Sears Holding
3.5PB Teradata 2.5PB Hadoop
>5 Million requests per day
Consolidating all Data Marts into a Single Version of the Truth
Simplicity
Occam’s Razor:
“simpler explanations are …
generally better than more
complex ones”
The simple solution is
easy to explain, implement,
and maintain
Design for the Unknown
“Of design for analytics platforms - Perfect is Wasteful”
Friction to change & code weight are the antithesis of agility
Time to Market ( is everything …)
Are your Analytical needs getting stuck in traffic?
The Foundation
Technology Platform
Storage and processing platforms, Teradata & Hadoop, and data interconnect services
Analytics as a Service (A3S)
Reusable, powerful, and integrated analytics services that automates the actions in an analytics environment.
This enables rapid deployment of a high-quality feature rich collaborative analytics environment that will
empower users to be radically more self sufficient, be more productive, and achieve better results.
Insights Platform
Advanced analytics products with out of the box segmentation, trending, alerting, experimentation, etc.
capabilities supporting extremely large data sets
Ser
vice
s, T
rain
ing
, Su
pp
ort
Dev
elo
per
Pla
tfo
rm
Examples Usecases
Analytics as a Service (A3S)
Insights Platform
nSegment nTrend nAlert nExperiment
Operational Data
Engine
Insights Hub
Monitoring
Virtual Data
Containers
Data Movement
Service
Search
Activity Based
Chargeback
Data Profiling
Services
Security
Best practices
compliance
Database Marketing Loyalty Programs Gamification Store Operations
Example Data Engine for Segmentation
The importance of KPIs
Scrum – Adopting an Agile Methodology
Amount of Change
Competing Priorities in Technology
What is DevOps?
• Blend of
Agile Development AND
Agile Operations
• Software development methods that stress
communication and collaboration
• Developing the 1st line of code with
Operations in mind
Developer Platform
A BigData Organizational Example
Analytics & Innovation
Architecture Operations Business
Applications Product
Management Product
Development Data Science
Labs Offshore COE
CTO Analytics & Innovation
Data Science Labs
Dedicated Data Scientist Labs Organization
Center of Excellence for
• Advanced Algorithms
• Predictive Technologies
• Visualization Technologies
Assigned to the top priority initiatives of the enterprise
Separating GOOD from BAD
SEARS HOLDING CORPORATION COPYRIGHT 2012 19
Consistent Simplicity
SEARS HOLDING CORPORATION COPYRIGHT 2012 20
Data Science - When the AVERAGE is useless
SEARS HOLDING CORPORATION COPYRIGHT 2012 21
Questions?
Oliver Ratzesberger
VP Information Analytics & Innovation
@ratzesberger