FOR A FEW TERABYTES MORE THE GOOD, THE …...FOR A FEW TERABYTES MORE – THE GOOD, THE BAD and THE...
Transcript of FOR A FEW TERABYTES MORE THE GOOD, THE …...FOR A FEW TERABYTES MORE – THE GOOD, THE BAD and THE...
FOR A FEW TERABYTES MORE –
THE GOOD, THE BAD and THE BIG DATA
Cenk Kiral
Senior Director of BI&EPM solutions – ECEMEA region
Big Data Buzz
“Big data, analytics get even bigger, hotter in 2012” InfoWorld – 12/30/11
“The promise of big data” Intelligent Utility - 8/28/11
“Are you ready for the era of big data?” McKinsey Quarterly - 11/11
“Health care is next frontier for big data” Wall Street Journal – 1/19/12
“Big data: science’s microscope of the 21st century” Business Week – 11/8/11
“Decisions, decisions…will big data have big impact?” Financial Times – 1/24/12
Source: McKinsey Global Institute 2011 May
30 billion Pieces of content shared
On Facebook every month
5 billion Mobile phones
in use in 2010
$600 To buy a disk drive that can
store all of the world’s music
40% Projected growth in global data
Generated per year vs. 5% Growth in global
IT spending
235 Terabytes data collected by the US
Library of Congress in April 2011 15 out of 17
Sectors in the US have more data
stored per company than the US
Library of congress
Big Data – a growing torrent
Why Is Big Data Important?
Source: * McKinsey Global Institute: Big Data – The next frontier for innovation, competition and productivity (May 2011)
US HEALTH CARE
$300 B
“In a big data world, a competitor that fails to sufficiently
develop its capabilities will be left behind.”
Increase industry
value per year by
McKinsey Global Institute
US RETAIL
60+%
Increase net
margin by
MANUFACTURING
–50%
Decrease dev.,
assembly costs by
GLOBAL PERSONAL
LOCATION DATA
$100 B
Increase service
provider revenue by
EUROPE PUBLIC
SECTOR ADMIN
€250 B
Increase industry
value per year by
What Does This Mean for Business
Logistics – Smart
Routing
Remote Medical Monitoring Insurance – Pay as you Drive
Off-Shore Renewable energy
Smart Customers
Low Carbon
Generation Traditional Generation
Transmission & Distribution
Energy Retail
Smart Transmission Substation Smart Transmission Substation
Smart Distribution Substation
Smart Distribution Substation
Electric Vehicles
H
o
m
e
A
r
e
a
N
e
t
w
o
r
k
From other Grids Sens
ors,
Smar
t
switc
hes
Nuclear
Solar farms Wind
New Sensors / Distributed Computing on
Transmission and Distribution Lines
alert operators, fix problems, integrate large-scale renewables generation
Smart Meters and Home Networks help
customers use energy wisely, mitigate peak
demand, integrate local renewables
Utilities – Smart grid
What Makes it Big Data?
VOLUME VELOCITY VARIETY VALUE
SOCIAL
BLOG
SMART
METER
101100101001
001001101010
101011100101
010100100101
Big Data Use Cases
Today’s Challenge New Data What’s Possible
Healthcare
Expensive office visits Remote patient monitoring
Preventive care, reduced
hospitalization
Manufacturing
In-person support Product sensors Automated diagnosis, support
Location-Based Services
Based on home zip code Real time location data
Geo-advertising, traffic, local
search
Public Sector
Standardized services Citizen surveys
Tailored services,
cost reductions
Retail
One size fits all marketing Social media
Sentiment analysis
segmentation
Growth in Enterprise Unstructured Data
How will the
amount of
unstructured
data change
over next 3
years? 4%
2%
6%
9%
47%
33%
Don't know/unsure
Decrease significantly
Decrease moderately
No change
Increase moderately
Increase significantly
Source: Unisphere Research, 2011
Value of Unstructured Data
How important
is unstructured
data to your
business?
Extremely important;
18% Don't know
unsure, 6%
Not important today; 8% Very
important; 39%
Somewhat important;
30%
Source: Unisphere Research, 2011
Unstructured Data
Actually, some of this data is structured (XML, ...)
But it is not in table/column (relational) format
Structured Data
Unstructured Data
85% of data acquired is unstructured
• Field Service notes
• Tech Support memos
• Call center narrative
• Customer comments
• Memos, letters, notes, reports
• E-mail • HTML
documents • PDF documents
• Financial reports
• Product sheets
• Exported data
• News feeds • Web sites • Transcripts • Research data
Oracle Endeca Information Discovery A platform for provisioning data discovery applications across the enterprise
Endeca Information Discovery
helps organizations quickly
explore all relevant data
• Combine structured & unstructured
data from disparate systems
• Automatically organize information
for search, discovery & analysis
• Rapidly assemble easy to use
analysis applications
Faceted Data Model
Integration Enrichment Indexing
Unified
Querying
Interactive
Exploration
App
Composition
Oracle NoSQL
Database
HDFS
Enterprise
Applications Oracle Data Integrator
Oracle Big Data Connectors
Hadoop (MapReduce)
Oracle Integrated Solution Stack for Big Data
ACQUIRE ORGANIZE DECIDE
Analytic
Applications
ANALYZE
In-D
ata
base
An
aly
tics
Data
Warehouse
Oracle Exalytics Hardware
Engineered for extreme analytics
•40 Intel processor cores
•1 Terabyte main memory
•40 Gb InfiniBand connection to Oracle Exadata
Oracle Exalytics Software
•Oracle TimesTen In-Memory Database
‒Adaptive in-memory caching of analytics
‒In-memory columnar compression
‒Tightly integrated with Oracle Exadata
‒Enables speed-of-thought visualization
•Oracle Business Intelligence Foundation Suite