Hadoop in the Cloud
-
Upload
jim-oneil -
Category
Technology
-
view
1.218 -
download
0
description
Transcript of Hadoop in the Cloud
DIY
Roll your own Hadoop cluster….welcome to DevOps
Pallet
“Isotope”
Appliances
Oracle Big Data Appliance– 18 server / 12 core each / 40Gb Infiniband– Partnering with Cloudera on the distribution
Greenplum HD Data Computing Appliance – 18 nodes, 12 core each– Straight up Apache Hadoop
NetApp Open Solution for Hadoop– Storage arrays only (E2660 and FAS2040)– Partnership with Cloudera
The Elephant in the CloudJim O’NeilDeveloper Evangelist, [email protected] @jimoneil
Cloud: a Notional Definition
Essential Characteristics
On-demand self-service
Broad network access
Resource Pooling
Rapid Elasticity
Measured serviceServ
ice
Mod
elsInfrastructure as a Service
Platform as a Service
Software as a Service
Deployment Models
Private Cloud
Public Cloud
Hybrid Cloud
Community Cloud
Hadoop in the Cloud
Google App Engine
appengine-mapreduce API (not really Hadoop)
Amazon Web Services66 Public AMIs (including Cloudera)Elastic Map Reduce
Windows AzureHadoop on Azure
IBM SmartCloudInfosphere BigInsights
Google App Engine
MapreducePipeline Class
Experimental!
Mapreduce is an experimental, innovative, and rapidly changing new feature for App Engine. Unfortunately, being on the bleeding edge means that we may make backwards-incompatible changes to Mapreduce. We will inform the community when this feature is no longer experimental.
Amazon EMR
u
Windows Azure
http://HadoopOnAzure.com
Currently in Customer Technology PreviewPartnership with Hortonworks
Windows updates to ApacheJavaScript frameworkHive ODBC connector
IBM SmartCloud
InfoSphere BigInsightsIBM distribution of Hadoop (0.20.2)Jaql query languageBigSheetsBigInsight Scheduler“Hadoop ecosystem”
Hive, Avro, Hbase, Pig, Oozie, Flume
Jim O’Neil Developer Evangelist, Microsoft
[email protected] @jimoneil
I meant what I said, and I said what I meant.
An elephant's faithful, one hundred percent.