BlueData Integration with Apache Ambari

Big Data Infrastructure Made Easy BlueData EPIC Integration with Apache Ambari

Transcript of BlueData Integration with Apache Ambari

Page 1: BlueData Integration with Apache Ambari

Big Data Infrastructure Made Easy

BlueData EPIC Integration with Apache Ambari

Page 2: BlueData Integration with Apache Ambari

Overview of Apache Ambari

Apache Ambari is an open source management console for provisioning, managing, and monitoring Hortonworks Data Platform (HDP) Hadoop clusters.

Ambari provides a single control point for viewing, updating, and managing Hadoop service life cycles, with these important features:

Page 3: BlueData Integration with Apache Ambari

BlueData: Easy, Cost-Effective, On-Demand

EPIC Software Platform

• ElasticPlane: Self-service, multi-tenant clusters

• DataTap: In-place access to enterprise data stores

• IOBoost: Virtualization with bare-metal performance

Data stores: Gluster, HDFS, Swift, NFS

Tenants: R&D, Manufacturing, Marketing, Sales

• Minutes to spin up a virtual cluster

• Utilization > 90%

• Simplified management

• No duplication of data

• No cluster sprawl

Page 4: BlueData Integration with Apache Ambari

BlueData + Apache Ambari 1.7 Integration

Benefits and Features

Infrastructure agility, elasticity, and efficiency – virtual HDP clusters with the functionality and performance of physical clusters

• Auto-provisioning of VM hosts with Ambari server and agent components

• Automated, transparent deployment of HDP using REST API for Stacks and Services

Time savings for data scientists and Big Data administrators

• Self-service virtual cluster creation by data scientists or business analysts

• Troubleshooting and management by Big Data admins using Ambari

Administrator productivity & flexibility

• Ambari for monitoring, fine-grained configuration, and enterprise support

Page 5: BlueData Integration with Apache Ambari

Delivering self-service with Ambari

Self-service web interface – define a cluster with a few mouse clicks

* Example screenshot from BlueData integration with Apache Ambari

Page 6: BlueData Integration with Apache Ambari

Delivering self-service with Ambari

Creating virtual Hadoop clusters with the Ambari console within minutes

* Example screenshot from BlueData integration with Apache Ambari

Page 7: BlueData Integration with Apache Ambari

Delivering self-service with Ambari

Creating virtual Hadoop clusters within minutes

* Example screenshot from BlueData integration with Apache Ambari

Page 8: BlueData Integration with Apache Ambari

Delivering self-service with Ambari

Hadoop cluster provisioning using Ambari API

Design optimized for cluster creation speed and user feedback

Phase 1: VMs

• Self-service request

• VMs provisioned

• Ambari server & agents pre-deployed

• HDFS dependency removed

Phase 2: Core Stack

• Agent registration with server

• REST API call to deploy HDP stack

• REST API call to create core-site.xml to use BlueData HDFS abstraction

• Start YARN/MRv2

• Shut down HDFS service

Phase 3: Services

• Add specific services requested by end user via REST API calls

• Start ‘compute’ services (e.g. Hive, Pig) requested by user

• Update status of cluster
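The Phase 2 and Phase 3 steps above can be sketched as an ordered list of Ambari REST requests. This is a minimal illustration, not BlueData's actual implementation: it only builds the requests (method, path, JSON body) and sends nothing. The resource paths and body shapes follow the public Ambari v1 REST API as commonly documented, but the helper names (`registered_hosts`, `set_service_state`, `build_plan`), the config tag, the cluster name, and the `fs.defaultFS` value are hypothetical placeholders. A real deployment also has to create components and host components and install services before starting them, which is omitted here.

```python
API = "/api/v1"
# Ambari expects an X-Requested-By header on modifying requests.
HEADERS = {"X-Requested-By": "ambari"}


def registered_hosts(hosts_response):
    """Return host names found in a parsed GET /api/v1/hosts response."""
    return {item["Hosts"]["host_name"] for item in hosts_response.get("items", [])}


def set_service_state(cluster, service, state, context):
    """Build a PUT that moves a service to STARTED or INSTALLED (stopped)."""
    body = {
        "RequestInfo": {"context": context},
        "Body": {"ServiceInfo": {"state": state}},
    }
    return ("PUT", f"{API}/clusters/{cluster}/services/{service}", body)


def build_plan(cluster, stack_version, default_fs, compute_services):
    """Return (method, path, body) tuples in the order the slide lists them."""
    plan = []

    # Phase 2: create the cluster on the requested HDP stack version.
    plan.append(("POST", f"{API}/clusters/{cluster}",
                 {"Clusters": {"version": stack_version}}))

    # Phase 2: publish a core-site config pointing at the storage abstraction.
    # The tag and the fs.defaultFS value are illustrative assumptions.
    core_site = {
        "type": "core-site",
        "tag": "example-tag-1",
        "properties": {"fs.defaultFS": default_fs},
    }
    plan.append(("PUT", f"{API}/clusters/{cluster}",
                 {"Clusters": {"desired_config": core_site}}))

    # Phase 2: start YARN/MRv2, then stop the HDFS service.
    for service in ("YARN", "MAPREDUCE2"):
        plan.append(set_service_state(cluster, service, "STARTED", f"Start {service}"))
    plan.append(set_service_state(cluster, "HDFS", "INSTALLED", "Stop HDFS"))

    # Phase 3: add the services the user asked for, then start them.
    for service in compute_services:
        plan.append(("POST", f"{API}/clusters/{cluster}/services",
                     {"ServiceInfo": {"service_name": service}}))
    for service in compute_services:
        plan.append(set_service_state(cluster, service, "STARTED", f"Start {service}"))

    return plan


# Hypothetical inputs, chosen only to exercise the sketch.
plan = build_plan("demo", "HDP-2.2", "placeholder://datatap/", ["HIVE", "PIG"])
for method, path, _body in plan:
    print(method, path)
```

Keeping the plan as plain data makes the ordering easy to inspect and lets the caller report progress after each request, which matches the slide's emphasis on user feedback. Before running such a plan, a provisioning tool would typically poll `GET /api/v1/hosts` and compare `registered_hosts(...)` against the VMs it created in Phase 1.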

Page 9: BlueData Integration with Apache Ambari

BlueData + Ambari: Big Data Infrastructure Made Easy

SPEED & AGILITY

SECURITY & CONTROL

EFFICIENCY & LOWER COST

70% Savings