Analyzing BigData with Machine Learning and Hadoop Clusters
Sudhir Rawat, Data Engineer, Microsoft (@rawatsudhir)
Azure HDInsight: managed, 100% Apache Hadoop
99.9% availability (Azure SLA)
Terabytes to petabytes (scale-out)
Deployed in minutes (within a few clicks)
Azure HDInsight running Windows/Linux
– Managed & supported by Microsoft
– Re-use common tools, documentation, samples from Hadoop/Linux ecosystem
– Add Hadoop projects that were authored on Linux to HDInsight
– Easier transition from on-premises to cloud
Business users access results from anywhere, on any device
Delivering Advanced Analytics
• HDInsight
• SQL Server VM
• SQL DB
• Blobs & Tables
Devices Applications Dashboards
Data Microsoft Azure Machine Learning
Storage space
Integrated development environment for Machine Learning: ML Studio
Business problem, Modeling, Deployment, Business value
• Desktop files
• Excel spreadsheet
• Other data files on PC
Cloud
Local
Data to model to web services in minutes
http://studio.azureml.net
Web
Clients
API
Model is now a web service
Monetize this API
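As a rough illustration of what "model is now a web service" means for a client, the sketch below builds (but does not send) an HTTP POST scoring request. The URL, API key, column names, and the `Inputs`/`GlobalParameters` payload shape are assumptions for illustration, not taken from the slides; the actual endpoint and schema come from the published service's own API help page.

```python
import json
import urllib.request


def build_scoring_request(url, api_key, column_names, rows):
    """Build (without sending) a POST request for a model exposed as a web service.

    The payload layout below is an assumed request-response shape; check the
    published service's documentation for the real schema.
    """
    payload = {
        "Inputs": {
            "input1": {
                "ColumnNames": list(column_names),
                # Values are sent as strings, one inner list per row to score.
                "Values": [[str(v) for v in row] for row in rows],
            }
        },
        "GlobalParameters": {},
    }
    headers = {
        "Content-Type": "application/json",
        "Authorization": "Bearer " + api_key,  # hypothetical key
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Hypothetical endpoint and features, for illustration only.
req = build_scoring_request(
    "https://example.invalid/score",
    "HYPOTHETICAL_KEY",
    ["age", "tenure_months"],
    [[42, 18]],
)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Sending the request would be a call to `urllib.request.urlopen(req)` against a real endpoint; it is left out here so the sketch runs without network access.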
Business Scenarios
Recommendations, customer churn, forecasting, etc.
Perceptual Intelligence
Face, vision, speech, text
Personal Digital Assistant
Cortana
Dashboards and Visualizations
Power BI
Machine Learning and Analytics
Azure Machine Learning
Azure Stream Analytics
DATA
Business apps
Custom apps
Sensors and devices
INTELLIGENCE ACTION
People
Automated Systems
Big Data Stores
Azure SQL Data Warehouse
Information Management
Azure Data Factory
Azure Data Catalog
Azure Event Hub
Azure Data Lake Store
Azure HDInsight (Hadoop)
Azure Data Lake Analytics
Conceptual Cortana Analytics Suite - Layer Stack
Collection, Transformation, Presentation and action
Event Queuing System
Long-term storage
Fleet Management – Data Flow
Search and query
Data analytics (Excel)
Web/thick client dashboards
Devices to take action
Event hub
Event producers
Applications
Web and social
Devices
Sensors
Live Dashboards
Apache HBase on HDInsight
DocumentDB, MongoDB, SQL, Solr, Azure Search
Cloud gateways (web APIs)
Field gateways
Kafka/RabbitMQ/ActiveMQ
Event hubs
Azure ML
Storage adapters
Stream processing
Apache Storm on HDInsight
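The data flow above (event producers, an event queuing system, stream processing, long-term storage, live dashboards) can be sketched in a few lines of plain Python. This is a toy in-memory stand-in, not Event Hubs or Storm code: the event fields (`vehicle`, `speed`), the window size, and the function name `run_pipeline` are all hypothetical.

```python
from collections import defaultdict, deque


def run_pipeline(events, window=3):
    """Toy collection -> transformation -> presentation flow, all in memory."""
    queue = deque(events)   # stand-in for the event queuing system
    archive = []            # stand-in for long-term storage
    # Keep only the last `window` speed readings per vehicle.
    recent = defaultdict(lambda: deque(maxlen=window))
    dashboard = {}          # stand-in for a live dashboard

    while queue:
        event = queue.popleft()
        archive.append(event)                  # every raw event is stored
        readings = recent[event["vehicle"]]
        readings.append(event["speed"])
        # Stream-processing step: rolling average speed per vehicle.
        dashboard[event["vehicle"]] = sum(readings) / len(readings)
    return archive, dashboard


# Hypothetical fleet telemetry, for illustration only.
events = [
    {"vehicle": "truck-1", "speed": 60},
    {"vehicle": "truck-2", "speed": 40},
    {"vehicle": "truck-1", "speed": 70},
    {"vehicle": "truck-1", "speed": 80},
    {"vehicle": "truck-1", "speed": 90},
]
archive, dashboard = run_pipeline(events)
print(dashboard)
```

In a real deployment each stand-in would be a separate service (a queue such as Event Hubs or Kafka, a stream processor such as Storm, a store such as HBase), and the raw-event archive and the dashboard aggregate would be written independently.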