Azure Machine Learning - A Full Journey
-
Upload
solidqit -
Category
Data & Analytics
-
view
271 -
download
0
Transcript of Azure Machine Learning - A Full Journey
#SQLSAT454
Azure Machine LearningA full journeyDavide [email protected]@mauridb
#SQLSAT454
Sponsors
#SQLSAT454
Davide Mauri
Microsoft SQL Server MVP Works with SQL Server from 6.5, on BI from 2003 Specialized in Data Solution Architecture, Database Design,
Performance Tuning, High-Performance Data Warehousing, BI, Big Data
President of UGISS (Italian SQL Server UG) Regular Speaker @ SQL Server events Consulting & Training, Mentor @ SolidQ
E-mail: [email protected] Twitter: @mauridb Blog: http://sqlblog.com/blogs/davide_mauri/default.aspx
#SQLSAT454
Agenda
Machine Learning, what’s that? Supervised & Unsupervised Methods Tool & Languages
Experimenting On-Premises Ipython & R
Azure Machine Learning AzureML Studio Notebooks
Integrating AzureML in custom applications Creating AzureML Services with Python and R
#SQLSAT454
MACHINE LEARNINGWhat’s that?
#SQLSAT454
Machine Learning
Algorithms that learn from data
Nothing really new from a scientific point of view "Field of study that gives computers the ability to learn
without being explicitly programmed“ - 1959, Arthur Samuel
Requires *a lot* of compute power (even for not-so-big-data) Azure, here we come!
#SQLSAT454
Machine Learning
*Very* useful for Identify unknown and complex pattern Identify hidden correlations Automatically classify data Predict future trend and/or values basing on past
knowledge
#SQLSAT454
Machine Learning
Thanks to the cloud it’s now possible to integrate ML Algorithms into Line-Of-Business applications Choose the algorithm Train it Expose as a RESTful Web Service Call it from you App You’re Happy
#SQLSAT454
Machine Learning
Two main categories (but sometimes are even divided in up to five categories!) Supervised Unsupervised
Supervised: humans (usually) teach to algorithms what is the expected result
Unsupervised: algorithms tries to autonomously identify patterns and rules in given dataset
#SQLSAT454
Languages
Most common languages used for machine learning R Python
Less common but on the rise Julia Scala Go Rust
#SQLSAT454
Tools - Python
Python Packages Scikit-Learn SciPy, NumPy, Pandas, Matplotlib, Seaborn
IPython (Jupyter) Anaconda
Pytools for Visual Studio
#SQLSAT454
Tools - R
R RStudio Anaconda
https://www.continuum.io/conda-for-r
#SQLSAT454
Datasets
To learn ML, sample and well-known datasets are needed
Here some places where nice Datasets can be found http://archive.ics.uci.edu/ml/datasets.html http://www.kdnuggets.com/datasets/index.html http://
homepages.inf.ed.ac.uk/rbf/IAPR/researchers/MLPAGES/mldat.htm
https://en.wikipedia.org/wiki/Data_set#Classic_datasets
#SQLSAT454
IRIS Dataset
150 instances of Iris Flowers 3 classes: Virginica, Versicolor, Setosa 4 features: Sepal Width & Length, Petal Width & Length
One of the most used for educational purposes Simple, but…. Un class is linearly separable Other two classes are NOT linearly separable
Available at UC Irvine Machine Learning Repository http://archive.ics.uci.edu/ml/datasets/Iris
#SQLSAT454
IRIS Dataset
http://www.anselm.edu/homepage/jpitocch/genbi101/diversity3Plants.html
#SQLSAT454
EXPERIMENTSOn-Premises
#SQLSAT454
AZURE ML STUDIOOn the cloud!
#SQLSAT454
AzureML Studio
www.azureml.com
Azure ML Studio Web application (“Workspace”) for developing ML solutions
Development Process Experiment Score Evaluate Publish
#SQLSAT454
EXPERIMENTSAzure
#SQLSAT454
AzureML Web Services
Can be created also on-premises with R and Python
#SQLSAT454
LOB APP INTEGRATIONMaking it worth for real-life business scenario
#SQLSAT454
Evaluations
Don’t forget to compile evaluations form here http://speakerscore.com/sqlsat454
#SQLSAT454
THANKS!
#sqlsat454