Advanced Analytics in SQL 2016 -...
Transcript of Advanced Analytics in SQL 2016 -...
![Page 1: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/1.jpg)
1
Advanced Analytics
in SQL 2016
Adastra
Pavel Stejskal, Consultant
linkedin.com/in/pavelstejskal
20.4.2016
![Page 2: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/2.jpg)
AdvancedAnalytics
2
Excel + Power BI add-insQuery, Pivot, View, Map
SharePointPower Pivot Gallery, Power View
ExcelData Mining
Power BI Desktop Power BI Portal
Azure ML
Power BI Mobile App
Analytics Platform System(APS)
Enterprise-class big data analytics platform for R
![Page 3: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/3.jpg)
3
SQL Server R Services
![Page 4: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/4.jpg)
In-database Advanced Analytics
4
Build intelligent applications with SQL Server R Services
R built-in to SQL Server
Mission critical OLTP
![Page 5: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/5.jpg)
What is R ?
5
• A programming language for statistics, analytics, and data
science
• A data visualization framework
• Provided as Open Source
• Used by 2.5M+ data scientists, statisticians and analysts
• Taught in most university statistics programs
• New and recent graduates prefer it
• Active and thriving user groups across the world
• CRAN: 7000+ freely available algorithms, test data and
evaluation
• Many of these are applicable to big data if scaled
![Page 6: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/6.jpg)
R basics
6
Easy syntax
Strong graphic capabilities
+
Large collection packages
in CRAN repository
![Page 7: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/7.jpg)
R – main challenges for Enterprise environment
7
• Single threaded
• Memory limitations
• Data transfers between R and DB storage
• How to deploy model into production
2015 Acquisition of Revolution Analytics
![Page 8: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/8.jpg)
Microsoft R Server (Revolution R Enterprise before)
8
Enterprise-class big data analytics platform for R
Microsoft R Server
Microsoft R Server is an enterprise-class big data analytics platform for R .
Supporting a variety of big data statistics, predictive modeling and machine learning capabilities, R Server supports the full range of analytics – exploration, analysis, visualization and modeling based on Open Source R.
By leveraging and extending open source R, R Server is fully compatible with R scripts, functions and CRAN packages, while extending R to analyze data at enterprise scale.
Microsoft R Server provides a unique opportunity to deliver our advanced analytics capabilities to customers who have already invested in storing their data on non Microsoft platforms like Hadoop, Teradata and Linux
![Page 9: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/9.jpg)
SQL Server Enterprise Edition R Services – how it works
9
SQL Server 2016 Enterprise Edition
Integration Facilities:
• Component Integration• Launchers• Parameter Passing• Results Return• Console Output
Return• Parallel Data Exchange
(RTM)• Stored Procedures• Package Administration
SQL Server
Query
Processor Fast, Parallel, Storage Efficient Algorithms
Open Source R Interpreter
![Page 10: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/10.jpg)
Advanced Analytics scenarios suitable for R
10
EXAMPLE SOLUTIONS
DEMO
![Page 11: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/11.jpg)
Run Parallel Algorithms In-Database
11
SQL
In-Database Execution:
Remote Execution
Parallelized ComputeSQL
Server
Remote
Execution
Context
Explore and Model:
In Parallel In-Database
Parallelize Distributable R and CRAN
Operationlize:
Score In Parallel
Move
BIG
Work to
the Data
Large Data Sets in Chunks
Parallel
Algorithm
Iterate/ Sequence
![Page 12: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/12.jpg)
How Does Remote Execution Work?
12
Algorithm
Master
Big
Data
Predictive
Algorithm
Analyze
Blocks In
Parallel
Load Block
At A Time
Distribute Work,
Compile Results
The Results:
• Even Faster Computation
• Larger Data Set Capacity
• Fewer Security Concerns
• No Data Movement, No Copies
“Pack and Ship” Requests
to Remote Environments
![Page 13: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/13.jpg)
13
Demo
![Page 14: Advanced Analytics in SQL 2016 - download.microsoft.comdownload.microsoft.com/documents/cs-cz/enterprise... · 4/20/2016 · Advanced Analytics in SQL 2016 Adastra Pavel Stejskal,](https://reader033.fdocuments.net/reader033/viewer/2022042110/5e8a66849f78eb188775a838/html5/thumbnails/14.jpg)
ADASTRA CZECH REPUBLICAdastra, s.r.o.
Karolinská 654/2, 186 00 Praha 8
Tel.: +420 271 733 303
www.adastra.cz
ADASTRA GROUP North America8500 Leslie St.
Markham, Ontario, L3T 7M8
Tel: +1 905 881 7946
Restrictions for public release and use:This document can comprise confidential information. As such it may not, without Adastra’s prior consent, be copied or transferred.
Important:All brands and names of products given in this documentation are or can be registered trademarks of their owners.© 2016 Adastra, all rights reserved.
14
Thank you!