Analytics in Action transforming the way we use and ... · Rule Detection Simulations(Stress Tests)...
Transcript of Analytics in Action transforming the way we use and ... · Rule Detection Simulations(Stress Tests)...
Analytics in Action – transforming the way we use and consume information
Big Data Ecosystem – The Data
BIG DATA Repositories
Hadoop
MPP Appliances
Traditional Data
Internet
Data Streaming
Event Stream Processing
Big Data Ecosystem – Data Management
DATA MANAGEMENT ENVIRONMENT
BIG DATA RepositoriesETL/ELT
Engines
Hadoop
MPP Appliances
Data Profiling
Integration and Data
Quality Rules
Crawlers
Metadata
Data Governance
Permissions
Administration
Traditional Data
Internet
Data Streaming
Event Stream Processing
Big Data Ecosystem – Analytics
DATA MANAGEMENT ENVIRONMENT ANALYTICS ENVIRONMENT
BIG DATA Repositories Analytics EnginesETL/ELT
Engines
Business Rules and Analytical Models
Exploration / Modeling
Automation
Data
Virtualization
Hadoop
MPP Appliances
Data Profiling
Integration and Data
Quality Rules
Crawlers
In-
Memory
Rule Detection
Simulations(Stress Tests)
Social Network
Analysis
Data Exploration
Data & Text Mining
Forecasting
Risk Analysis (VAR)
Metadata
Data Governance
Permissions
Administration
Traditional Data
Internet
Data Streaming
Event Stream Processing
The Complete Big Data Ecosystem
MANUAL RESULTS
AUTOMATED
RESULTS
REAL TIME RESULTS
DATA MANAGEMENT ENVIRONMENT ANALYTICS ENVIRONMENT
BIG DATA Repositories Analytics EnginesETL/ELT
Engines
Business Rules and Analytical Models
Exploration / Modeling
Automation
Data
Virtualization
Hadoop
MPP Appliances
Data Profiling
Integration and Data
Quality Rules
Crawlers
In-
Memory
Alerts and
Notifications
Automated reports
Forecasting
Recommendations
Network Services
Ad- Hoc Reports
Analyses
Data Visualization
Rule Detection
Simulations(Stress Tests)
Social Network
Analysis
Data Exploration
Data & Text Mining
Forecasting
Risk Analysis (VAR)
Metadata
Data Governance
Permissions
Administration
Traditional Data
Internet
Data Streaming
Continuity of Business
• Make it relatively seamless for a SAS programmer
to treat Hadoop like any other data source
Bring SAS processing to the Data
• Move SAS closer to the data that is embedded
process and LASR
Leverage Hadoop for New Technology offerings
• New solutions built on Hadoop and LASR
SAS offers the widest breath and depth of modern
analytic methods
• Solution for advanced analytics on Hadoop
SAS and Hadoop
Technology Focus – SAS and Hadoop
Hadoop as a Data Platform(standalone or as part of a
broader ecosystem)
Hadoop as a component of
the next generation of
Business Analytics
SAS Grid Manager for Hadoop
Workload Management
High Availability
Parallel Processing
HADOOP
Most SASProcedures
SASClient
SAS/Grid Manager for Hadoop - Treat a Hadoop Cluster as a Grid for MVA SAS using Yarn
- Push SAS procedure processing to Hadoop
TEXT
MANAGE
DATA
EX
PL
OR
E
DA
TA
DEVELOP
MODELS
DE
PL
OY
&
MO
NIT
OR
• SAS Data Loader for Hadoop
• SAS Data Management (incl.
SAS/ACCESS)
• SAS Federation Server
• SAS Event Stream Processing• SAS Visual Analytics
• SAS In-memory Statistics
• SAS Visual Statistics
• SAS In-memory Statistics
• SAS High-Performance
Analytics Products
• SAS Scoring Accelerator for
Hadoop
Enabling The Entire Analytics Lifecycle Around Hadoop
Prepare data IN
Hadoop for analytics
SAS Data Loader for Hadoop
Move data FROM
Hadoop into a SAS
environment
vAPP
SAS Data Loader
(Web App)
SAS/Access to
Hadoop
Hadoop Cluster
Hadoop (1 Node)
SAS Data Loader for Hadoop Logical Architecture
SAS
Embedded
Process
SAS Code Accelerator
for Hadoop
SAS Data Quality
Accelerator for Hadoop
SAS LASR In-Memory
Analytic Server (Optional)
Query
Filter
Transform
De-duplicate
Profile
Cleanse
Join
Load
(Browser)
Text FilesOracle
Other RDBMS
SAS
Business User
(Analysts)
Use wizard-
based
“directives”
(no coding)
Deployed
With EP
Data Quality
Accelerator
Code Accelerator
Write SAS
Code or
HiveQL
User Profile Action Data Loader
(vApp )
SAS Embedded Process
Data Scientists/IT
(SAS coder, ETL
developer)
• Interpret
directives or
code
• Generate code
or HiveQL as
needed
• Send code to
Accelerators,
queries to Hive
HDFS
Hadoop
cluster
SAS Data Loader for Hadoop User Profiles
Take Real-time Action
• Decisive reaction to complex patterns and events
as they happen
Apply Multi-Phase Analytics
• Advanced analytics and multi-phase processing
Focus on Relevant Data
• Continuous loading of relevant streaming data
Streaming Analytics
Technology Focus – Streaming Analytics
Edge
Analytics
In-Motion
Analytics
At-Rest
Analytics
Network Systems, Surveillance
Monitor equipment on the
platform for failures and safety
issues, and take action.
Identify fraudulent
transactions and be
alerted in real-time.
Intelligently integrate customer
information with real-time
streaming data
Strategic Data IntegrationTransactions, Logs, Clickstreams
SAS-generated
Insights
Event Actions
SAS In-Memory
SAS®
Event Stream Processing Model
Continuous
Query
Pu
bli
sh
Su
bs
cri
be
Streaming Events
Enrichment
Data
Analytic
Models
Busines
s Rules
Streaming Analytics – Conceptual Overview
Unlimited Data Volumes
• Embrace the potential
Speed
• The ability to fail fast
Democratized Analytics
• Cater to the citizen data scientist
Approachable Analytics
Technology Focus – Approachable Analytics
Data
ManipulationReporting Exploration Modeling
STATISTICIAN / DATA SCIENTIST
PROGRAMMINGBUSINESS ANALYST
Gartner’s predicts that through 2017, the number of citizen data scientists will grow five times faster than the
number of highly skilled data scientists
“A person who creates models that use predictive or prescriptive analytics, but whose primary job function is outside of the field of statistics
and advanced analytics. They are "power users" who will be able to perform simple and moderately sophisticated analytic applications that
would previously have required more expertise. They often reside in the lines of business and have deep domain expertise” - Gartner Inc.
Sophisticated Analytics For Everyone
SAS® LASR ANALYTIC SERVER
SAS® IN-MEMORY
SAS® IN-MEMORY
SAS® IN-MEMORY
SAS® IN-MEMORY
SAS® IN-MEMORY
HADOOP & DW WEB CLIENTSAPPLICATIONS
SAS Visual Analytics
SAS Visual Statistics
ERP
SCM
CRM
Images
Audio
and Video
Machine
Logs
Text
fWeb and
Social
Approachable Analytics High Level Architecture
SAS Visual Data Builder
SAS Visual Scenario Designer
…
Decisions at Scale
Automate
• Build, monitor, and evaluate models using modern
methodologies
Empowerment
• Enable decision makers everywhere backed by
powerful analytics
Confidence
• Ensure analytic solutions are repeatable, reliable,
timely, and relevant across the enterprise
Technology Focus – Decisions at Scale
Deployment Considerations
Data Environment
Score
Output Rules
Models Rules
SCORE
CODE
db compliant
instructions
.99. 1.0, 500
Operationalizing Decision Making
SAS® Visual Analytics
SAS® Visual Statistics
SAS® In-Memory Statistics
SAS® Enterprise Miner / SAS®Factory Miner
SAS® Decision Manager / SAS®
Scoring Accelerator
GUI driven reporting,
visualization and
interactive data
exploration with
analytics
GUI driven analytic
model development and
evaluation
Programmatic data
wrangling, model
development and
evaluation
Robust production
modelling tools that
provide for repeatability
and easy
operationalization
Capabilities to deploy,
monitor and automate
analytics with
appropriate business
rules into operational
business processes
Visualize, explore, interact, explain, understand, democratize, prototypeFinalize, deploy, integrate, execute,
operationalize, industrialize
Approachable Analytics Decisions at Scale
How Does This All Fit Together?
Event Stream Processing
Big Data Ecosystem – Build On Existing Technology
MANUAL RESULTS
AUTOMATED
RESULTS
REAL TIME RESULTS
DATA MANAGEMENT ENVIRONMENT ANALYTICS ENVIRONMENT
BIG DATA Repositories Analytics EnginesETL/ELT
Engines
Business Rules and Analytical Models
Exploration / Modeling
Automation
Data
Virtualization
Hadoop
MPP Appliances
Data Profiling
Integration and Data
Quality Rules
Crawlers
In-
Memory
Alerts and
Notifications
Automated reports
Forecasting
Recommendations
Network Services
Ad- Hoc Reports
Analyses
Data Visualization
Rule Detection
Simulations(Stress Tests)
Social Network
Analysis
Data Exploration
Data & Text Mining
Forecasting
Risk Analysis (VAR)
Metadata
Data Governance
Permissions
Administration
Traditional Data
Internet
Data Streaming
SCALE YOUR DATA ANDYOUR ANALYTICS
GROW A CULTURE OF INNOVATION
ANALYZE ALL OFYOUR DATA
MODERNIZE YOUR LEGACY BI
STRATEGY
Data
Discovery Deployment
Analytics in Action - SAS Can Help