Transitioning from Traditional DW to Apache® Spark™ in Operating Room Predictive Modeling


Transitioning from Traditional DW to Spark in OR Predictive Modeling

Ayad Shammout and Denny Lee
October 21st, 2015

About Ayad Shammout

• Director of Business Intelligence, Beth Israel Deaconess Medical Center

• Helped build the business intelligence and highly available / disaster recovery infrastructure for BIDMC


About Denny Lee

• Technology Evangelist, Databricks

• Former Sr. Director of Data Sciences Eng, Concur

• Helped bring Hadoop onto Windows and Azure


We are Databricks, the company behind Spark

Founded by the creators of Apache Spark in 2013

Share of Spark code contributed by Databricks in 2014: 75%


Created Databricks on top of Spark to make big data simple.

Why is Operating Room Scheduling Predictive Modeling Important?


$15-$20 per minute for a basic surgical procedure (an idle 10-hour block therefore represents roughly $9,000-$12,000 of OR capacity)

Time is an OR's most valuable resource

Lack of OR availability means loss of patients

OR efficiency depends on OR staffing and allocation (8, 10, 13, or 16 hours), not on the workload (i.e., the number of cases)


“You are not going to get the elephant to shrink or change its size. You need to face the fact that the elephant is 8 ORs tall and 11 hours wide.”

Steven Shafer, MD


Operating Room
• Better utilization = better profit margins
• Reduced support and maintenance costs

Medical Staff
• Better utilization = better profit margins
• Better medical staff efficiencies = better outcomes

Patients
• Shorter wait times and fewer cancellations
• Better medical staff efficiencies = better outcomes

Develop Predictive Model

• Develop a predictive model that would identify available OR time 15 business days in advance.

• Allows us to confirm wait-list cases two weeks in advance, instead of when the blocks normally release four days out.


Forecast OR Schedule

• Case load 15 business days in advance

• Book more cases weeks in advance to prevent under-utilization

• Reduce staff overtime and idle time


Background

• Three surgical groups:
  • GYN, urology, general surgery, colorectal, surgical oncology
  • Eyes, plastics, ENT
  • Orthopedics, podiatry
• Currently built using SQL Server Data Mining


Using Traditional Data Warehousing Techniques

Traditional data warehousing & data mining OR predictive model:

Data Sources → OR DW → SSAS Data Mining → OR Prediction DB → OR Reports

• Data inserts into the OR DW every 3 hours
• Mining model processed every 3 hours
• Prediction results written to the OR Prediction DB for reporting


Original Design

• Multiple data sources pushing data into SQL Server and SQL Server Analysis Services (SSAS) Data Mining

• Hand-built 225 different data mining models (5 days × 15 business days ahead × 3 surgical groups)

• Pipeline process had to run 225 times per day (3 pools × 75 models)


Regression Calculations

Statistics computed in both SSAS Data Mining and T-SQL code:

• Intercept
• Mean
• Coefficients
• Variance
• R²
• Adjusted R²
• Standard Deviation
• Standard Error

Taking advantage of Spark’s DW Capabilities and MLlib

OR predictive model in Spark:

Data Sources → OR DW → OR Predictive Model in Spark → OR Reports

• Data inserts every 3 hours

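To make the Spark-side pipeline concrete, here is a minimal sketch of the 3-hourly load path, assuming a Spark 1.x SQLContext (current at the time of this talk) and hypothetical paths, table, and column names (/landing/..., /warehouse/..., or_cases, surgical_group, block_minutes); the talk itself does not show this code:

```scala
// Minimal sketch (Spark 1.x, hypothetical paths and names): land each
// 3-hour extract as Parquet, then expose it to Spark SQL for reporting.
// `sqlContext` is the SQLContext provided by the shell or notebook.
val batch = sqlContext.read.json("/landing/or_cases/latest/")
batch.write.mode("append").parquet("/warehouse/or_dw/cases")

// Register the warehouse table for ad hoc OR reporting queries.
sqlContext.read.parquet("/warehouse/or_dw/cases").registerTempTable("or_cases")
val utilization = sqlContext.sql(
  """SELECT surgical_group, SUM(block_minutes) AS booked_minutes
     FROM or_cases
     GROUP BY surgical_group""")
```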

Demo: OR Block Scheduling
Extract history data and run linear regression with SGD over multiple variables (see the sketch below).
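A minimal sketch of what the demo’s training step could look like with Spark 1.x MLlib’s LinearRegressionWithSGD; the or_history table and its feature columns are assumptions for illustration, not the actual notebook from the talk:

```scala
import org.apache.spark.mllib.linalg.Vectors
import org.apache.spark.mllib.regression.{LabeledPoint, LinearRegressionWithSGD}

// Hypothetical history table: the label is OR minutes actually used,
// the features are scheduling variables known 15 business days ahead.
val history = sqlContext.sql(
  "SELECT used_minutes, day_of_week, block_hours, days_ahead FROM or_history")

// Turn each row into a labeled point (assumes numeric columns).
val points = history.map { row =>
  LabeledPoint(
    row.getDouble(0),
    Vectors.dense(row.getDouble(1), row.getDouble(2), row.getDouble(3)))
}.cache()

// Multivariate linear regression trained with stochastic gradient descent:
// 100 iterations, step size 0.01.
val model = LinearRegressionWithSGD.train(points, 100, 0.01)

// Predict used minutes for one upcoming block (e.g. Tuesday, 10 h, 15 days out).
val predictedMinutes = model.predict(Vectors.dense(2.0, 10.0, 15.0))
```

With SGD, step size and feature scaling matter; in practice the features would be standardized before training.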

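MLlib can also report most of the regression statistics that the Regression Calculations slide above shows being computed by hand in T-SQL. A sketch, reusing `model` and `points` from the training sketch:

```scala
import org.apache.spark.mllib.evaluation.RegressionMetrics

// Pair each prediction with its observed label.
val predictionAndLabel = points.map(p => (model.predict(p.features), p.label))
val metrics = new RegressionMetrics(predictionAndLabel)

println(s"R2        = ${metrics.r2}")
println(s"RMSE      = ${metrics.rootMeanSquaredError}")
println(s"Intercept = ${model.intercept}")  // 0.0 unless the trainer fits an intercept
println(s"Weights   = ${model.weights}")    // per-feature coefficients
```

Adjusted R² and standard errors are not built into RegressionMetrics and would still take a few extra lines of arithmetic.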

OR Schedule Report (example)


Why the model is working

• Can coordinate waitlist scheduling logistics with physicians and patients within two weeks of the surgery

• Plan staff scheduling and resources so there are fewer last-minute staffing issues for nursing and anesthesia

• Utilization metrics are showing us where we can maximize our elective surgical schedule and level demand

Key Learnings when Migrating from Traditional DW to Spark


Transitioning to the CloudBeth Israel Deaconess Medical Center is increasingly moving to cloud infrastructure services with the hopes of closing its data center when the hospital's lease is up in the next five years. CIO John Halamka says he's decommissioning HP and Dell servers as he moves more of his compute workloads to Amazon Web Services, where he's currently using 30 virtual machines to test and develop new applications. "It is no longer cost effective to deal with server hosting ourselves because our challenge isn't real estate, it's power and cooling," he says.


Transitioning to the Cloud

• Need time for engineers, analysts, and data scientists to learn how to build for the cloud

• Build for security right from the start: process-heavy, with a lot of documentation, audits, and reviews

• Differentiate between data engineers and software engineers (REST APIs, services, elasticity, etc.)


Transitioning to Spark

• No more stored procedures or indexes
  • Good for Spark SQL and services design
• Prototype, prototype, prototype
• Leverage existing languages and skill sets
• Leverage the MOOCs and other Spark training
• Break down the silos of data engineers, engineers, data scientists, and analysts


Transitioning DW to Spark

• Understand partitioning, broadcast joins, and Parquet (see the sketch after this list)

• Not all Hive functions are available in Spark’s HiveContext (99% of the time that is okay)

• Don’t limit yourself to building star schemas / snowflake schemas

• Expand outside of traditional DW: machine learning, streaming
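As an illustration of the partitioning and broadcast-join points, a sketch with hypothetical warehouse paths and column names (the broadcast hint lives in org.apache.spark.sql.functions as of Spark 1.5):

```scala
import org.apache.spark.sql.functions.broadcast

// Hypothetical tables: a large fact table of OR cases and a small room
// dimension. Broadcasting the small side ships it to every executor,
// so the large table is joined without a shuffle.
val cases = sqlContext.read.parquet("/warehouse/or_dw/cases")  // large fact table
val rooms = sqlContext.read.parquet("/warehouse/or_dw/rooms")  // small dimension
val joined = cases.join(broadcast(rooms), "room_id")

// Partition the Parquet output on a commonly filtered column so later
// queries prune whole directories instead of scanning everything.
joined.write.partitionBy("surgical_group").parquet("/warehouse/or_dw/cases_by_group")
```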

Thank you. For more information, please contact ayad.shammout@hotmail.com or denny@databricks.com.