"Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr....
![Page 1: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/1.jpg)
Machine Learning and Internet of Things The Future of Medical Prevention
![Page 2: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/2.jpg)
Introduction
![Page 3: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/3.jpg)
Dataiku
• Founded in 2013 • 60+ employees • Paris, New York, London, San Francisco
Data science software company, maker of Dataiku DSS
DESIGN
• PREPARE: Load and prepare your data
• MODEL: Build your models
• ANALYSE: Visualize and share your work
PRODUCTION
• AUTOMATE: Re-execute your workflow with ease
• MONITOR: Follow your production environment
• SCORE: Get predictions in real time
![Page 4: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/4.jpg)
Epilepsy Stats and figures
1-3% of the population
15.5 billion euros / year spent treating seizures
6 types of epilepsy
Dozens of existing treatments
Days to weeks of hospital time required to diagnose
![Page 5: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/5.jpg)
Epilepsy + IoT Faster, more comfortable diagnosis
![Page 6: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/6.jpg)
EEG Seizures on an EEG
![Page 7: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/7.jpg)
EEG Spikes on an EEG
![Page 8: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/8.jpg)
Goals Improve Epilepsy Diagnosis
1. Allow at-home EEG recording via wearable device
2. Detect seizures automatically
3. Detect spikes automatically
4. Shorten time-to-diagnosis for patients with epilepsy
![Page 9: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/9.jpg)
Ageing Stats and figures
x3 over the last 60 years
12 million fall every year in the U.S.
700 million people older than 60 in 2006
28.5% of this population live alone in the EU
Third leading cause of death: strokes
3/4 of all strokes happen to people over 65
![Page 10: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/10.jpg)
Safe Aging with Sphere Improve Falling Detection
Credit Aakansh Gupta : http://datascience.blog.uhuru.co.jp/machine-learning/safe-aging-with-iot-and-machine-learning/
![Page 11: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/11.jpg)
Goals Improve Falling Detection
1. Predict falls and detect strokes so that help may be summoned
2. Analyse eating behaviour, including whether people are taking prescribed medication
3. Detect periods of depression or anxiety and intervene using a computer-based therapy
![Page 12: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/12.jpg)
A data science workflow Six steps to a predictive model
Business/Problem Understanding
1. Data Acquisition
2. Data Exploration & Understanding
3. Data Preparation
4. Model Creation
5. Evaluation
6. Deployment
Inputs: Dataset 1, Dataset 2, Dataset n
Outputs: Scored dataset, Model as an API
Iteration 1, Iteration 2, Iteration n
Creating a predictive model is a highly iterative process. Data Science Studio enables its users to create and manage these projects from end to end. This process is not industry specific and can be applied to many use cases.
Adapted from the CRISP-DM methodology
![Page 13: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/13.jpg)
Data Acquisition / Preparation
![Page 14: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/14.jpg)
The concept We’re taking a digital sample of an analog signal
Data Collected True Signal
![Page 15: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/15.jpg)
The Data
1024 Hz x 24 channels = 353 MB / (hour x patient)
20 patients x 24 hours = 170 GB
Nightly transfers of data from device to cloud (via Wi-Fi)
We want to scale to hundreds of patients with days of data
Epilepsy
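The volume figures on this slide can be checked with a few lines of arithmetic. The sketch below assumes each sample is stored in 4 bytes (for example a 32-bit float); the slide does not state the sample width, but that assumption lands close to its numbers (roughly 354 MB per patient-hour and 170 GB in total, in decimal units).

```python
SAMPLE_RATE_HZ = 1024
CHANNELS = 24
BYTES_PER_SAMPLE = 4  # assumption: not stated on the slide


def bytes_per_patient_hour(rate_hz=SAMPLE_RATE_HZ, channels=CHANNELS,
                           width=BYTES_PER_SAMPLE):
    """Raw storage needed for one hour of recording from one patient."""
    return rate_hz * channels * width * 3600


def total_bytes(patients, hours):
    """Raw storage for a cohort recorded for the given number of hours each."""
    return bytes_per_patient_hour() * patients * hours


per_hour_mb = bytes_per_patient_hour() / 1e6
total_gb = total_bytes(patients=20, hours=24) / 1e9
print(f"{per_hour_mb:.0f} MB per patient-hour")
print(f"{total_gb:.0f} GB for 20 patients x 24 hours")
```

Changing `patients` and `hours` gives a quick estimate for the "hundreds of patients with days of data" target.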
![Page 16: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/16.jpg)
The Data
• Accelerometer - Sampled at 20 Hz
• RGB-D - Bounding box information
• Environmental - The values of passive infrared (PIR) sensors
Safe Aging
![Page 17: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/17.jpg)
The Data Needs
• Interpolation: missing data, synchronization failures
• Smart sampling: zoom at different frequency levels
• Different sensors -> different frequencies -> how to merge?
• Aggregation
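One possible answer to the "how to merge?" question is to aggregate every sensor onto a shared time grid before joining. The sketch below is a minimal illustration, not the speaker's implementation: the sensor names, the millisecond timestamps, the 500 ms rate of the second sensor and the 1-second bucket are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def resample_mean(samples, bucket):
    """Average (timestamp, value) pairs into fixed-width time buckets."""
    buckets = defaultdict(list)
    for t, v in samples:
        buckets[t // bucket * bucket].append(v)
    return {start: mean(vals) for start, vals in sorted(buckets.items())}


def merge_on_grid(series_by_name, bucket):
    """Resample every sensor to the same grid, then join on the bucket start."""
    resampled = {name: resample_mean(s, bucket)
                 for name, s in series_by_name.items()}
    grid = sorted(set().union(*(r.keys() for r in resampled.values())))
    return [{"t": t, **{name: r.get(t) for name, r in resampled.items()}}
            for t in grid]


# Hypothetical data: one sensor every 50 ms (20 Hz), another every 500 ms
fast = [(t, float(t % 200)) for t in range(0, 2000, 50)]
slow = [(t, 1.0) for t in range(0, 2000, 500)]
rows = merge_on_grid({"accelerometer": fast, "pir": slow}, bucket=1000)
```

A sensor with no reading in a bucket shows up as `None` in that row, which is where interpolation would come in.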
![Page 18: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/18.jpg)
Time Series as Relational Data
Time Stamp 10001 10002 10003 10004 10005
Sensor1 40 - - 43 42
Sensor2 - 50 55 20 -
Sensor3 30 34 60 - 40
Aggregation
Resampling
Interpolation
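To make the interpolation step concrete, here is a small sketch that fills the gaps in the table above. It uses plain linear interpolation between known neighbours and leaves leading or trailing gaps empty; that policy is an assumption chosen for illustration, not something the slides specify.

```python
def interpolate_linear(timestamps, values):
    """Fill interior None gaps by linear interpolation; leave edge gaps as None."""
    known = [(t, v) for t, v in zip(timestamps, values) if v is not None]
    filled = list(values)
    for (t0, v0), (t1, v1) in zip(known, known[1:]):
        for i, t in enumerate(timestamps):
            if t0 < t < t1:
                filled[i] = v0 + (v1 - v0) * (t - t0) / (t1 - t0)
    return filled


timestamps = [10001, 10002, 10003, 10004, 10005]
table = {
    "Sensor1": [40, None, None, 43, 42],
    "Sensor2": [None, 50, 55, 20, None],
    "Sensor3": [30, 34, 60, None, 40],
}
filled = {name: interpolate_linear(timestamps, vals)
          for name, vals in table.items()}
```

Sensor2 keeps its empty first and last cells because there is no neighbour on one side to interpolate from.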
![Page 19: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/19.jpg)
Time Series as JSON
{ "sensor1": { "10001": 40, "10004": 43, "10005": 42 }, "sensor2": { "10002": 50, "10003": 55, "10004": 20 } … }
Aggregation
Resampling
Interpolation
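The JSON layout stores only the readings that exist, so it has to be pivoted before it looks like the relational table. A minimal sketch, assuming the timestamps are written as quoted strings (JSON object keys must be strings) and using only the two sensors spelled out on the slide:

```python
import json

doc = json.loads("""
{
  "sensor1": {"10001": 40, "10004": 43, "10005": 42},
  "sensor2": {"10002": 50, "10003": 55, "10004": 20}
}
""")


def to_rows(doc):
    """Pivot {sensor: {timestamp: value}} into one row per timestamp."""
    timestamps = sorted({int(t) for readings in doc.values() for t in readings})
    return [
        {"timestamp": t,
         **{sensor: readings.get(str(t)) for sensor, readings in doc.items()}}
        for t in timestamps
    ]


rows = to_rows(doc)
```

Each row has one key per sensor, with `None` where that sensor has no reading at that timestamp.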
![Page 20: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/20.jpg)
Time Series as Time Series Time Series Database
Aggregation
Resampling
Interpolation
![Page 21: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/21.jpg)
Signal Processing Lots of libraries, lots of options
• Rename • Generate • Rolling mean • Rolling max • Rolling min • Rolling median • Wavelet decomposition • STL decomposition • Peak detection • Low pass filter • High pass filter • Convolution • Correlation • Short-time FFT
Implemented with common interface
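As an illustration of what a common interface might look like, the sketch below gives a few of the listed operations the same shape: each takes a list of values and returns a list of the same length. The function names, the window size and the peak rule are hypothetical, and only the rolling statistics and a naive peak detector are covered.

```python
from statistics import mean, median


def rolling(values, window, reducer):
    """Apply `reducer` over a trailing window; output length matches input."""
    return [reducer(values[max(0, i - window + 1): i + 1])
            for i in range(len(values))]


def find_peaks(values, threshold):
    """Indices of strict local maxima above `threshold`."""
    return [
        i for i in range(1, len(values) - 1)
        if values[i] > threshold and values[i - 1] < values[i] > values[i + 1]
    ]


# Every transform shares one signature: values in, values out
TRANSFORMS = {
    "rolling_mean": lambda v: rolling(v, 3, mean),
    "rolling_max": lambda v: rolling(v, 3, max),
    "rolling_min": lambda v: rolling(v, 3, min),
    "rolling_median": lambda v: rolling(v, 3, median),
}

signal = [0, 1, 0, 5, 0, 1, 9, 1, 0]
features = {name: fn(signal) for name, fn in TRANSFORMS.items()}
peaks = find_peaks(signal, threshold=4)
```

Because the transforms are interchangeable, adding a filter or a decomposition means registering one more entry rather than changing the calling code.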
![Page 22: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/22.jpg)
Model Creation and Evaluation
![Page 23: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/23.jpg)
Machine learning Features
• Descriptive features
Epilepsy: Patient information: EMR…
Safe aging: EMR, age, height
• Time series features
Epilepsy: Current values, previous values, correlations, Fourier, Wavelets, …
Safe aging: Current values, previous values, rolling averages, …
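The "current values, previous values, rolling averages" features can be sketched as follows. This is an illustrative construction only: the number of lags, the window length and the feature names are assumptions, not taken from the talk.

```python
from statistics import mean


def make_features(values, n_lags=2, window=3):
    """Build one feature row per time step that has a full history."""
    start = max(n_lags, window - 1)
    rows = []
    for i in range(start, len(values)):
        row = {"current": values[i]}
        for k in range(1, n_lags + 1):
            row[f"lag_{k}"] = values[i - k]
        row["rolling_mean"] = mean(values[i - window + 1: i + 1])
        rows.append(row)
    return rows


readings = [3.0, 6.0, 9.0, 12.0, 15.0]
rows = make_features(readings)
```

The first time steps are dropped because they do not yet have enough history for the lags and the window.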
![Page 24: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/24.jpg)
Machine learning Features
• More data means less feature engineering
Safe aging (lots of values): XGBoost on current and previous values. Let the model find the interactions
Epilepsy (millions of lines): RNN, LSTM. Network architecture = feature engineering
![Page 25: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/25.jpg)
Training, Testing, Validation 4 patients, 4 readings from each patient
![Page 26: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/26.jpg)
Split 1: Awesome performance 4 patients, 4 readings from each patient
Training Testing
AUC = 0.94
![Page 27: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/27.jpg)
Split 2: OK performance 4 patients, 4 readings from each patient
Training Testing
AUC = 0.70
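For readers less familiar with the metric quoted on these slides: AUC is the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one. The sketch below computes it by direct pairwise comparison, which is fine for small sets; the labels and scores are made-up values, not data from the talk.

```python
def auc(labels, scores):
    """Probability that a random positive is scored above a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    # Ties count as half a win
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auc(labels, scores))
```

A value of 0.5 corresponds to random ranking and 1.0 to a perfect ranking.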
![Page 28: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/28.jpg)
Worries About our spike detection model
Poor generalization to new patients
What about new devices?
What about different doctors creating annotations?
Solution: more patients, more doctors, more devices
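The gap between the two splits is consistent with a common pitfall, although the transcript does not preserve the split diagrams, so the following is an assumption: if readings from the same patient appear in both training and test sets, the score reflects how well the model recognises known patients, while holding out whole patients measures generalization to new ones. A minimal sketch with hypothetical identifiers:

```python
import random

# 4 patients x 4 readings, as on the slides; identifiers are hypothetical
records = [(f"patient_{p}", f"reading_{r}")
           for p in range(1, 5) for r in range(1, 5)]


def split_by_reading(records, test_fraction=0.25, seed=0):
    """Random split over readings: a patient can appear on both sides."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def split_by_patient(records, test_patients):
    """Hold out whole patients: the test set contains only unseen patients."""
    train = [r for r in records if r[0] not in test_patients]
    test = [r for r in records if r[0] in test_patients]
    return train, test


train, test = split_by_patient(records, test_patients={"patient_4"})
```

Libraries such as scikit-learn offer grouped splitters (for example `GroupKFold`) that apply the same idea across several folds.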
![Page 29: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/29.jpg)
Worries About our position detection model
Average generalization to new patients
What about new devices?
What about different homes / rooms?
Data solution: more patients, more houses, more devices
Practical solution: warm start with house + person. Expensive
![Page 30: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/30.jpg)
Deploy
![Page 31: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/31.jpg)
Deploy
• Model Deployment
Epilepsy diagnosis: Batch scoring on all records after X days
Epilepsy spike detection: Batch scoring (used for diagnosis)
Epilepsy seizure detection: Real-time scoring
Safe aging: Real-time scoring every second
Theory
• Maintain your feature flow !
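One way to read "maintain your feature flow" is that batch and real-time scoring should compute features through the same code path, so the model sees identical inputs in both modes. The sketch below illustrates that idea with a hypothetical `FeatureFlow` class and a single rolling-mean feature; it is not the deployment code from the talk.

```python
from collections import deque
from statistics import mean


class FeatureFlow:
    """Single definition of the features, shared by batch and real-time scoring."""

    def __init__(self, window=3):
        self.buffer = deque(maxlen=window)

    def update(self, value):
        """Consume one new reading and return the feature row for that instant."""
        self.buffer.append(value)
        return {"current": value, "rolling_mean": mean(self.buffer)}


def batch_features(values, window=3):
    """Batch mode: replay stored readings through the same flow."""
    flow = FeatureFlow(window)
    return [flow.update(v) for v in values]


readings = [2.0, 4.0, 6.0, 8.0]
batch = batch_features(readings)

# Real-time mode: feed readings one at a time as they arrive
live = FeatureFlow()
streamed = [live.update(v) for v in readings]
```

Since both modes call `update`, a change to a feature definition applies to training data and live scoring at once.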
![Page 32: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/32.jpg)
Deploy
• Don’t underestimate real-life conditions
• Anomalies
• Headset in wrong position
• Bracelet on wrong hand
• Hardware / sensor deficiencies
Practice
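Problems like a misplaced headset or a failing sensor often show up as missing, flat, or out-of-range readings, so a cheap sanity check before scoring can help. The sketch below is a hypothetical example of such a check; the rules and thresholds are assumptions and would need tuning per device.

```python
def check_window(values, valid_range, min_spread=1e-6):
    """Return a list of problems found in one window of sensor readings."""
    problems = []
    present = [v for v in values if v is not None]
    if len(present) < len(values):
        problems.append("missing samples")
    if not present:
        return problems + ["no data"]
    low, high = valid_range
    if any(v < low or v > high for v in present):
        problems.append("out of range")
    if max(present) - min(present) < min_spread:
        problems.append("flat signal")
    return problems


print(check_window([1.0, 1.0, 1.0], valid_range=(-5, 5)))
print(check_window([0.1, None, 9.0], valid_range=(-5, 5)))
```

Windows that return a non-empty list could be skipped or flagged instead of being sent to the model.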
![Page 33: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/33.jpg)
Summary
![Page 34: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/34.jpg)
Summary
• IoT devices can improve early detection (epilepsy, falls, …)
• IoT devices produce lots of data: use databases made for IoT
• Standard workflow (acquire, visualize, prepare, model) can be replicated for IoT devices using open source software
• Differences between patients remain a challenge for prediction algorithms
IoT devices for medical applications
![Page 35: "Machine Learning and Internet of Things, the future of medical prevention", Pierre Gutierrez, Sr. Data Scientist at Dataiku](https://reader031.fdocuments.net/reader031/viewer/2022030305/58713ade1a28abf0568b6c91/html5/thumbnails/35.jpg)
@prrgutierrez