Madhumita Sushil, Simon Šuster, Kim Luyckx, Walter Daelemans · 2020. 7. 16. · UCTO. CLIN28...

Unsupervised patient representations with interpretable classification decisions

Madhumita Sushil, Simon Šuster, Kim Luyckx, Walter Daelemans

CLIN28

Patient Representations

Task-independent, generalized semantic representations of patients, such that:

● Similar patients have similar representations
● Patient similarity encompasses the holistic patient condition


Patient Similarity


Pipeline - Representation Learning and Evaluation


icons by pictohaven, LAFS, ProSymbols, Lucas Almeida, Juraj Sedlák; thenounproject.com

MIMIC-III adult patients; min. 1 non-discharge note

25k-3k-3k (train-val-test)

1-879 notes, 14 categories

PATIENT DOCUMENT

PATIENT TOKENS

Avg. 13k tokens


Unsupervised Representation Learning

PATIENT TOKENS

BAG-OF-WORDS (BoW)

MEDICAL CONCEPTS + ASSERTION

BAG-OF-CONCEPTS (BoCUI)

Stacked Denoising Autoencoder (SDAE)

SDAE-BoW / SDAE-BoCUI Vectors
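
As a rough illustration of the stacked denoising autoencoder step, the sketch below trains sigmoid layers greedily on a binary bag-of-words matrix. The masking noise, layer sizes, learning rate, plain gradient descent, and the names `train_denoising_layer` and `train_sdae` are all assumptions made for this example; the slides do not specify the actual configuration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_denoising_layer(X, n_hidden, noise=0.3, lr=0.1, epochs=200, seed=0):
    """Train one denoising autoencoder layer with plain gradient descent.

    Returns the encoder parameters (W, b) and the per-epoch reconstruction loss.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = rng.normal(0.0, 0.1, (d, n_hidden))
    b = np.zeros(n_hidden)
    W_out = rng.normal(0.0, 0.1, (n_hidden, d))
    b_out = np.zeros(d)
    losses = []
    for _ in range(epochs):
        # Masking noise (an assumption): zero out a random subset of inputs.
        X_noisy = X * (rng.random(X.shape) > noise)
        H = sigmoid(X_noisy @ W + b)        # encode the corrupted input
        R = sigmoid(H @ W_out + b_out)      # decode
        err = R - X                         # compare against the clean input
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the squared error (summed over features, averaged over instances).
        dZ_out = 2.0 * err * R * (1.0 - R) / n
        dZ_hid = (dZ_out @ W_out.T) * H * (1.0 - H)
        W_out -= lr * (H.T @ dZ_out)
        b_out -= lr * dZ_out.sum(axis=0)
        W -= lr * (X_noisy.T @ dZ_hid)
        b -= lr * dZ_hid.sum(axis=0)
    return (W, b), losses

def train_sdae(X, layer_sizes, **kwargs):
    """Greedy layer-wise training: each layer is trained on the previous layer's codes."""
    encoder, layer_losses, inputs = [], [], X
    for size in layer_sizes:
        (W, b), losses = train_denoising_layer(inputs, size, **kwargs)
        encoder.append((W, b))
        layer_losses.append(losses)
        inputs = sigmoid(inputs @ W + b)    # encode the uncorrupted input for the next layer
    return encoder, inputs, layer_losses

# Toy usage: 50 "patients", 20 binary bag-of-words features.
toy_rng = np.random.default_rng(1)
X_toy = (toy_rng.random((50, 20)) < 0.2).astype(float)
encoder, codes, layer_losses = train_sdae(X_toy, [8, 4])
```

Here `codes` plays the role of the dense patient vectors; a real setup would use a much larger vocabulary and a proper optimizer.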


PATIENT TOKENS

DOC2VEC

DOC2VEC Vectors


DOC2VEC Vectors

SDAE-BoW Vectors

SDAE-BoW + DOC2VEC Vectors
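
One plausible reading of "SDAE-BoW + DOC2VEC" is a per-patient concatenation of the two vectors. The sketch below shows that reading only; the slides do not say how the vectors are combined, and `combine_representations` is a hypothetical helper name.

```python
import numpy as np

def combine_representations(sdae_vecs, doc2vec_vecs):
    """Concatenate two patient-by-dimension matrices along the feature axis.

    Assumes row k of both matrices describes the same patient.
    """
    sdae_vecs = np.asarray(sdae_vecs, dtype=float)
    doc2vec_vecs = np.asarray(doc2vec_vecs, dtype=float)
    if sdae_vecs.shape[0] != doc2vec_vecs.shape[0]:
        raise ValueError("both inputs must contain the same number of patients")
    return np.concatenate([sdae_vecs, doc2vec_vecs], axis=1)

# Toy usage: 3 patients, a 4-dimensional SDAE code and a 2-dimensional doc2vec vector.
combined = combine_representations(np.ones((3, 4)), np.zeros((3, 2)))
```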


Supervised Representation Evaluation

13% PATIENT DEATH

4% PATIENT DEATH

12% PATIENT DEATH

FEEDFORWARD NEURAL NETWORKS

IN-HOSPITAL MORTALITY

30 DAYS MORTALITY

1 YEAR MORTALITY

PRIMARY DIAGNOSTIC CATEGORY

PRIMARY PROCEDURAL CATEGORY

GENDER

57% MALE

40% MAJORITY

39% MAJORITY

Patient Representations


Results - Representation Evaluation

All generalized representation models significantly outperform sparse models when the number of positive instances is low (30-day mortality).


For all tasks except distant patient mortality, the BoW model is a strong baseline (strong lexical features are present).


Recommended to combine SDAE and doc2vec vectors for unknown tasks.


Model Interpretability

Critical for

● Error analysis
● Exploratory analysis

Goal: Quantitative explanation of a trained model without retraining


Feature Extraction - Pretraining Autoencoders

Rank features according to mean squared feature reconstruction error across all instances
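
The ranking described here can be sketched in a few lines. The sketch assumes the original inputs `X` and the autoencoder's reconstructions `X_rec` are available as instance-by-feature arrays; the function name and the toy values are illustrative only.

```python
import numpy as np

def rank_features_by_reconstruction_error(X, X_rec, feature_names):
    """Return (feature, error) pairs sorted by mean squared reconstruction
    error across all instances, highest error first."""
    X = np.asarray(X, dtype=float)
    X_rec = np.asarray(X_rec, dtype=float)
    mse = np.mean((X - X_rec) ** 2, axis=0)      # one error value per feature
    order = np.argsort(-mse, kind="stable")      # descending error
    return [(feature_names[j], float(mse[j])) for j in order]

# Toy usage: 2 instances, 3 features (hypothetical feature names).
ranking = rank_features_by_reconstruction_error(
    [[1, 0, 1], [0, 0, 1]],
    [[0.5, 0.0, 1.0], [0.0, 0.1, 0.0]],
    ["pain", "fever", "patient"],
)
```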


0.87-0.88 Spearman correlation coefficient between feature reconstruction error and frequency
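
A correlation of this kind could be computed roughly as follows. The sketch implements Spearman's coefficient as the Pearson correlation of average ranks and takes "frequency" to mean the number of instances in which a feature is non-zero; that definition and the helper names are assumptions, since the slides do not define frequency.

```python
import numpy as np

def average_ranks(values):
    """1-based ranks, with tied values sharing the mean of their ranks."""
    values = np.asarray(values, dtype=float)
    order = np.argsort(values, kind="stable")
    ranks = np.empty(len(values))
    ranks[order] = np.arange(1, len(values) + 1)
    for v in np.unique(values):
        tied = values == v
        ranks[tied] = ranks[tied].mean()
    return ranks

def spearman(a, b):
    """Spearman correlation: Pearson correlation between the two rank vectors."""
    return float(np.corrcoef(average_ranks(a), average_ranks(b))[0, 1])

def error_frequency_correlation(X, X_rec):
    """Correlate per-feature reconstruction error with per-feature frequency."""
    X = np.asarray(X, dtype=float)
    X_rec = np.asarray(X_rec, dtype=float)
    error = np.mean((X - X_rec) ** 2, axis=0)
    frequency = np.count_nonzero(X, axis=0)      # assumed definition of frequency
    return spearman(error, frequency)
```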

Feature entropy too high?


Finding Influential Features - Classification Phase

Gradient-based feature sensitivity calculation across two neural networks

For arbitrary number of instances and output classes

Transferable to different representation learning architectures


Feature Extraction - Classification Phase

Collecting all encoder layers of SDAE

Supervised classifier


Sensitivity

$S_{o_1, i_1}^{(1)} =$ [right-hand side of the equation not recovered from the slide]


Feature influence = Maximum root mean squared sensitivity across all instances
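
A minimal sketch of this computation follows. It assumes sensitivity means the partial derivative of each classifier output with respect to each original input feature, chained through the SDAE encoder layers and the classifier, and it reads the influence score as the root mean square over instances followed by a maximum over output classes. The sigmoid layers, that reading of the aggregation, and all function names are assumptions; the slide's own formula is not recoverable here.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, layers):
    """Apply a list of (W, b) sigmoid layers to one input vector."""
    a = x
    for W, b in layers:
        a = sigmoid(a @ W + b)
    return a

def sensitivity(x, encoder, classifier):
    """Jacobian d(output)/d(input) for one instance, shape (n_outputs, n_inputs).

    The chain rule is applied layer by layer across both networks.
    """
    J = np.eye(x.shape[0])
    a = x
    for W, b in encoder + classifier:
        a = sigmoid(a @ W + b)
        # Jacobian of this sigmoid layer w.r.t. its input, times what came before.
        J = ((a * (1.0 - a))[:, None] * W.T) @ J
    return J

def feature_influence(X, encoder, classifier):
    """One score per input feature: RMS sensitivity over instances,
    then the maximum over output classes (assumed aggregation order)."""
    S = np.stack([sensitivity(x, encoder, classifier) for x in X])
    rms = np.sqrt(np.mean(S ** 2, axis=0))       # (n_outputs, n_inputs)
    return rms.max(axis=0)

# Toy usage with random weights: 5 inputs, a 2-layer encoder, 2 output classes.
rng = np.random.default_rng(0)
encoder = [(rng.normal(size=(5, 4)), rng.normal(size=4)),
           (rng.normal(size=(4, 3)), rng.normal(size=3))]
classifier = [(rng.normal(size=(3, 2)), rng.normal(size=2))]
X_demo = rng.random((6, 5))
influence = feature_influence(X_demo, encoder, classifier)
```

Because only gradients of already trained networks are used, nothing is retrained, and the same chaining applies to any differentiable representation model.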


Most influential features for one instance each


Conclusions

Generalized patient representations help when the number of positive instances is low

Recommended to combine autoencoder and doc2vec representations

During representation learning, autoencoder has high reconstruction error for frequent terms, perhaps due to their high entropy

The most influential features for classification, however, are frequent terms, and the context of features plays a role.


Unsupervised patient representations from clinical notes with interpretable classification decisions

Madhumita Sushil, Simon Šuster, Kim Luyckx, Walter Daelemans

Workshop on Machine Learning for Health, NIPS 2017.

THANK YOU!


Results
