The Mythos of Model Interpretability -...

25
The Mythos of Model Interpretability Zachary C. Lipton https://arxiv.org/abs/1606.03490

Transcript of The Mythos of Model Interpretability -...

Page 1: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

The Mythos of Model Interpretability

Zachary C. Lipton

https://arxiv.org/abs/1606.03490

Page 2: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Outline

• What is interpretability?

• What are its desiderata?

• What model properties confer interpretability?

• Caveats, pitfalls, and takeaways

Page 3: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

What is Interpretability?

• Many papers make axiomatic claims that some model is interpretable and therefore preferable

• But what interpretability is and precisely what desiderata it serves are seldom defined

• Does interpretability hold consistent meaning across papers?

Page 4: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Inconsistent Definitions

• Papers use the words interpretable, explainable, intelligible, transparent, and understandable, both interchangeably (within papers) and inconsistently (across papers)

• One common thread, however, is that interpretability is something other than performance

Page 5: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

We want good models

Evaluation Metric

Page 6: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

We also want interpretable models

Evaluation Metric

Interpretation

Page 7: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

The Human Wants Something the Metric Doesn’t

Evaluation Metric

Interpretation

Page 8: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

What Gives?• So either the metric captures everything and people

seeking interpretable models are crazy or…

• The metric / loss functions we optimize are fundamentally mismatched from real life objectives

• We hope to refine the discourse on interpretability, introducing more specific language

• Through the lens of the literature, we create a taxonomy of both objectives & methods

Page 9: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Outline

• What is interpretability?

• What are its desiderata?

• What model properties confer interpretability?

• Caveats, pitfalls, and takeaways

Page 10: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Trust

• Does the model know when it’s uncertain?

• Does the model make same mistakes as human?

• Are we comfortable with the model?

Page 11: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Causality• We may want models to

tell us something about the natural world

• Supervised models are trained simply to make predictions, but often used to take actions

• Caruana (2015) shows a mortality predictor (for use in triage) that assigns lower risk to asthma patients

Page 12: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Transferability• The idealized training

setups often differ from real world

• Real problem may be non-stationary, noisier, etc.

• Want sanity-checks that the model doesn’t depend on weaknesses in setup

Page 13: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Informativeness• We may train a model

to make a decision

• But it’s real purpose is to aid a person in making a decision

• Thus an interpretationmay simply be valuable for the extra bits it carries

Page 14: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Outline

• What is interpretability?

• What are its desiderata?

• What model properties confer interpretability?

• Caveats, pitfalls, and takeaways

Page 15: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Transparency

• Proposed solutions conferring interpretability tend to fall into two categories

• Transparency addresses understanding how the model works

• Explainability concerns the model’s ability to offer some (potentially post-hoc) explanation

Page 16: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Simulatability• One notion of transparency

is simplicity

• This accords with papers advocating small decision trees

• A model is transparent if a person can step through the algorithm in reasonable time

Page 17: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Decomposability

• A relaxed notion requires understanding individualcomponents of a model

• Such as: weights of a linear model or the nodes of a decision tree

Page 18: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Transparent Algorithms

• A yet weaker notion would require onlythat we understand the behavior algorithm

• E.g. convergence of convex optimizations, generalization bounds

Page 19: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Post-Hoc Interpretability

Ah yes, something cool is happening in node 750,345,167… maybe it sees a cat?

Maybe we’ll see something awesome if we jiggle the inputs?

Page 20: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Verbal Explanations• Just as people generate

explanations (absent transparency), we might train a (possibly separate)model to generate explanations

• We might consider image captions as interpretations of object predictions

(Image: Karpathy et al 2015)

Page 21: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Saliency Maps

• While the full relationship between input and output might be impossible to describe succinctly, local explanations are potentially useful.

(Image: Wang et al 2016)

Page 22: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Case-Based Explanations• Another way to generate

a post-hoc explanation might be to retrieve labeled items that are deemed similar by the model

• For some models, we can retrieve histories from similar patients

(Image: Mikolov et al 2014)

Page 23: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Outline

• What is interpretability?

• What are its desiderata?

• What model properties confer interpretability?

• Caveats, pitfalls, and takeaways

Page 24: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Discussion Points

• Linear models not strictly more interpretable than deep learning

• Claims about interpretability must be qualified

• Transparency may be at odds with the goals of AI

• Post-hoc interpretations may potentially mislead

Page 25: The Mythos of Model Interpretability - zacklipton.comzacklipton.com/media/papers/mythos-of-model-interpretability... · The Mythos of Model Interpretability ... Transferability ...

Thanks!Acknowledgments:Zachary C. Lipton was supported by the Division of Biomedical Informatics at UCSD, via training grant (T15LM011271) from the NIH/NLM. Thanks to Charles Elkan, Julian McAuley, David Kale, Maggie Makar, Been Kim, Lihong Li, Rich Caruana, Daniel Fried, Jack Berkowitz, & Sepp Hochreiter

References:The Mythos of Model Interpretability (ICML Workshop on Human Interpretability 2016) - ZC Liptonhttp://arxiv.org/abs/1511.03677

Directly Modeling Missing Data with RNNs (MLHC 2016) - ZC Lipton, DC Kale, R Wetzelhttp://arxiv.org/abs/1606.04130

Learning to Diagnose (ICLR 2016) - ZC Lipton, DC Kale, Charles Elkan, R Wetzelhttp://arxiv.org/abs/1511.03677

Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. (2015) - R Caruana et alhttp://dl.acm.org/citation.cfm?id=2788613