Richard Jackson - Big Data in Mental Health - 23rd July 2014

28
NATURAL LANGUAGE PROCESSING FOR INFORMATION EXTRACTION

description

Organised by the Bioinformatics group at the BRCMH, IoP, SLaM and Maudsley Digital, this symposium showcased talks regarding the important roles of big data in mental health biomedical research and treatments.

Transcript of Richard Jackson - Big Data in Mental Health - 23rd July 2014

Page 1: Richard Jackson - Big Data in Mental Health - 23rd July 2014

NATURAL LANGUAGE PROCESSING FOR

INFORMATION EXTRACTION

Page 2: Richard Jackson - Big Data in Mental Health - 23rd July 2014

SLAM Clinical records

~250 000 patient records 18 million free text documents Available for research via the CRIS

project

Page 3: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 4: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 5: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 6: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Information Extraction (IE)

Patient ID Diagnosis Age Address1 Depression 31 Flat 1, XYZ road

2 Cancer 67 2 Another Lane

3 Heart Attack 58 78 A place

Page 7: Richard Jackson - Big Data in Mental Health - 23rd July 2014

42%

58%

Unique Medications per Patient

69%

31%

Unique Diagnosis per Patient

11%

89%

Unique MMSE Scores per Patient

StructuredFree text

Page 8: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 9: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 10: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 11: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 12: Richard Jackson - Big Data in Mental Health - 23rd July 2014

TEXT HUNTER - CONCEPT

EXTRACTION SYSTEM

NEGATIVE SYMPTOMS CASE STUDY

Page 13: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Negative symptoms of psychosis Deficits of normal emotional behaviour

Social withdrawalAnhedonia (inability to experience pleasure)Poverty of speechEtc.

Less treatable by medication Greater affect on quality of life

Page 14: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 15: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Example sentences‘Patient X has poor eye contact’‘I assessed the patient on 01/03/12. I noted

that eye contact was poor’‘Saw patient X yesterday. Eye contact was

bad, even worse than before’

‘I spoke to patient X over the telephone, and was thus unable to assess eye contact’

‘Patient X presented with the same level of eye contact as on our last meeting’

Page 16: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Support Vector Machines

SVM produces hyperplane to classify unseen examples

Page 17: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 18: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 19: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 20: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 21: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 22: Richard Jackson - Big Data in Mental Health - 23rd July 2014
Page 23: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Outputs Excel/CSV Knowtator format (for Arc) Gate XML Direct to database

Page 24: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Psychosis Symptomatology

app P R F1Apathy 0.85 1 0.93

Blunted/Flat affect 1 0.74 0.84Concrete thinking 0.97 0.6 0.74

Emotional withdrawal 0.78 0.76 0.77Motivation 0.75 0.63 0.68

Poverty of speech 0.81 0.87 0.84Rapport 0.85 1 0.91

Social withdrawal 0.9 1  Anhedonia 0.96 0.83 0.89

Associations 1 0.87 0.94Circumstantial 0.9 1 0.94

Coherence 0.85 0.98 0.91Delusions 0.91 1 0.95Derailment 0.91 0.96 0.94

Flight of ideas 0.93 0.97 0.94Hallucinations 0.85 0.98 0.91Incoherence 0.82 0.99 0.9

Poverty of thought 0.92 0.96 0.94Tangential 0.92 1 0.95

Page 25: Richard Jackson - Big Data in Mental Health - 23rd July 2014

2007 2008 2009 2010 2011 2012 2013Month of Document_Date

0K

5K

10K

15K

20K

25K

30K

35K

40K

45K

50K

Number of Mentions

All Mentions

mlObservation1negative

unknown

positive

Page 26: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Conclusion User friendly concept extraction Open Source Designed for simple concepts

○ > 90% P○ > 80% R

Page 27: Richard Jackson - Big Data in Mental Health - 23rd July 2014

CRIS team Rob Stewart, Matthew Broadbent, Mike Denis Chin-Kuo Chang, Richard Hayes, Alex Tulloch,

Max Henderson, Gayan Perera Felicity Callard (Oversight Committee) Andrea Fernandes (Administrator) Ryan Little (data linkage) Hitesh Shetty (data extraction) Michael Ball (NLP specialist)

Page 28: Richard Jackson - Big Data in Mental Health - 23rd July 2014

Sheffield team Angus Roberts Genevieve Gorrell Ian Roberts Adam Funk Mark Greenwood